Dyson spheres are hypothetical megastructures proposed to partially or fully enclose a star and reprocess its radiation, producing mid-infrared excesses that may appear as rare anomalies in large stellar catalogues. No confirmed Dyson sphere is known, but the systematic prioritization of stars with unexplained photometric behaviour offers a practical way to support targeted astrophysical follow-up. In this work, we present a machine learning framework for the prioritization of potential Dyson sphere candidates. Rather than aiming at physical confirmation, the proposed approach ranks stars according to their similarity to a small set of literature candidates and to their deviation from well characterised normal stellar populations. The method is based on a symmetric two-phase learning scheme that integrates anomaly detection, metric-based few-shot learning and ensemble classifiers within a unified probabilistic ranking pipeline. Applied to Gaia DR3 and AllWISE photometry, the framework recovers previously reported mid-infrared excess sources and highlights new high-priority outliers that currently lack straightforward astrophysical interpretations. Among the tested models, isolation forests and prototypical networks with hard negative mining provide the most stable and effective rankings. These results indicate that data-driven prioritization can support large-scale technosignature searches by identifying anomalous stars that merit closer astrophysical investigation.

Machine learning for the prioritization of Dyson sphere candidates

Paolo Mignone
;
2026-01-01

Abstract

Dyson spheres are hypothetical megastructures proposed to partially or fully enclose a star and reprocess its radiation, producing mid-infrared excesses that may appear as rare anomalies in large stellar catalogues. No confirmed Dyson sphere is known, but the systematic prioritization of stars with unexplained photometric behaviour offers a practical way to support targeted astrophysical follow-up. In this work, we present a machine learning framework for the prioritization of potential Dyson sphere candidates. Rather than aiming at physical confirmation, the proposed approach ranks stars according to their similarity to a small set of literature candidates and to their deviation from well characterised normal stellar populations. The method is based on a symmetric two-phase learning scheme that integrates anomaly detection, metric-based few-shot learning and ensemble classifiers within a unified probabilistic ranking pipeline. Applied to Gaia DR3 and AllWISE photometry, the framework recovers previously reported mid-infrared excess sources and highlights new high-priority outliers that currently lack straightforward astrophysical interpretations. Among the tested models, isolation forests and prototypical networks with hard negative mining provide the most stable and effective rankings. These results indicate that data-driven prioritization can support large-scale technosignature searches by identifying anomalous stars that merit closer astrophysical investigation.
2026
Astroinformatics,Dyson sphere,Technosignatures,Machine learning,Gaia,WISE
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12607/76925
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact