TY - JOUR
T1 - Gaia GraL: Gaia gravitational lens systems
T2 - IX. Using XGBoost to explore the Gaia Focused Product Release GravLens catalogue
AU - Petit, Quentin
AU - Ducourant, Christine
AU - Slezak, Eric
AU - Krone-Martins, Alberto
AU - Bœhm, Céline
AU - Connor, Thomas
AU - Delchambre, Ludovic
AU - Djorgovski, S. G.
AU - Galluccio, Laurent
AU - Graham, Matthew J.
AU - Jalan, Priyanka
AU - Klioner, Sergei A.
AU - Klüter, Jonas
AU - Mignard, François
AU - Negi, Vibhore
AU - Jr, Sergio Scarano
AU - den Brok, Jakob Sebastian
AU - Sluse, Dominique
AU - Stern, Daniel
AU - Surdej, Jean
AU - Teixeira, Ramachrisna
AU - Vale-Cunha, P. H.
AU - Walton, Dominic J.
AU - Wambsganss, Joachim
N1 - © 2025 The Author(s). This is an open access article distributed under the Creative Commons Attribution License (CC BY), https://creativecommons.org/licenses/by/4.0/
PY - 2025/4/2
Y1 - 2025/4/2
N2 - Aims. Quasar strong gravitational lenses are important tools for putting constraints on the dark matter distribution, dark energy contribution, and the Hubble-Lemaître parameter. We aim to present a new supervised machine learning-based method to identify these lenses in large astrometric surveys. The Gaia Focused Product Release (FPR) GravLens catalogue is designed for the identification of multiply imaged quasars, as it provides astrometry and photometry of all sources in the field of 4.7 million quasars. Methods. Our new approach for automatically identifying four-image lens configurations in large catalogues is based on the eXtreme Gradient Boosting classification algorithm. To train this supervised algorithm, we performed realistic simulations of lenses with four images that account for the statistical distribution of the morphology of the deflecting halos as measured in the EAGLE simulation. We identified the parameters discriminant for the classification and performed two different trainings, namely, with and without distance information. Results. The performances of this method on the simulated data are quite good, with a true positive rate and a true negative rate of about 99.99% and 99.84%, respectively. Our validation of the method on a small set of known quasar lenses demonstrates its efficiency, with 75% of known lenses being correctly identified. We applied our algorithm (both trainings) to more than 0.9 million quadruplets selected from the Gaia FPR GravLens catalogue. We derived a list of 1127 candidates with at least one score larger than 0.75, where each candidate has two scores-one from the model trained with distance information and one from the model trained without distance information-and including 201 very good candidates with both high scores.
AB - Aims. Quasar strong gravitational lenses are important tools for putting constraints on the dark matter distribution, dark energy contribution, and the Hubble-Lemaître parameter. We aim to present a new supervised machine learning-based method to identify these lenses in large astrometric surveys. The Gaia Focused Product Release (FPR) GravLens catalogue is designed for the identification of multiply imaged quasars, as it provides astrometry and photometry of all sources in the field of 4.7 million quasars. Methods. Our new approach for automatically identifying four-image lens configurations in large catalogues is based on the eXtreme Gradient Boosting classification algorithm. To train this supervised algorithm, we performed realistic simulations of lenses with four images that account for the statistical distribution of the morphology of the deflecting halos as measured in the EAGLE simulation. We identified the parameters discriminant for the classification and performed two different trainings, namely, with and without distance information. Results. The performances of this method on the simulated data are quite good, with a true positive rate and a true negative rate of about 99.99% and 99.84%, respectively. Our validation of the method on a small set of known quasar lenses demonstrates its efficiency, with 75% of known lenses being correctly identified. We applied our algorithm (both trainings) to more than 0.9 million quadruplets selected from the Gaia FPR GravLens catalogue. We derived a list of 1127 candidates with at least one score larger than 0.75, where each candidate has two scores-one from the model trained with distance information and one from the model trained without distance information-and including 201 very good candidates with both high scores.
KW - astro-ph.GA
KW - Galaxy: halo
KW - Gravitational lensing: strong
KW - Methods: data analysis
UR - http://www.scopus.com/inward/record.url?scp=105002367489&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/202451690
DO - 10.1051/0004-6361/202451690
M3 - Article
SN - 0004-6361
VL - 696
SP - 1
EP - 13
JO - Astronomy & Astrophysics
JF - Astronomy & Astrophysics
M1 - A51
ER -