Phase 1 of the NeurIPS 2025 Weak Lensing ML Uncertainty Challenge has come to an end. We have announced the winners of the competition.
In the final evaluation, the final submitted models are tested on a new holdout dataset that contains new realizations of convergence maps obtained from (i) the new simulations with the original cosmological parameters in the training data, and (ii) the new simulations with the cosmological parameters that were not seen in the training data. The new parameters are sampled from the same prior distributions as the training data.
After extensive studies, we have decided to award the prizes based on the final rankings in three final leaderboards. The three leaderboards evaluate the model performance on (i), (ii), and the average of (i) and (ii), respectively, to reflect the strengths and robustness of each model.
| Rank | Particpant | Final Score | Mean MSE (Standerdized) | Mean Coverage |
|---|---|---|---|---|
| 1st | cmbagent | 11.7029 | 0.1033 | 0.7000 |
| 2nd | eiffl | 11.6535 | 0.1038 | 0.7087 |
| 3rd | Shubhojit | 11.5987 | 0.1032 | 0.6583 |
We will award the prizes to cmbagent, eiffl, and Shubhojit for extraordinary performance on the original cosmologies.
| Rank | Particpant | Final Score | Mean MSE (Standerdized) | Mean Coverage |
|---|---|---|---|---|
| 1st | Shubhojit | 11.3606 | 0.0968 | 0.6619 |
| 2nd Tie | THUML | 11.0511 | 0.1051 | 0.6733 |
| 2nd Tie | jagoncalves | 11.0367 | 0.1073 | 0.6683 |
| 2nd Tie | andry834 | 11.0014 | 0.1076 | 0.7228 |
| 2nd Tie | jhu_suicee | 10.9892 | 0.1067 | 0.6451 |
| 2nd Tie | eiffl | 10.9883 | 0.1074 | 0.6818 |
We recognize Shubhojit for the achievement in the best model generalization, with a score clearly separated from the other participants. The other five participants on the leaderboard cannot be separated in a significant way due to the limited samples of (ii).
| Rank | Particpant | Final Score | Mean MSE (Standerdized) | Mean Coverage |
|---|---|---|---|---|
| 1st | Shubhojit | 11.4796 | 0.1000 | 0.6601 |
| 2nd | eiffl | 11.3209 | 0.1056 | 0.6953 |
| 3rd | THUML | 11.2848 | 0.1060 | 0.6789 |
We will award the prizes to Shubhojit, eiffl, and THUML for demonstrating excellent performance on both new and old cosmologies.
All winners will present at our NeurIPS 2025 Competition Workshop:
We will prepare a paper documenting the top solutions with the winning teams after the workshop. The dataset will also be released permanently to serve as a permanent benchmark.
Congratulations to all the winning teams!
The large-scale structure of the universe—the cosmic web of galaxies, galaxy clusters, and dark matter spanning hundreds of millions of light-years—encodes essential information about the composition, evolution, and fundamental laws governing the cosmos. However, the majority of matter in the universe is dark matter, which does not interact with light and can only be observed indirectly through its gravitational effects. According to Einstein’s theory of general relativity, the gravitational field of this large-scale structure bends the path of light traveling through the universe. Weak gravitational lensing refers to the subtle, coherent distortions in the observed shapes of distant galaxies caused by the deflection of light as it traverses the inhomogeneous matter distribution of the universe. By statistically analyzing these distortions across large regions of the sky, weak lensing provides a powerful probe of the matter distribution and the underlying cosmological model that governs the expansion of the universe.
Traditional analysis based on two-point correlation functions can only capture limited amount of information from the weak lensing data (2D fields similar to images). To fully exploit the non-Gaussian features present in the cosmic web, higher-order statistics and modern machine learning (ML) methods have become increasingly important. These approaches, including deep learning and simulation-based inference, have been shown to extract significant more information in weak lensing maps than traditional techniques. However, different analyses assume different dataset setups and lead to different results, making it hard to directly compare with existing approaches. Furthermore, most (if not all) of these methods rely heavily on simulations that may not accurately represent real data due to modeling approximations and missing systematics.
This competition is motivated by the need to quantify and compare the information content that different analysis methods—ranging from classical statistics to ML-based models—can extract from weak lensing maps, while also evaluating their robustness to simulation inaccuracies and observational systematics.
The outcomes of this competition are expected to guide the development of next-generation weak lensing analysis pipelines, foster cross-disciplinary collaboration between the astrophysics and machine learning communities, and ultimately improve the reliability of cosmological inference from current and upcoming surveys such as LSST, Euclid, and the Roman Space Telescope. By explicitly addressing simulation-model mismatch and the need to quantify systematic uncertainties, this competition emphasizes scientific robustness and interpretability, aligning with the growing emphasis on trustworthy ML in scientific domains.
We gratefully acknowledge the efforts of the FAIR Universe Team, composed of researchers dedicated to advancing high energy physics, cosmology, and machine learning for the benefit of the scientific community.
For inquiries, please contact us at: fair-universe@lbl.gov
Find all the competition resources below:
| Rank | Username | Submission ID | Score | MSE | R² | Coverage | Method Name |
|---|---|---|---|---|---|---|---|
| 1 | cmbagent (team) | 424324 | 11.7321 | 0.1022 | 0.8958 | 0.7056 | tomborrett_cmbagent |
| 2 | eiffl (team) | 424310 | 11.6612 | 0.1028 | 0.8952 | 0.7095 | b-remy_e36uxobt |
| 3 | shubhojit | 417744 | 11.6192 | 0.1025 | 0.8956 | 0.6583 | CNNv48_ |
| 4 | THUML (team) | 424064 | 11.5209 | 0.1064 | 0.8916 | 0.6907 | F1 |
| 5 | adscft | 424300 | 11.5142 | 0.1056 | 0.8924 | 0.7279 | cnnv11152 |
| 6 | piyush555 | 394590 | 11.4681 | 0.1077 | 0.8902 | 0.7103 | dns_l |
| 7 | jhu_suicee | 423555 | 11.4590 | 0.1077 | 0.8903 | 0.6512 | STILI |
| 8 | azhang81 | 424251 | 11.4192 | 0.1098 | 0.8882 | 0.7017 | m6 |
| 9 | mmayr | 424022 | 11.2759 | 0.1133 | 0.8846 | 0.7080 | NL128_LD512_NB2_NSA6_NH16_LR2e4_BS32_NP200_VPR00_noPE_10ens |
| 10 | jagoncalves | 418273 | 11.2437 | 0.1160 | 0.8818 | 0.6835 | 20251008_173243 |
| 11 | DOT (team) | 422646 | 11.2203 | 0.1160 | 0.8818 | 0.7016 | CNN |
| 12 | andry834 | 422647 | 11.1691 | 0.1161 | 0.8817 | 0.7097 | 25-11-14-03-31 |
| 13 | hnayak | 413583 | 11.1505 | 0.1177 | 0.8801 | 0.7130 | updated3 |
| 14 | Yuedong (team) | 422666 | 11.1385 | 0.1197 | 0.8780 | 0.6775 | 25-11-13-21-12 |
| 15 | deleted_user_34276 | 423158 | 11.1220 | 0.1174 | 0.8804 | 0.6903 | 777777777 |
| 16 | uccaeid | 424311 | 11.1220 | 0.1174 | 0.8804 | 0.6903 | 000000000 |
| 17 | farhankhan | 423146 | 11.0931 | 0.1188 | 0.8790 | 0.6871 | 777777 |
| 18 | ahajighasem | 422359 | 10.9727 | 0.1219 | 0.8758 | 0.7026 | 444 |
| 19 | MaDESYn (team) | 402355 | 10.9509 | 0.1235 | 0.8741 | 0.7201 | argustarv1 |
| 20 | young_hee | 409010 | 10.9447 | 0.1225 | 0.8752 | 0.7037 | model8 |
| 21 | mshamani | 422137 | 10.9268 | 0.1232 | 0.8745 | 0.6876 | 4444 |
| 22 | sakib | 375549 | 10.8968 | 0.1215 | 0.8762 | 0.7216 | CNN |
| 23 | satyamk | 424302 | 10.8300 | 0.1283 | 0.8693 | 0.6970 | tta |
| 24 | hao_chun_liang | 403355 | 10.7856 | 0.1265 | 0.8711 | 0.6934 | ensemble_calibrated_exp6_20251027_230608 |
| 25 | Isaul team | 424320 | 10.7320 | 0.1299 | 0.8677 | 0.7017 | cnnv11 |
| 26 | ibrahime | 395630 | 10.6497 | 0.1313 | 0.8663 | 0.7115 | Ens |
| 27 | zdu863 | 416459 | 10.6023 | 0.1310 | 0.8665 | 0.7412 | CNN-06 |
| 28 | ehernandez | dev | 10.5582 | 0.1317 | 0.8659 | 0.6421 | greedy_jan |
| 29 | klinjin | 423847 | 10.5465 | 0.1325 | 0.8650 | 0.7296 | CNN_clean_V16 |
| 30 | exaltedjoseph | 386759 | 10.5397 | 0.1335 | 0.8640 | 0.6860 | op-muy |
| 31 | nthday_jpg | 403758 | 10.4075 | 0.1346 | 0.8629 | 0.6155 | resnetlite |
| 32 | chenghantsai | 385173 | 10.3880 | 0.1341 | 0.8634 | 0.7980 | Optuna CNN 515 depth |
| 33 | sher_s | 413038 | 10.3542 | 0.1388 | 0.8586 | 0.6709 | 20251103_193017 |
| 34 | dingjch | 424010 | 10.2044 | 0.1414 | 0.8560 | 0.6071 | N32_R01-0039_comb05grps_SBI-10-4 |
| 35 | saibaster | 394896 | 9.9422 | 0.1563 | 0.8407 | 0.7021 | xgb |
| 36 | kunhaoz | 423707 | 9.9117 | 0.1515 | 0.8457 | 0.6775 | cnn |
| 37 | limabar | 422717 | 9.7956 | 0.1595 | 0.8376 | 0.6964 | cnn_mcmc1 |
| 38 | faithful | 404896 | 9.7855 | 0.1628 | 0.8342 | 0.7121 | newcnnv8 |
| 39 | vodezhaw | 424052 | 9.7018 | 0.1499 | 0.8473 | 0.8327 | ResnetDirect+Calib+Ensemble |
| 40 | bmdlln | 416985 | 9.6539 | 0.1499 | 0.8473 | 0.6774 | test |
| 41 | pajucg2 | 379685 | 9.4793 | 0.1648 | 0.8321 | 0.7889 | cnn |
| 42 | Trung Le (team) | 375155 | 9.2332 | 0.1765 | 0.8202 | 0.6717 | OkeMCMC |
| 43 | ssunao | 386814 | 9.1654 | 0.1749 | 0.8218 | 0.6304 | angular power spectrum + second third, and fourth order aperture mass statistics |
| 44 | idkastro | 422443 | 9.1589 | 0.1833 | 0.8133 | 0.6302 | dummy_submission |
| 45 | pruhlman | 405469 | 9.1346 | 0.1763 | 0.8204 | 0.7060 | test |
| 46 | aravindsugu | 377027 | 8.8820 | 0.1870 | 0.8096 | 0.7019 | v2 |
| 47 | hzume | 398085 | 8.8059 | 0.1745 | 0.8223 | 0.5871 | exp001-sub3 |
| 48 | pwchang | 363706 | 8.6809 | 0.1949 | 0.8015 | 0.6909 | Baseline_CNN_MCMC_it100k_006_new_data_score |
| 49 | maojiexu | 423787 | 8.4055 | 0.2060 | 0.7902 | 0.6455 | test_official |
| 50 | fudawei | 393359 | 8.3463 | 0.1963 | 0.8001 | 0.5693 | CNN direct |
| 51 | TJHSST (team) | 420595 | 7.7703 | 0.2226 | 0.7733 | 0.6996 | Trial Sub |
| 52 | mathfan2020 | 399432 | 7.7054 | 0.2126 | 0.7834 | 0.5739 | CNNv3 |
| 53 | carmenmisa | 367924 | 4.6266 | 0.3472 | 0.6464 | 0.6737 | Phase 1 Starting Kit |
| 54 | ragansu | 357613 | 4.6012 | 0.3483 | 0.6453 | 0.6736 | model_starting_kit |
| 55 | ravalin2 | 421539 | -6.4842 | 0.6444 | 0.3439 | 0.3755 | regvae_local |
| 56 | sijil_jose | 424309 | -29.3209 | 0.9686 | 0.0136 | 0.3140 | neural_ratio |
| 57 | ayush_killer | 420217 | -34.3385 | 3.7129 | -2.7836 | 0.8472 | CNN |
| 58 | suqik | 424008 | -38.4488 | 1.9372 | -0.9728 | 0.3745 | hybrid_summary_v1 |
| 59 | garyhornbill28 | 369950 | -1854.2724 | 163.5679 | -165.6699 | 0.7210 | CNN_2 |
| 60 | kadidiakonate | 384213 | -3795.6503 | 27.1434 | -26.6566 | 0.6766 | dummy submission |
| 61 | lupodda | 367070 | -56532.8470 | 4325.4582 | -4405.5530 | 0.9149 | DummySubmission |
| 62 | rishi | 372606 | -56532.8470 | 4325.4582 | -4405.5530 | 0.9149 | registration |
| 63 | nithin2003 | 422946 | -56532.8470 | 4325.4582 | -4405.5530 | 0.9149 | Metadata |