FAIR Universe: Weak Lensing Uncertainty Challenge

Winners

Phase 1 of the NeurIPS 2025 Weak Lensing ML Uncertainty Challenge has come to an end. We have announced the winners of the competition.

In the final evaluation, the final submitted models are tested on a new holdout dataset that contains new realizations of convergence maps obtained from (i) the new simulations with the original cosmological parameters in the training data, and (ii) the new simulations with the cosmological parameters that were not seen in the training data. The new parameters are sampled from the same prior distributions as the training data.

After extensive studies, we have decided to award the prizes based on the final rankings in three final leaderboards. The three leaderboards evaluate the model performance on (i), (ii), and the average of (i) and (ii), respectively, to reflect the strengths and robustness of each model.

1. Final leaderboard evaluated solely on (i):

Rank Particpant Final Score Mean MSE (Standerdized) Mean Coverage
1st cmbagent 11.7029 0.1033 0.7000
2nd eiffl 11.6535 0.1038 0.7087
3rd Shubhojit 11.5987 0.1032 0.6583

We will award the prizes to cmbagent, eiffl, and Shubhojit for extraordinary performance on the original cosmologies.

  • cmbagent: team members Erwan Allys, Boris Bolliet, Tom Borret, Celia Lecat, Andy Nilipour, Sebastien Pierre, Licong Xu
  • eiffl - Transatlantic Dream Team: team members Noe Dia, Sacha Guerrini, Wassim Kablan, François Lanusse, Julia Linhart, Laurence Perreault-Levasseur, Benjamin Remy, Sammy Sharieff, Andreas Tersenov, Justine Zeghal
  • shubhojit - Shubhojit Naskar

2. Final leaderboard evaluated solely on (ii):

Rank Particpant Final Score Mean MSE (Standerdized) Mean Coverage
1st Shubhojit 11.3606 0.0968 0.6619
2nd Tie THUML 11.0511 0.1051 0.6733
2nd Tie jagoncalves 11.0367 0.1073 0.6683
2nd Tie andry834 11.0014 0.1076 0.7228
2nd Tie jhu_suicee 10.9892 0.1067 0.6451
2nd Tie eiffl 10.9883 0.1074 0.6818

We recognize Shubhojit for the achievement in the best model generalization, with a score clearly separated from the other participants. The other five participants on the leaderboard cannot be separated in a significant way due to the limited samples of (ii).

3. Final leaderboard from the average of the score obtained on (i) and (ii):

Rank Particpant Final Score Mean MSE (Standerdized) Mean Coverage
1st Shubhojit 11.4796 0.1000 0.6601
2nd eiffl 11.3209 0.1056 0.6953
3rd THUML 11.2848 0.1060 0.6789

We will award the prizes to Shubhojit, eiffl, and THUML for demonstrating excellent performance on both new and old cosmologies.

  • THUML: team members Mingsheng Long, Yuezhou Ma, Yuanxu Sun, Huikun Weng, Haixu Wu, Hang Zhou, Haonan Shangguan

All winners will present at our NeurIPS 2025 Competition Workshop:

We will prepare a paper documenting the top solutions with the winning teams after the workshop. The dataset will also be released permanently to serve as a permanent benchmark.

Congratulations to all the winning teams!

About the Challenge

The large-scale structure of the universe—the cosmic web of galaxies, galaxy clusters, and dark matter spanning hundreds of millions of light-years—encodes essential information about the composition, evolution, and fundamental laws governing the cosmos. However, the majority of matter in the universe is dark matter, which does not interact with light and can only be observed indirectly through its gravitational effects. According to Einstein’s theory of general relativity, the gravitational field of this large-scale structure bends the path of light traveling through the universe. Weak gravitational lensing refers to the subtle, coherent distortions in the observed shapes of distant galaxies caused by the deflection of light as it traverses the inhomogeneous matter distribution of the universe. By statistically analyzing these distortions across large regions of the sky, weak lensing provides a powerful probe of the matter distribution and the underlying cosmological model that governs the expansion of the universe.

Traditional analysis based on two-point correlation functions can only capture limited amount of information from the weak lensing data (2D fields similar to images). To fully exploit the non-Gaussian features present in the cosmic web, higher-order statistics and modern machine learning (ML) methods have become increasingly important. These approaches, including deep learning and simulation-based inference, have been shown to extract significant more information in weak lensing maps than traditional techniques. However, different analyses assume different dataset setups and lead to different results, making it hard to directly compare with existing approaches. Furthermore, most (if not all) of these methods rely heavily on simulations that may not accurately represent real data due to modeling approximations and missing systematics.

This competition is motivated by the need to quantify and compare the information content that different analysis methods—ranging from classical statistics to ML-based models—can extract from weak lensing maps, while also evaluating their robustness to simulation inaccuracies and observational systematics.

The outcomes of this competition are expected to guide the development of next-generation weak lensing analysis pipelines, foster cross-disciplinary collaboration between the astrophysics and machine learning communities, and ultimately improve the reliability of cosmological inference from current and upcoming surveys such as LSST, Euclid, and the Roman Space Telescope. By explicitly addressing simulation-model mismatch and the need to quantify systematic uncertainties, this competition emphasizes scientific robustness and interpretability, aligning with the growing emphasis on trustworthy ML in scientific domains.

Organizers

We gratefully acknowledge the efforts of the FAIR Universe Team, composed of researchers dedicated to advancing high energy physics, cosmology, and machine learning for the benefit of the scientific community.

For inquiries, please contact us at: fair-universe@lbl.gov

  • Wahid Bhimji
  • Paolo Calafiura
  • Ragansu Chakkappai
  • Po-Wen Chang
  • Yuan-Tang Chou
  • Biwei Dai
  • Sascha Diefenbacher
  • Steven Farrell
  • Aishik Ghosh
  • Isabelle Guyon
  • Chris Harris
  • Shih-Chieh Hsu
  • Elham Khoda
  • Benjamin Nachman
  • Peter Nugent
  • David Rousseau
  • Uros Seljak
  • Benjamin Thorne
  • Ihsan Ullah
  • Daniel Whiteson
  • Yulei Zhang

Competition Material

Find all the competition resources below:

Phase 1 Codabench Leaderboard

Last updated: November 17, 2025 at 02:05 PM UTC
Rank Username Submission ID Score MSE Coverage Method Name
1 cmbagent (team) 424324 11.7321 0.1022 0.8958 0.7056 tomborrett_cmbagent
2 eiffl (team) 424310 11.6612 0.1028 0.8952 0.7095 b-remy_e36uxobt
3 shubhojit 417744 11.6192 0.1025 0.8956 0.6583 CNNv48_
4 THUML (team) 424064 11.5209 0.1064 0.8916 0.6907 F1
5 adscft 424300 11.5142 0.1056 0.8924 0.7279 cnnv11152
6 piyush555 394590 11.4681 0.1077 0.8902 0.7103 dns_l
7 jhu_suicee 423555 11.4590 0.1077 0.8903 0.6512 STILI
8 azhang81 424251 11.4192 0.1098 0.8882 0.7017 m6
9 mmayr 424022 11.2759 0.1133 0.8846 0.7080 NL128_LD512_NB2_NSA6_NH16_LR2e4_BS32_NP200_VPR00_noPE_10ens
10 jagoncalves 418273 11.2437 0.1160 0.8818 0.6835 20251008_173243
11 DOT (team) 422646 11.2203 0.1160 0.8818 0.7016 CNN
12 andry834 422647 11.1691 0.1161 0.8817 0.7097 25-11-14-03-31
13 hnayak 413583 11.1505 0.1177 0.8801 0.7130 updated3
14 Yuedong (team) 422666 11.1385 0.1197 0.8780 0.6775 25-11-13-21-12
15 deleted_user_34276 423158 11.1220 0.1174 0.8804 0.6903 777777777
16 uccaeid 424311 11.1220 0.1174 0.8804 0.6903 000000000
17 farhankhan 423146 11.0931 0.1188 0.8790 0.6871 777777
18 ahajighasem 422359 10.9727 0.1219 0.8758 0.7026 444
19 MaDESYn (team) 402355 10.9509 0.1235 0.8741 0.7201 argustarv1
20 young_hee 409010 10.9447 0.1225 0.8752 0.7037 model8
21 mshamani 422137 10.9268 0.1232 0.8745 0.6876 4444
22 sakib 375549 10.8968 0.1215 0.8762 0.7216 CNN
23 satyamk 424302 10.8300 0.1283 0.8693 0.6970 tta
24 hao_chun_liang 403355 10.7856 0.1265 0.8711 0.6934 ensemble_calibrated_exp6_20251027_230608
25 Isaul team 424320 10.7320 0.1299 0.8677 0.7017 cnnv11
26 ibrahime 395630 10.6497 0.1313 0.8663 0.7115 Ens
27 zdu863 416459 10.6023 0.1310 0.8665 0.7412 CNN-06
28 ehernandez dev 10.5582 0.1317 0.8659 0.6421 greedy_jan
29 klinjin 423847 10.5465 0.1325 0.8650 0.7296 CNN_clean_V16
30 exaltedjoseph 386759 10.5397 0.1335 0.8640 0.6860 op-muy
31 nthday_jpg 403758 10.4075 0.1346 0.8629 0.6155 resnetlite
32 chenghantsai 385173 10.3880 0.1341 0.8634 0.7980 Optuna CNN 515 depth
33 sher_s 413038 10.3542 0.1388 0.8586 0.6709 20251103_193017
34 dingjch 424010 10.2044 0.1414 0.8560 0.6071 N32_R01-0039_comb05grps_SBI-10-4
35 saibaster 394896 9.9422 0.1563 0.8407 0.7021 xgb
36 kunhaoz 423707 9.9117 0.1515 0.8457 0.6775 cnn
37 limabar 422717 9.7956 0.1595 0.8376 0.6964 cnn_mcmc1
38 faithful 404896 9.7855 0.1628 0.8342 0.7121 newcnnv8
39 vodezhaw 424052 9.7018 0.1499 0.8473 0.8327 ResnetDirect+Calib+Ensemble
40 bmdlln 416985 9.6539 0.1499 0.8473 0.6774 test
41 pajucg2 379685 9.4793 0.1648 0.8321 0.7889 cnn
42 Trung Le (team) 375155 9.2332 0.1765 0.8202 0.6717 OkeMCMC
43 ssunao 386814 9.1654 0.1749 0.8218 0.6304 angular power spectrum + second third, and fourth order aperture mass statistics
44 idkastro 422443 9.1589 0.1833 0.8133 0.6302 dummy_submission
45 pruhlman 405469 9.1346 0.1763 0.8204 0.7060 test
46 aravindsugu 377027 8.8820 0.1870 0.8096 0.7019 v2
47 hzume 398085 8.8059 0.1745 0.8223 0.5871 exp001-sub3
48 pwchang 363706 8.6809 0.1949 0.8015 0.6909 Baseline_CNN_MCMC_it100k_006_new_data_score
49 maojiexu 423787 8.4055 0.2060 0.7902 0.6455 test_official
50 fudawei 393359 8.3463 0.1963 0.8001 0.5693 CNN direct
51 TJHSST (team) 420595 7.7703 0.2226 0.7733 0.6996 Trial Sub
52 mathfan2020 399432 7.7054 0.2126 0.7834 0.5739 CNNv3
53 carmenmisa 367924 4.6266 0.3472 0.6464 0.6737 Phase 1 Starting Kit
54 ragansu 357613 4.6012 0.3483 0.6453 0.6736 model_starting_kit
55 ravalin2 421539 -6.4842 0.6444 0.3439 0.3755 regvae_local
56 sijil_jose 424309 -29.3209 0.9686 0.0136 0.3140 neural_ratio
57 ayush_killer 420217 -34.3385 3.7129 -2.7836 0.8472 CNN
58 suqik 424008 -38.4488 1.9372 -0.9728 0.3745 hybrid_summary_v1
59 garyhornbill28 369950 -1854.2724 163.5679 -165.6699 0.7210 CNN_2
60 kadidiakonate 384213 -3795.6503 27.1434 -26.6566 0.6766 dummy submission
61 lupodda 367070 -56532.8470 4325.4582 -4405.5530 0.9149 DummySubmission
62 rishi 372606 -56532.8470 4325.4582 -4405.5530 0.9149 registration
63 nithin2003 422946 -56532.8470 4325.4582 -4405.5530 0.9149 Metadata