FormationEval Leaderboard
72 models evaluated on 505 petroleum geoscience MCQs
March 2026 update: FormationEval now also includes the imported DISKOS-QA and SPE MCQ tracks. This Space still displays results for the evaluated MCQ v0.1 track only. A full rerun on the expanded suite is pending: this is a self-funded, one-person project, and evaluating the expanded suite requires materially more token spend.
Company