Dataset — MatSci-FiB-200
MatSci-FiB-200 is the full MatSci Fill-in-the-Blank benchmark used to evaluate Minerva V2 and 7 other models. It consists of 200 questions drawn directly from 4 materials science textbooks.
Use the filters below to browse by difficulty or domain. Click any row to expand the ground truth.