Open
Research
We believe in the power of open science. NoteFill actively contributes to the AI research community by releasing high-quality educational datasets.
Multimodal
ck12-tqa-multimodal
A comprehensive dataset for textbook question answering, featuring multimodal contexts including diagrams, charts, and text from CK-12 curriculum.
Reasoning
gsm8k-instruction
An instruction-tuned variation of the GSM8K dataset, designed to improve mathematical reasoning capabilities in large language models.