Open
Research

We believe in the power of open science. NoteFill actively contributes to the AI research community by releasing high-quality educational datasets.

Multimodal

ck12-tqa-multimodal

A comprehensive dataset for textbook question answering, featuring multimodal contexts including diagrams, charts, and text from CK-12 curriculum.

View on Hugging Face
Reasoning

gsm8k-instruction

An instruction-tuned variation of the GSM8K dataset, designed to improve mathematical reasoning capabilities in large language models.

View on Hugging Face