There is no single home for the open K-12 essay and writing corpora that AI grading research is built on. Datasets live on Kaggle, Figshare, GitHub, ArXiv, lab sites, and state vendors. K12Eval maintains this list as a public service. Free to use. Credited to original authors. Updated as new corpora drop.
Each dataset is credited to its original authors; K12Eval only curates and links. If a link breaks or a corpus moves, please let us know.
Suggest a datasetThe aggregated list above is a free public service. The rest of K12Eval, the production corpus, the IRR-validated subsets, the shared methodology, and the public leaderboard, picks up where these legacy datasets stop.