Sep. 17, 2025
Train a tiny model on Grade School Math
Dumb model trained on simple math = smarter model?
These are my ongoing experiments in building tiny, intelligent language models. Each one is documented, with its code, datasets, and results shared openly.
All experiments are reproducible: the code, data, and model weights are in the linked repositories.
Is it possible to train a language model to run on the RP2040? Yes. Dumb, but fast.