Writing
Blog
Notes, essays, and tutorials on machine learning, NLP, and the digital humanities.
-
Dissolving a 398-Node Geographic Cycle with Gemini Flash Lite
How we found, visualized, and automatically fixed a massive circular reference in the LUX places hierarchy — for about three cents.
Read post -
Whisper Achieves 85% Accuracy on Holocaust Testimonies in Yale's Fortunoff Archive
We evaluated Whisper on 1,847 Holocaust testimonies and found 85 percent accuracy, though the model routinely normalizes raw speech and heritage spellings.
Read post -
Parsing 3.6 Million Historical Names with Small Models
We moved from expensive frontier AI to fine-tuned Qwen 3.5 models to parse historical data, achieving 96% accuracy by switching from JSON to YAML.
Read post