Writing
Blog
Notes, essays, and tutorials on machine learning, NLP, and the digital humanities.
-
Reading the Cluster at a Glance with Yale SLURM Utils
A small command-line tool that turns dense SLURM output into a readable, live dashboard — and why that matters when you share a supercomputer to train and run large language models.
Read post -
Dissolving a 398-Node Geographic Cycle with Gemini Flash Lite
How we found, visualized, and automatically fixed a massive circular reference in the LUX places hierarchy — for about three cents.
Read post -
Whisper Achieves 85% Accuracy on Holocaust Testimonies in Yale's Fortunoff Archive
We evaluated Whisper on 1,847 Holocaust testimonies and found 85 percent accuracy, though the model routinely normalizes raw speech and heritage spellings.
Read post -
Parsing 3.6 Million Historical Names with Small Models
We moved from expensive frontier AI to fine-tuned Qwen 3.5 models to parse historical data, achieving 96% accuracy by switching from JSON to YAML.
Read post