Articles
ML, neuroscience, and other stuff I'm thinking about.
Serving Models Across 4 GPUs Without Losing Your Mind
2026-04-12 How I stopped fighting with sequential model loading on a multi-GPU VM and built a simple job queue with uvicorn workers instead.
Running Minecraft on a Kubernetes Cluster with Coding Agents
2026-04-11 I built a system that streams Minecraft from L4 GPUs in Kubernetes pods, with an AI agent inside the container that writes Java plugins, compiles them, and tests them in-game.
Why Your Agent Can't See Its Own Mistakes
2026-04-10 On using a separate vision model (Gemini) for visual QA instead of letting the same agent judge its own 3D models. Fresh eyes matter, even for LLMs.
OCR, Chunking, and Image Indexing for Scientific Papers
2026-04-08 Comparing Tesseract, AWS Textract, and Azure Document Intelligence for scientific paper OCR. Plus notes on semantic chunking and image retrieval.
Using Latent Distance as a Proxy for Uncertainty
2026-04-07 Neural networks don't tell you when they're guessing. We tested a cheap way to fix that by looking at the model's internal state, and it outperformed the expensive standard methods.
Uncertainty-Guided Search Beats Beam Search
2026-04-06 What if a model only explored alternatives when it was unsure? We tested this against standard approaches and found it recovers from errors with less compute.
The Cliff Problem with Chain-of-Thought
2026-04-05 Chain-of-thought reasoning gets perfect scores during training, then collapses to 0% one step outside its comfort zone. We found an alternative that degrades gracefully instead.
Your Agent Needs to Know What It Doesn't Know
2026-04-04 AI agents confidently walk off cliffs they can't see. We ran three sets of experiments showing that self-awareness, not just raw capability, is what's missing.
Training a 3D Fighting Game as a Neural Network
2026-02-03 What if a neural network could replace a game engine? I recorded thousands of frames from a 3D robot fighting game and trained a model to generate the next frame from controller inputs.