Running local models is good now
Vicki Boykis walks through how far local LLMs have come: quantized open-weight models now run usefully on a laptop, the tooling (llama.cpp, Ollama, MLX) has matured, and for many tasks you no longer need a frontier API.
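
As a rough illustration of why quantization puts these models within reach of a laptop, the sketch below estimates the memory taken by the weights alone at different precisions. It is back-of-the-envelope arithmetic, not something taken from the post: the function name and the 8-billion-parameter figure are hypothetical, and real quantized formats store extra scaling metadata, so actual files run somewhat larger. The KV cache and activations also add to the total at run time.

```python
def estimate_weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Ignores quantization metadata, the KV cache, and activations,
    so treat the result as a lower bound.
    """
    total_bytes = num_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / (1024 ** 3)


if __name__ == "__main__":
    # Hypothetical model size chosen only for illustration.
    num_params = 8e9
    for bits in (16, 8, 4):
        gib = estimate_weight_memory_gib(num_params, bits)
        print(f"{bits:>2}-bit weights: about {gib:.1f} GiB")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts the footprint to a quarter, which is the difference between needing a dedicated GPU and fitting in ordinary laptop memory.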