Show HN: Needle — Gemini tool calling distilled into a 26M-param model
A small team open-sourced Needle: a 26-million-parameter model that mimics Gemini's tool-calling decisions well enough to slot in as a router for agent frameworks. The cost story (inference that is orders of magnitude cheaper per call than a full-size model) is what the thread keeps circling back to.
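
To make the "router" idea concrete, here is a minimal Python sketch of how a small tool-calling model could sit in front of an agent framework. Everything in it is an assumption for illustration: `ToolDecision`, `route`, the confidence threshold, and the fallback to a larger model are hypothetical, and `keyword_stub_router` is a keyword-matching stand-in, not Needle's actual interface.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ToolDecision:
    tool: Optional[str]  # name of the chosen tool, or None for "no tool call"
    confidence: float    # router's confidence in [0, 1]


# Hypothetical interface: any callable mapping (message, tool names) to a decision.
Router = Callable[[str, list[str]], ToolDecision]


def keyword_stub_router(message: str, tools: list[str]) -> ToolDecision:
    """Stand-in for a small distilled model (NOT Needle's real API).

    Picks the first tool whose name appears in the message.
    """
    text = message.lower()
    for name in tools:
        if name.lower() in text:
            return ToolDecision(tool=name, confidence=0.9)
    return ToolDecision(tool=None, confidence=0.3)


def large_model_stub(message: str, tools: list[str]) -> ToolDecision:
    """Stand-in for an expensive call to a large hosted model."""
    return ToolDecision(tool=None, confidence=1.0)


def route(
    message: str,
    tools: list[str],
    small: Router,
    fallback: Router,
    threshold: float = 0.5,  # assumed cutoff, not taken from the project
) -> tuple[str, ToolDecision]:
    """Try the cheap router first; escalate only when it is unsure."""
    decision = small(message, tools)
    if decision.confidence >= threshold:
        return "small", decision
    return "fallback", fallback(message, tools)


if __name__ == "__main__":
    tools = ["weather", "calendar"]
    print(route("check the weather in Oslo", tools, keyword_stub_router, large_model_stub))
    print(route("tell me a joke", tools, keyword_stub_router, large_model_stub))
```

In this sketch the savings come from how often the first branch is taken: every request the small router answers confidently is one that never reaches the larger model. Whether Needle exposes a confidence score at all is not stated in the post, so the threshold logic should be read as one possible design rather than a description of the project.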