<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Tool-Calling on Pavel Nasovich's Blog</title>
    <link>https://forcewake.me/tags/tool-calling/</link>
    <description>Recent content in Tool-Calling on Pavel Nasovich's Blog</description>
    <generator>Hugo -- 0.157.0</generator>
    <language>en-us</language>
    <copyright>Copyright 2026</copyright>
    <lastBuildDate>Sat, 09 Aug 2025 00:31:26 +0200</lastBuildDate>
    <atom:link href="https://forcewake.me/tags/tool-calling/index.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>II-Search-4B: A Love Letter to Small Models (Or How I Learned to Stop Worrying and Embrace 4B Parameters)</title>
      <link>https://forcewake.me/ii-search-4b-a-love-letter-to-small-models-or-how-i-learned-to-stop-worrying-and-embrace-4b-parameters/</link>
      <pubDate>Fri, 08 Aug 2025 00:00:00 +0000</pubDate>
      <guid>https://forcewake.me/ii-search-4b-a-love-letter-to-small-models-or-how-i-learned-to-stop-worrying-and-embrace-4b-parameters/</guid>
      <description>A technical analysis of II-Search-4B reveals how this 4-billion-parameter model achieves 91.8% SimpleQA accuracy through specialized web-search capabilities, outperforming base models by 256% on multi-hop reasoning tasks. We dissect practical deployment strategies, from 8-GPU tensor parallelism to single-GPU quantization and Apple Silicon optimization, and show how focused specialization enables frontier search performance at 1/50th the parameter count of comparable general-purpose models. The lesson: intelligent constraints and targeted training trump raw scale in the evolving landscape of AI agents.</description>
    </item>
  </channel>
</rss>