157 models. 30 providers. One command.
llmfit detects your hardware, scores every model across quality, speed, and fit, then tells you exactly which ones will run well on your machine.
cargo install llmfit
or
brew install AlexsJones/llmfit/llmfit
System: Apple M2 Pro | 32 GB unified | Metal | Ollama: 8 installed

 #  Score  Model                          Params  Quant   tok/s  Fit        VRAM
──  ─────  ─────────────────────────────  ──────  ──────  ─────  ─────────  ───────
 1     92  Qwen3-8B                       8.2B    Q8_0     38.2  Perfect     8.4 GB
 2     89  Llama-3.1-8B-Instruct          8.0B    Q8_0     36.5  Perfect     8.2 GB
 3     87  Mistral-7B-Instruct-v0.3       7.2B    Q8_0     40.1  Perfect     7.4 GB
 4     85  Gemma-3-12b-it                 12B     Q6_K     28.7  Perfect    12.6 GB
 5     83  Mixtral-8x7B (MoE)             46.7B   Q4_K_M   22.4  Good        6.6 GB
 6     81  Qwen2.5-Coder-14B-Instruct     14.8B   Q4_K_M   25.3  Perfect    14.2 GB
 7     78  Mistral-Small-24B              24B     Q4_K_M   18.1  Good       18.4 GB
 8     74  Qwen3-32B                      32.8B   Q4_K_M   12.8  Marginal   24.6 GB
 9     71  Llama-3.3-70B-Instruct         70.6B   Q2_K      6.2  Too Tight  38.4 GB
10     68  DeepSeek-R1 (MoE)              671B    Q2_K       --  Too Tight   186 GB

Filter: All | /search | f:fit | p:provider | d:download | q:quit
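To give a feel for what the Fit column in the sample output means, here is a minimal Rust sketch that labels a model by the share of available memory it would occupy. It is an illustration only, not llmfit's actual scoring code: the function names `classify_fit` and `estimate_weights_gb` and the 50% / 70% / 100% cutoffs are assumptions chosen for this example. The real tool clearly weighs more than this ratio; the Mixtral row, for instance, is rated Good at 6.6 GB.

```rust
/// Fit labels as they appear in the sample output above.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Fit {
    Perfect,
    Good,
    Marginal,
    TooTight,
}

/// Classify a model by the fraction of available memory it needs.
/// The cutoffs (0.5, 0.7, 1.0) are hypothetical values for this
/// sketch, not thresholds taken from llmfit.
pub fn classify_fit(required_gb: f64, available_gb: f64) -> Fit {
    let ratio = required_gb / available_gb;
    if ratio <= 0.5 {
        Fit::Perfect
    } else if ratio <= 0.7 {
        Fit::Good
    } else if ratio <= 1.0 {
        Fit::Marginal
    } else {
        Fit::TooTight
    }
}

/// Back-of-the-envelope weight memory in GB:
/// parameters (in billions) * bits per weight / 8 bits per byte.
/// Ignores KV cache and runtime overhead, which is an assumption
/// made to keep the sketch short.
pub fn estimate_weights_gb(params_billions: f64, bits_per_weight: f64) -> f64 {
    params_billions * bits_per_weight / 8.0
}
```

With 32 GB available, `classify_fit(18.4, 32.0)` falls in the Good band and `classify_fit(38.4, 32.0)` in Too Tight under these assumed cutoffs. Quantized formats store scaling factors alongside the weights, so the effective bits per weight passed to `estimate_weights_gb` is usually a little above the nominal figure in the quant name.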
One command tells you which models fit your hardware. Written in Rust. Zero runtime dependencies. Works offline.