Find Your Perfect Local AI Model
Compare system requirements, performance benchmarks, and get personalized recommendations based on your hardware.
Your Hardware
Hardware detection not yet run
Showing 20 of 20 models
Meta's specialized coding model based on Llama 2. Excellent for code completion and generation.
ollama pull codellama Cohere's model optimized for RAG and tool use. Excellent at following complex instructions.
ollama pull command-r State-of-the-art reasoning model with chain-of-thought capabilities. Excels at math, coding, and complex reasoning.
ollama pull deepseek-r1 Distilled version of R1 based on Qwen. Great reasoning in a smaller package.
ollama pull deepseek-r1-distill-qwen Google's open model built from Gemini research. Available in 2B, 9B, and 27B sizes.
ollama pull gemma2 Efficient smaller models from Meta, perfect for on-device deployment. Available in 1B and 3B sizes.
ollama pull llama3.2 Multimodal model that can understand images and text. Available in 11B and 90B variants.
ollama pull llama3.2-vision Meta's latest and most capable open model. Excellent for general tasks, coding, and reasoning with 128K context.
ollama pull llama3.3 Visual instruction-tuned model combining CLIP vision with Llama. Great for image understanding.
ollama pull llava Highly efficient 7B model that punches above its weight. Great balance of speed and capability.
ollama pull mistral Latest Mistral model optimized for efficiency. Enterprise-grade quality in a compact size.
ollama pull mistral-small Mixture of Experts model that activates only 2 experts per token. Fast inference with high quality.
ollama pull mixtral State-of-the-art embedding model. Top performance on MTEB benchmarks.
ollama pull mxbai-embed-large High-quality text embedding model. Perfect for RAG, semantic search, and similarity matching.
ollama pull nomic-embed-text Compact model with strong reasoning. Good balance between size and capability.
ollama pull orca-mini Microsoft's small language model. Surprisingly capable for its size, great for resource-constrained environments.
ollama pull phi3 Alibaba's flagship model with excellent multilingual support. Available from 0.5B to 72B.
ollama pull qwen2.5 Specialized coding model with excellent code completion and generation. Supports 92 programming languages.
ollama pull qwen2.5-coder Code LLM trained on The Stack v2. Excellent for code completion across 600+ programming languages.
ollama pull starcoder2