The best guide to LLM tools and resources

10 Underrated LLMs Worth Trying Beyond ChatGPT and Claude

Let's be honest: if you're like most people, your LLM journey probably goes like this. You heard about ChatGPT, gave it a try, maybe switched to Claude when someone told you it was "better at writing," and called it a day. But here's the thing—the AI landscape is exploding with incredible alternatives that most people haven't even discovered yet.

These underrated powerhouses deserve your attention. Some are completely free. Others run locally on your own machine. A few specialize in tasks that make the big names look ordinary. Let's dive into the hidden gems of the LLM world.

1. Mistral AI (Le Chat)

What Makes It Unique: Paris-based Mistral AI burst onto the scene with models that compete with—and sometimes beat—OpenAI's offerings, all while maintaining a smaller footprint. Their chat interface, Le Chat, offers a clean, multilingual experience that feels refreshingly straightforward compared to the feature-bloated interfaces of bigger competitors.

Who It's Best For: Multilingual users who need seamless switching between languages, European language speakers who want a homegrown alternative, and anyone tired of waiting in endless queues during peak hours.

Standout Feature: Mistral offers genuinely free access to models like Mistral NeMo (open-sourced in partnership with NVIDIA), and Le Chat provides a generous free tier with a 32k token context window. Unlike many competitors, they haven't locked everything behind a paywall.

Try Le Chat Free →

Affiliate disclosure: we may earn a commission when you sign up through our links.

2. Microsoft Phi-3

What Makes It Unique: Phi-3 is what happens when Microsoft takes a radically different approach to model design. These "small language models" pack surprisingly capable intelligence into tiny packages—Phi-3-mini has just 3.8 billion parameters yet outperforms models with 10x the size. It runs on your laptop, your phone, or anywhere else without cloud dependencies.

Who It's Best For: Privacy-conscious users, developers building on-device AI, anyone tired of watching their data hop across servers, and budget-conscious projects that can't afford API costs.

Standout Feature: The ability to run Phi-3 entirely offline. While ChatGPT and Claude require constant internet connectivity, Phi-3 can live on your local machine, processing everything locally. Your prompts never leave your device.

Explore Phi-3 on Hugging Face →

3. Yi-34B (01.AI)

What Makes It Unique: Built by Chinese AI startup 01.AI, Yi-34B achieved something remarkable—it ranked highest among ALL open-source models on the Hugging Face Open LLM Leaderboard, outperforming giants like Llama-70B and Falcon-180B. It consistently scores on par with GPT-3.5 across benchmarks while being completely open-source.

Who It's Best For: Researchers, developers who need strong bilingual (English/Chinese) capabilities, and anyone looking for a truly capable open-source model without licensing headaches.

Standout Feature: The Yi-34B-Chat model landed in second place on the AlpacaEval Leaderboard (behind GPT-4 Turbo) and actually outperformed GPT-4, Mixtral, and Claude on certain benchmarks. This isn't a "decent for being open-source" model—it's genuinely competitive with the best.

Get Yi-34B on Hugging Face →

4. WizardCoder

What Makes It Unique: While everyone was hyping up ChatGPT's coding abilities, WizardCoder quietly became one of the best code-generating models on the planet. Built by fine-tuning CodeLlama and StarCoder using Evol-Instruct methodology, it surpasses Claude and Bard on code generation benchmarks like HumanEval and HumanEval+.

Who It's Best For: Developers, programmers, and anyone whose primary use case is writing, debugging, or explaining code.

Standout Feature: WizardCoder-33B-V1.1 achieves 79.9 pass@1 on HumanEval—that's the percentage of problems it solves correctly on the first try. For comparison, many commercial models struggle to hit 70%. And it's completely free and open-source.

Download WizardCoder →

5. DeepSeek

What Makes It Unique: DeepSeek emerged as a formidable open-source alternative, particularly known for strong reasoning and coding capabilities. Their models have gained serious traction in developer communities for delivering Anthropic and OpenAI-level performance at a fraction of the cost—or completely free for certain use cases.

Who It's Best For: Developers who want enterprise-grade coding assistance without enterprise-grade pricing, and anyone exploring reasoning-intensive tasks.

Standout Feature: DeepSeek's open-source models rival closed-source giants in logical reasoning and mathematical problem-solving, all while being completely transparent about their architecture and training data.

Access DeepSeek →

6. Hugging Face Ecosystem

What Makes It Unique: Hugging Face isn't a single model—it's an entire universe of open-source AI. With tens of thousands of models available, from tiny 1B parameter models that run on Raspberry Pis to massive 400B+ beasts, the selection is staggering. The platform has become the GitHub of machine learning.

Who It's Best For: Tinkerers, researchers, developers building custom solutions, and anyone who wants to compare different models side-by-side.

Standout Feature: The ability to test hundreds of models instantly through their Spaces platform. You can try specialized models for everything from medical text analysis to creative writing to legal documents—all in your browser, often for free.

Explore Hugging Face →

7. Grok (xAI)

What Makes It Unique: Built by Elon Musk's xAI, Grok takes a deliberately different approach—it's designed to be "witty" and tackle questions that other AIs refuse to touch. While it's still maturing, Grok offers free access (in certain tiers) and integrates with X/Twitter for real-time information.

Who It's Best For: Users who want uncensored, unfiltered responses, X/Twitter power users, and anyone curious about xAI's rapidly evolving technology.

Standout Feature: Real-time access to X/Twitter data means Grok can discuss current events and trending topics with context that models trained on static datasets simply can't match.

Try Grok Free →

8. Command R+ (Cohere)

What Makes It Unique: Cohere, founded by former Google AI researchers, built Command R+ specifically for enterprise RAG (Retrieval-Augmented Generation) workloads. While it might be overkill for casual users, for businesses building AI applications, it offers capabilities that generalist models can't match.

Who It's Best For: Enterprises, developers building RAG pipelines, and organizations that need verifiable, sourced responses.

Standout Feature: Enterprise-grade reliability with proper citation and source tracking—essential for businesses that need to know where their AI's answers come from.

Explore Command R+ →

9. Gemma (Google)

What Makes It Unique: Google's Gemma models bring the company's AI research to the open-source world. Available in various sizes (including the impressively capable 27B parameter version), Gemma offers strong performance with the backing of Google's massive research infrastructure.

Who It's Best For: Android developers, Google Cloud users, and anyone who wants Google's AI technology without being locked into their ecosystem.

Standout Feature: Seamless integration with Google Cloud and Vertex AI, plus strong performance in a truly open-source package.

Get Gemma Models →

10. LocalAI

What Makes It Unique: LocalAI is exactly what it sounds like—a fully self-hosted, local alternative to OpenAI, Claude, and others. It runs on your own hardware, supports consumer-grade GPUs (or even CPUs), and can serve as a drop-in replacement for OpenAI's API.

Who It's Best For: Privacy maximalists, organizations with strict data residency requirements, developers building offline-capable applications, and anyone tired of vendor lock-in.

Standout Feature: Complete control over your AI infrastructure with no usage limits, no API costs after initial setup, and no data leaving your premises.

Set Up LocalAI →

Why Bother With Alternatives?

Here's the truth that the mainstream AI discourse ignores: the "best" LLM depends entirely on your use case. ChatGPT and Claude are fantastic generalists, but they're not optimized for your specific workflow. They're also not free (for meaningful usage), they require internet connectivity, and they collect your data.

The alternatives above offer something the giants can't: choice. Whether you need local processing for privacy, specialized coding capabilities, multilingual support, or just a fresh perspective, there's an underrated LLM waiting for you.

The AI revolution isn't just happening at OpenAI and Anthropic. It's happening in open-source communities, in startups worldwide, and on laptops everywhere. The only question is: are you going to explore beyond the obvious?

Want more LLM discoveries? Subscribe to our newsletter for weekly deep dives into underrated AI tools.