ChatGPT, Claude, Gemini… Which One Should You Use?
Last week, in one of our training sessions, one question kept coming up again and again: Which AI model should I use?
I get it.
Every few weeks, there’s a new AI model, a new feature, or an update that makes it feel like you’re already behind.
ChatGPT-4o reasons cheaper and faster. Gemini 2.0 has a ridiculous 2-million-token context window. DeepSeek is the latest model promising open-sourced magic.
So, what to choose?
How AI Platforms Are Different
The truth is that there’s no perfect model—only the best model for your needs. And the best way to figure it out is to focus on how you actually work.
(Also: many people confuse models with tools—GPT-4o is the model, while ChatGPT-4o is the tool built around it, just as an operating system powers a device, shaping the overall user experience.)
Here’s how I think about AI model selection—and how you can, too:
1. Model Performance
Besides aptitude tests, where all major models are in an arms race, there are also user benchmarks. The most popular is LMSys, where users rate models on actual (perceived) performance. In these benchmarks, Gemini 2.0 Flash leads the charge, although ChatGPT-4o has a 'shared number 1' spot.
2. Model Focus
While every model is trained on similar data, fine-tuning, and product design changes impact what each model is best at. This is why ChatGPT is often said to be best at reasoning, while Claude excels at creative writing. Here is where personal preferences and objectives play a big role. Here’s what some community members had to say about their favorite platforms:
- Antony Slumbers of The Trillion Dollar Hashtag chooses Gemini: “Especially the ‘Thinking’ models are fascinating. Reading their explanations of how they are ‘thinking’ about answering questions provides a master class in critical thinking.”
- Carlo Benigni opts for Claude: “To get good results out of ChatGPT, I have to invest time in good prompts. Claude usually works the first time.”
- Patrik Breitenmoser also prefers Claude, but for its coding abilities: “I mostly use it when working in Cursor, but also works really well when I need to solve quick one-off questions.”
- Brian Elliott of Work Forward is another vote for Claude: “Claude is more “human” sounding in its responses. Using a Project in Claude allowed me to set tone and language guidelines and share examples of my writing that make it even better.”
- Andrew Currie, CEO of architecture firm Out-2 Design Group, recently discovered Copilot’s edge: “I did a side-by-side comparison with Copilot and ChatGPT yesterday for some industry-specific research, summarising, and creating a draft memo. Surprising to say, I preferred the results from Copilot.”
- And his takes on ChatGPT and Claude: "Most say Claude is better for writing, although if you train it on your own voice and you want to keep things simple, ChatGPT surely will be fine too."
3. Features
While some AIs offer little besides the familiar chatbox on the web and through mobile apps, some are investing heavily in products. Gemini (Deep Research) and Claude (Artifacts) look good here, but ChatGPT takes the crown with two autonomous agents (Operator and Deep Research) alongside Video, Screen Share, Voice Mode, Desktop App, and more.
