Published Date:
February 18, 2025

Top AI Leaders' Favorite AI Models

Understand the differences between popular AI models, what elite executives are using (plus their best tips), and how to choose the best one to pay for.
By Daan van Rossum
Founder & CEO, FlexOS

ChatGPT, Claude, Gemini… Which One Should You Use?

Last week, in one of our training sessions, one question kept coming up again and again: Which AI model should I use?

I get it.

Every few weeks, there’s a new AI model, a new feature, or an update that makes it feel like you’re already behind.

ChatGPT-4o reasons cheaper and faster. Gemini 2.0 has a ridiculous 2-million-token context window. DeepSeek is the latest model promising open-source magic.

So, what to choose?

How AI Platforms Are Different

The truth is that there’s no perfect model—only the best model for your needs. And the best way to figure it out is to focus on how you actually work.

(Also: many people confuse models with tools. GPT-4o is the model, while ChatGPT is the tool built around it, much as an operating system powers a device while the device itself shapes the overall user experience.)

Here’s how I think about AI model selection—and how you can, too:

1. Model Performance

Besides aptitude tests, where all major models are in an arms race, there are also user benchmarks. The most popular is LMSys, where users rate models on actual (perceived) performance. On that leaderboard, Gemini 2.0 Flash leads the charge at the time of writing, although ChatGPT-4o shares the number 1 spot.
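To make the idea of a user-voted leaderboard concrete, here is a minimal sketch of how head-to-head votes can be turned into a ranking with an Elo-style update. It is an illustration only: the model names, the votes, the starting rating, and the K-factor are all made up, and this is not a claim about the exact method LMSys uses.

```python
from collections import defaultdict


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def rank_from_votes(votes, k: float = 32.0, start: float = 1000.0):
    """Turn (winner, loser) votes into a list of (model, rating), best first.

    k and start are illustrative defaults, not values from any real leaderboard.
    """
    ratings = defaultdict(lambda: start)
    for winner, loser in votes:
        # The less expected the win, the more points change hands.
        delta = k * (1.0 - expected_score(ratings[winner], ratings[loser]))
        ratings[winner] += delta
        ratings[loser] -= delta
    return sorted(ratings.items(), key=lambda item: item[1], reverse=True)


# Hypothetical votes: each tuple means "a user preferred the first model".
votes = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_b", "model_c"),
]
leaderboard = rank_from_votes(votes)
for model, rating in leaderboard:
    print(f"{model}: {rating:.0f}")
```

The takeaway for choosing a model: a leaderboard position reflects many people's preferences across many kinds of prompts, which may or may not match your own tasks.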

2. Model Focus

While every model is trained on similar data, fine-tuning and product design choices shape what each model is best at. This is why ChatGPT is often said to be best at reasoning, while Claude excels at creative writing. Personal preferences and objectives play a big role here. Here’s what some community members had to say about their favorite platforms:

  • ​Antony Slumbers​ of ​The Trillion Dollar Hashtag​ chooses Gemini: “Especially the ‘Thinking’ models are fascinating. Reading their explanations of how they are ‘thinking’ about answering questions provides a master class in critical thinking.”
  • ​Carlo Benigni​ opts for Claude: “To get good results out of ChatGPT, I have to invest time in good prompts. Claude usually works the first time.”
  • ​Patrik Breitenmoser​ also prefers Claude, but for its coding abilities: “I mostly use it when working in Cursor, but also works really well when I need to solve quick one-off questions.”
  • Brian Elliott of Work Forward is another vote for Claude: “Claude is more ‘human’ sounding in its responses. Using a Project in Claude allowed me to set tone and language guidelines and share examples of my writing that make it even better.”
  • ​Andrew Currie​, CEO of architecture firm ​Out-2 Design Group​, recently discovered Copilot’s edge: “I did a side-by-side comparison with Copilot and ChatGPT yesterday for some industry-specific research, summarising, and creating a draft memo. Surprising to say, I preferred the results from Copilot.”
  • And his takes on ChatGPT and Claude: "Most say Claude is better for writing, although if you train it on your own voice and you want to keep things simple, ChatGPT surely will be fine too."

3. Features

While some AIs offer little besides the familiar chatbox on the web and in mobile apps, others are investing heavily in product features. Gemini (Deep Research) and Claude (Artifacts) look good here, but ChatGPT takes the crown with two autonomous agents (Operator and Deep Research) alongside Video, Screen Share, Voice Mode, Desktop App, and more.

4. Context Window

One of the most misunderstood things about AI models is the context window: the amount of text, measured in tokens, that an AI model can process at once. This determines how much of a conversation, document, or dataset the AI can remember and respond to effectively. Gemini 2.0 wins here with an astounding context window of up to 2 million tokens.
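As a rough illustration of what a context window means in practice, the sketch below estimates whether a document fits. It assumes about 4 characters per token, a common rule of thumb for English text; real tokenizers vary by model and language, so treat the result as a ballpark. The 2,000,000 figure is the one quoted above, while the 100,000-token window and the reply reserve are hypothetical values for comparison.

```python
CHARS_PER_TOKEN = 4  # rule-of-thumb assumption; actual tokenizers differ


def estimate_tokens(text: str) -> int:
    """Ballpark token count, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, context_window_tokens: int,
                    reserved_for_reply: int = 1000) -> bool:
    """Check whether the text plus room for a reply fits in the window.

    reserved_for_reply is a hypothetical allowance for the model's answer.
    """
    return estimate_tokens(text) + reserved_for_reply <= context_window_tokens


# A stand-in document of 500,000 characters (roughly a long book).
document = "word " * 100_000

print(estimate_tokens(document))              # 125000
print(fits_in_context(document, 2_000_000))   # True
print(fits_in_context(document, 100_000))     # False
```

In other words, a larger window matters mostly when you paste in long documents or keep very long conversations going.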

5. Knowledge Cutoff

Because training, fine-tuning, and safety testing take a long time given the immense amounts of data LLMs are trained on, each model has a knowledge cutoff date. The AI is oblivious to anything that happened after that date. Claude 3.5 Haiku is the winner here (July 2024). But, caveat: since Claude doesn't have internet access, in many cases it will actually feel more out of date. (As WSJ's Joanna Stern writes: "For all it knows, David Hasselhoff could be president, and we could all be commuting in Jetsons-style flying cars.")

6. Usage

How much platforms get used matters. As AIs thrive on data, where you place yours matters in the long run. Imagine using a platform for years, only for it to go away. Go back to start and try again. ChatGPT scores major points here: in our latest ranking, it had 8x the traffic of the number two, with 3.2 billion monthly visits.

7. Privacy

With every interaction, you send your AI more data. Most of the major players are taking this seriously, with Claude seen as one of the more privacy-conscious AIs due to fewer data retention concerns. New entrant DeepSeek raises privacy flags due to unclear security policies and China-based hosting. (Plus, as The Guardian highlighted in a recent test, it also refuses to answer questions about Tank Man in Tiananmen Square and other 'sensitive' topics.)

I’ve summarized the key differences between the leading models here:

Integrations: Perhaps the Biggest Factor

In last week’s coaching session, someone asked, “I use ChatGPT, but should I switch to Gemini for better Google Drive integration?”

Most people compare AI models based on intelligence, but the best AI is often the one that fits your workflow.

This means Copilot (built into Teams, Word, Excel, and Outlook) for Microsoft 365 users and Gemini (Docs, Drive, Gmail) for Google Workspace users.

Because your AI lives inside your work tools, it can anticipate tasks, summarize meetings, and suggest actions without you asking.

The Bottom Line: Choose What Suits You

There’s no “one best” AI model—only the best one for how you work.

If you’re stuck, my best advice is: be like Andrew and run your own side-by-side test with your most common AI tasks.

That’s the fastest way to determine which model helps you get more done.
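If you want to keep score during such a side-by-side test, a few lines of code are enough. The sketch below is one possible way to do it: the tool names, tasks, and 1-to-5 ratings are placeholders, and a simple average is an assumption about how you might want to weigh your tasks.

```python
from statistics import mean


def summarize(scores: dict[str, dict[str, int]]) -> list[tuple[str, float]]:
    """Average each tool's ratings and return (tool, average), best first.

    scores maps a tool name to {task: rating from 1 to 5}.
    """
    averages = {tool: mean(tasks.values()) for tool, tasks in scores.items()}
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


# Placeholder ratings: replace with your own tasks and impressions.
my_scores = {
    "Tool A": {"summarize report": 4, "draft memo": 3, "research": 5},
    "Tool B": {"summarize report": 5, "draft memo": 4, "research": 4},
}

ranking = summarize(my_scores)
for tool, average in ranking:
    print(f"{tool}: {average:.2f}")
```

Running the same prompts through each tool and rating the results on the same scale keeps the comparison honest, even if the scale itself is subjective.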

Now I’d love to hear from you:

  • Which AI model are you using the most right now?
  • Have you switched models recently? If so, why?

Let me know—I’m curious to hear what’s working for you!
