How to select models based on your task at hand
gemini-2.5-pro
or claude-4-sonnet
) are confident and make decisions with minimal prompting.o3
or claude-4-opus
) take time to plan or ask questions to understand context more deeply.claude-4-opus
gemini-2.5-pro
o3
(designed for complex reasoning)claude-4-sonnet
gpt-4.1
claude-4-sonnet
, gemini-2.5-pro
, and gpt-4.1
can all serve as reliable daily drivers - it comes down to how much control you want.
If you prefer to… | Models |
---|---|
Be in control, give clear instructions | claude-4-sonnet , gpt-4.1 |
Let the model take initiative | claude-4-opus , gemini-2.5-pro , o3 |
claude-4-sonnet
, gemini-2.5-pro
| | Codebase navigation/search |
gemini-2.5-pro
, claude-4-opus
, o3
| | Planning or problem-solving |
claude-4-opus
, gemini-2.5-pro
| | Complex bugs or deep reasoning | o3
|o3
is designed for complex, ambiguous problems. It is powerful but also
slower and more resource-intensive, which makes it better suited for
occasional use.o3
). It does not route based on task type, but is a solid default if you are unsure which to choose.
Date | Changes |
---|---|
Late May 2025 | Updated recommendations for newer models. Simplified categories as capabilities improve. |
Early May 2025 | Initial version covering model selection guidance, behavior patterns, and selection criteria |
claude-4-sonnet
, gemini-2.5-pro
, and gpt-4.1
are all strong daily drivers. Your choice depends on interaction style.o3
is designed for the hardest problems.