OpenAI rolls out its latest reasoning models o3 and o4‑mini

Described as the company's “smartest models to date,” they can agentically use and combine every tool within ChatGPT, such as web search, analysing uploaded files, reasoning about visual inputs, and generating images.

Apr 17, 2025 - 11:20
 0
OpenAI rolls out its latest reasoning models o3 and o4‑mini

OpenAIhas released o3 and o4-mini, its latest models with reasoning capabilities

Touted to be the company's smartest models to date, they can use agentic AI and combine every tool within ChatGPT—including web search, analysing uploaded files and other data with Python, reasoning about visual inputs, and generating images.

“Critically, these models are trained to reason about when and how to use tools to produce detailed and thoughtful answers in the right output formats, typically in under a minute, to solve more complex problems. This allows them to tackle multi-faceted questions more effectively, a step toward a more agentic ChatGPT that can independently execute tasks on your behalf,” read the company’s blog post. 

Internal tests show o3 setting new scores on STEM and multimodal benchmarks such as MMMU, MathVista and CharXiv, while o4‑mini delivers similar reasoning depth at lower cost and latency. On perception tasks, the larger model reaches 95.7 % accuracy, according to the company.

OpenAI claims that o3 performs well at visual tasks such as analysing images, charts, and graphics. According to external experts, o3 makes 20% fewer major errors, compared to the earlier o1 model, on real-world tasks. It also shows improvements in areas such as programming, business consulting, and creative ideation.

Both models are trained to reason through problems, decide when and how to use available tools to generate detailed answers, and deliver results in under a minute.

“For example, a user might ask: 'How will summer energy usage in California compare to last year?' The model can search the web for public utility data, write Python code to build a forecast, generate a graph or image, and explain the key factors behind the prediction, chaining together multiple tool calls. Reasoning allows the models to react and pivot as needed to information it encounters,” said the company. 


Edited by Swetha Kannan