OpenAI launches next-gen reasoning models with "incredible" coding capabilities

The OpenAI logo appears on the screen of a smartphone.
(Image credit: Getty Images | NurPhoto)

On the last day of OpenAI's 12 days of shipmas, the ChatGPT maker announced OpenAI o3. The new model is OpenAI 01's successor, but as you might have noticed the AI firm skipped o2, which would have been the more obvious moniker for the flagship reasoning model's successor.

A report by The Information suggests that the decision to skip o2 is tied to trademark issues, as it might create conflict with British telecom provider O2 in the foreseeable future. Alongside OpenAI o3, the AI firm announced o3-mini, a smaller version of the next-gen model designed to achieve specific tasks.

While the company shipped OpenAI o1 to broad availability this month, the preview version will be limited to safety researchers and available for sign-up later today. This could be part of OpenAI's plan to fine-tune the model's user experience and performance before shipping it to general availability.

Interestingly, OpenAI o3 features "incredible" coding capabilities per benchmarks shared. OpenAI o1 also features impressive coding capabilities to the extent that it aced OpenAI's research engineer hiring interview for coding at a 90-100% rate. It's also up to three times better at handling tasks and answering complex queries, according to ARC-AGI (a sophisticated benchmark used to determine a model's capability to reason and solve complex tasks for the first time)

According to OpenAI CEO Sam Altman:

“We view this as the beginning of the next phase of AI. Where you can use these models to do increasingly complex tasks that require a lot of reasoning.”

Similarly, Google is trying to keep up with the AI train with its own reasoning model dubbed Gemini 2.0 Flash Thinking. Google CEO Sundar Pichai refers to the new model as the "most thoughtful model yet." Reasoning models are increasingly becoming important as more organizations hop onto the AI train and incorporate the technology into their workflow. This is because they'll be able to handle complex tasks and queries.

CATEGORIES
Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. You'll also catch him occasionally contributing at iMore about Apple and AI. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

Read more
ChatGPT logo is seen displayed on a smartphone screen next to a laptop keyboard.
"It's very good": Sam Altman says OpenAI will launch o3 mini reasoning model in a couple of weeks — with API and ChatGPT simultaneously
OpenAI and ChatGPT
"We made a mistake in not being more transparent": OpenAI secretly accessed benchmark data, raising questions about the AI model's supposedly "high scores" — after Sam Altman touted it as "very good"
s1 artificial intelligence reasoning model displayed on a smartphone
Forget DeepSeek: Researchers develop a $50 OpenAI competitor in less than 30 minutes that thinks harder when you ask it to "wait"
OpenAI logo on an Android phone.
"Deep Research has been a personal AGI moment for me": OpenAI's new AI agentic tool simulates a personal research analyst
OpenAI CEO Sam Altman is seen on a mobile device screen.
Is ChatGPT getting a "grown up mode" with fewer guardrails? CEO Sam Altman hints AGI, AI agents, and deep research as part of OpenAI's roadmap for 2025
OpenAI logo on an Android phone.
Microsoft says 'rStar-Math' demonstrates how small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1 by +4.5%
Latest in Software Apps
Photo of Microsoft's new sign-in page for Xbox.com using the Microsoft Edge browser.
Over one billion users will get a new Microsoft user experience, and it has a dark mode
Artificial intelligence mobile apps for DeepSeek, ChatGPT and Google Gemini arranged.
Google says its latest reasoning model is its "most intelligent" — but Microsoft's CEO claims Google already fumbled its AI opportunity
ChatGPT and Microsoft Logo
ChatGPT’s new image-generation tool is impressive; it can finally create a glass of wine filled to the brim — but it struggles with blank white images and appears to discriminate against 'sexy women'
Microsoft Edge Sidebar
My favorite Microsoft Edge feature just got an AI upgrade — is this the best way to use Copilot on Windows 11?
Professor Sir Roger Penrose, physicist, mathematician and cosmologist
Nobel laureate claims "AI will not be conscious" and shouldn't be considered intelligent — Until it develops its own ideas
In this photo illustration OpenAI ChatGPT icon is displayed on a mobile phone screen in Ankara, Turkiye on August 13, 2024.
OpenAI says an excessive dependency on ChatGPT can lead to loneliness and a "loss of confidence" in decision-making
Latest in News
Cloud servers
Microsoft has killed "several" data center projects in the U.S. and Europe, according to reports — Microsoft responds (Updated)
Photo of Microsoft's new sign-in page for Xbox.com using the Microsoft Edge browser.
Over one billion users will get a new Microsoft user experience, and it has a dark mode
The Thing: Remastered key art
The Thing comes to Xbox Cloud Gaming's "Stream Your Own Game" library alongside other new arrivals
Promotional screenshot of heroes fighting a giant in Pillars of Eternity
Obsidian's classic Baldur's Gate successor 'Pillars of Eternity' is getting a surprise turn-based mode later this year, alongside other updates
Atomfall
Atomfall reviews and Metacritic scores are in: Here's a roundup of what everyone's saying about this new Game Pass survival game
Screenshot of one of the new flat world presets in Minecraft.
Minecraft testing new flat world presets and a better way to locate your friends in-game