Microsoft demonstrates how it's training small language models to reason better than ChatGPT and Copilot

Microsoft Logo Building Redmond
(Image credit: Windows Central)

What you need to know

  • Microsoft recently published a new blog post highlighting its efforts toward teaching small language models how to reason.
  • It unveiled Orca 2, a small language model demonstrating strong reasoning abilities by imitating the step-by-step reasoning traces of more capable LLMs like ChatGPT and Copilot.
  • According to benchmarks, Orca 2 sports advanced performance capabilities compared to other LLMS when put to the test to handle complex tasks. 
  • Microsoft intends to train smaller language models using LLMs, ultimately expanding their capabilities.

There's no doubt that Microsoft has placed all its bets on generative AI, especially after making a multi-billion dollar investment in the tech, further extending its partnership with OpenAI.

Speaking of OpenAI, we've witnessed what most might refer to as a paradigm shift affecting the top management of the tech firm. OpenAI's board of directors stripped Sam Altman of his position, citing a lack of confidence in his leadership skills. Shortly after, Altman was offered a job at Microsoft leading the Advanced AI team alongside Greg Brockman (former OpenAI co-founder who resigned shortly after Altman's ousting).

As all these unfold, Microsoft has published a new blog post highlighting its efforts toward teaching small language models how to reason. A few months ago, the company debuted Orca. "A 13-billion language model demonstrating strong reasoning abilities by imitating the step-by-step reasoning traces of more capable LLMs."

And now, it has unveiled Orca 2 (which comes in two sizes - 7 billion and 13 billion parameters) as part of its efforts to tap into the capabilities of smaller LMs. According to Microsoft, Orca 2 sports "improved training signals and methods can empower smaller language models to achieve enhanced reasoning abilities." This is a significant feat, considering these capabilities are often found on larger language models, including ChatGPT and Copilot

Admittedly, both chatbots have faced numerous setbacks this year, with several users citing that ChatGPT is getting dumber amid claims that OpenAI is on the verge of bankruptcy. On the other hand, a report cited that Bing's user base has stagnated for three months consecutively, despite Microsoft's hefty investment in the tech.

Microsoft further cites that Orca 2 stacks miles ahead of other similar models, even the original Orca model. Moreover, the company indicated that it sports advanced performance levels compared to other larger models when handling complex tasks that test "advanced reasoning abilities in zero-shot settings."

Frontier Language Models such as GPT-4, PaLm, and others have demonstrated a remarkable ability to reason, for example, answering complex questions, generating explanations, and even solving problems that require multi-step reasoning; capabilities that were once considered beyond the reach of AI. Traditionally, such abilities have not been observed in smaller language models, so the challenge is how to use our growing knowledge of large language models to increase the abilities of these smaller models.

Microsoft

Microsoft's Advanced AI team is oncourse

OpenAI staffers joining Microsoft

(Image credit: Windows Central | Bing Image Creator)

Amid the OpenAI fiasco over the weekend, Microsoft CEO Satya Nadella, announced that Sam Altman would be joining the company as lead of the Advanced AI team. A job that suits his capabilities and skill set. 

Following the unfortunate Altman news, more than 500 OpenAI staffers penned a letter to the board of directors requesting his reinstatement, citing that the decision undermined their vision. The employees indicated they'd leave the company if their demands weren't met, citing that "there's no OpenAI without its people."

According to sources familiar with the situation, Microsoft is ready to absorb all OpenAI employees into the AI division should they decide to make good on their promise and leave the company.

Microsoft will likely leverage OpenAI's team to achieve more with Orca 2. Consequently, this will allow the company to use LLMs to train smaller language models, ultimately expanding the capabilities of smaller language models.

Do you think Microsoft is on track with its Orca 2 ventures? Share your thoughts with us in the comments.

CATEGORIES
Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. You'll also catch him occasionally contributing at iMore about Apple and AI. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

Read more
OpenAI logo
I asked ChatGPT and Copilot about AGI predictions for 2025 — OpenAI unanimously tops the chart partly due to its Microsoft tie-up and 2-year lead building AI 'uncontested'
OpenAI logo on an Android phone.
Microsoft says 'rStar-Math' demonstrates how small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1 by +4.5%
DeepSeek logo on a smartphone in front of a PC screen with the same logo.
Is AI all hype? DeepSeek tumbles to #51 on Apple's App Store, weeks after dethroning ChatGPT as the most downloaded free AI app in the US — OpenAI CEO Sam Altman already promised to "obviously deliver better models"
The X account of OpenAI CEO Sam Altman is displayed on a mobile phone with a ChatGPT logo.
Sam Altman says OpenAI can confidently build AGI as the ChatGPT maker shifts focus to superintelligence: "I kinda miss doing AI research back when we didn't know how"
Sam Altman and Satya Nadella on stage
A leaked document suggests OpenAI will hit AGI when it builds an AI system that can generate up to $100 billion in profit — but the ChatGPT maker could endure a massive $44 billion loss before seeing profit in 2029 partly due to Microsoft tie-up
Artificial Intelligence AI Assistant Apps - ChatGPT, Anthropic Claude, Google Gemini, Microsoft Copilot, Perplexity, Poe.
Satya Nadella admits Microsoft missed an opportunity as ChatGPT and Copilot gain popularity — even OpenAI's Sam Altman "doesn't do Google searches anymore"
Latest in Microsoft
Steve Ballmer and Bill Gates, former CEOs of Microsoft.
Bill Gates says Satya Nadella almost missed the cut for CEO of Microsoft — Even with Steve Ballmer's support
Microsoft Majorana 1 chip designed for quantum computing
Microsoft dismisses quantum computing skepticism: "There is a century-old scientific process established by the American Physical Society for resolving disputes"
The Microsoft logo on a smartphone and laptop arranged in Crockett, California, US, on Friday, Dec. 29, 2023.
"Would you say there is a reasonable balance between what you contribute to Microsoft and what you get in return?" Two-thirds of Microsoft employees say YES — as AI engineers get preferential compensation packages.
Like a Dragon Pirate Yakuza in Hawaii screenshot
Microsoft blocks (some) Windows 11 pirates while Lenovo steals the show at Mobile World Congress
Satya Nadella with Sam Altman at a conference
Salesforce CEO Marc Benioff's prediction about Microsoft and OpenAI's partnership may have just manifested — and it's not a pretty look for the ChatGPT maker
Age of Empires II with retail box
I ranked 7 of the best Microsoft games of all time to celebrate its 50th anniversary — disagree with these classics if you dare
Latest in News
Professor Sir Roger Penrose, physicist, mathematician and cosmologist
Nobel laureate claims "AI will not be conscious" and shouldn't be considered intelligent — Until it develops its own ideas
UGreen x Genshin Impact charging accessories: image shows magnetic wireless charger, power bank, GaN charger and USB-C cable
UGreen drops a stunning Genshin Impact collection of charging accessories AND it's all on sale
Lies of P boss
Grab these must-play games at killer deal prices during the CDKeys Spring Festival
In this photo illustration OpenAI ChatGPT icon is displayed on a mobile phone screen in Ankara, Turkiye on August 13, 2024.
OpenAI says an excessive dependency on ChatGPT can lead to loneliness and a "loss of confidence" in decision-making
Alienware Area-51 laptops (2025)
Dell revives Alienware Area-51 with powerful new gaming PCs
The First Berserker: Khazan
The First Berserker: Khazan review and Metacritic score roundup — this stylish Soulslike sounds like a must-play action RPG