Microsoft's new lightweight AI model is as capable as GPT-3.5 despite being small enough to run on a smartphone

Microsoft's new small AI model can compete with larger models that power tools like ChatGPT and Copilot. (Image credit: Ben Wilson | Windows Central)

What you need to know

  • Microsoft has a new lightweight AI model called Phi-3 Mini.
  • Phi-3 Mini is one of three smaller models that Microsoft will release, the other two being Phi-3 Small and Phi-3 Medium.
  • Microsoft trained Phi-3 Mini using a curriculum similar to how children learn from hearing stories.
  • Because there aren't enough children's stories to train an AI model, Microsoft had an LLM create children's books to teach Phi-3 Mini.
  • Microsoft states that Phi-3 Mini is as capable as GPT-3.5 while being much smaller.

Microsoft has released a new lightweight AI model that promises capabilities similar to GPT-3.5's in some areas despite being much smaller. Phi-3 Mini is trained on far less data than GPT-4 or other large language models (LLMs), but it can outperform larger models such as Llama 2. Its small size also allows it to run locally on phones and laptops rather than requiring a connection to the web.

Microsoft shared details about Phi-3 in a research paper. The Verge then shared insight on the model and quotes from Microsoft.

Phi-3 Mini is a 3.8-billion-parameter language model that was trained on 3.3 trillion tokens. The research paper explains that one of the keys to the model is its training dataset, which is a scaled-up version of the one used for Phi-2, released in December 2023.

According to Microsoft, Phi-3 Mini can compete with models 10 times its size.

Lightweight models aren't exclusive to Microsoft. Google, Anthropic, and Meta all have smaller models. What sets Phi-3 Mini apart from other models is how it was trained. Microsoft used a "curriculum," Eric Boyd, Vice President of Microsoft Azure AI Platform, told The Verge. According to Boyd, Microsoft was inspired by how children learn from hearing bedtime stories.

Phi-3 Mini's training was limited by the number of children's stories available, so Microsoft had to make some. "There aren’t enough children’s books out there, so we took a list of more than 3,000 words and asked an LLM to make ‘children’s books’ to teach Phi," Boyd told The Verge.

A model like Phi-3 Mini is not meant to replace GPT-4 or other LLMs. Instead, small models can focus on specific tasks and use cases. Small models are also useful for companies using internal data for training.

Local AI

Some PCs will be able to run Microsoft Copilot locally rather than through the cloud. (Image credit: Windows Central | Jez Corden)

LLMs aren't going anywhere, but local AI is the next evolution of artificial intelligence. AI PCs will be able to run Microsoft Copilot locally to some extent, and organizations are working on ways to use AI without requiring a connection to the web. Models like Phi-3 Mini are small enough to run on phones, laptops, and other small devices.

When Intel revealed its next-gen Lunar Lake CPUs, the company confirmed the chips will have 100 TOPS (trillions of operations per second) of performance for AI tasks, with the NPU accounting for 45 TOPS. That figure is significant because Copilot requires at least 40 TOPS of NPU performance to run locally. Qualcomm's Snapdragon X Elite has 45 TOPS of NPU performance, meaning the processor can also power Copilot locally.

Tech giants raced to roll out LLMs and other AI models to the public, but we're just starting to see hardware that can take advantage of AI technology. Smaller models like Phi-3 Mini will play a role in specialized cases and on devices that don't meet the performance requirements to run Copilot and other AI tools locally.

Sean Endicott
News Writer and apps editor

Sean Endicott is a tech journalist at Windows Central, specializing in Windows, Microsoft software, AI, and PCs. He's covered major launches, from Windows 10 and 11 to the rise of AI tools like ChatGPT. Sean's journey began with the Lumia 740, leading to strong ties with app developers. Outside writing, he coaches American football, utilizing Microsoft services to manage his team. He studied broadcast journalism at Nottingham Trent University and is active on X @SeanEndicott_ and Threads @sean_endicott_. 