Watch out, Hollywood! OpenAI's latest model generates lifelike minute-long AI videos, but it has some critical weaknesses

Sora struggles with the prompt "Archeologists discover a generic plastic chair in the desert, excavating and dusting it with great care." (Image credit: OpenAI)

What you need to know

  • OpenAI recently debuted a new AI model dubbed Sora with video generation capabilities.
  • The text-to-video model can generate up to one-minute-long videos while maintaining high quality and adherence to the user’s prompt.
  • However, Sora struggles to simulate the physics of a complex scene and understand specific instances of cause and effect.

At the beginning of the year, Microsoft's Bill Gates and OpenAI's Sam Altman sat down on the Unconfuse Me podcast. The two discussed a range of topics around the ChatGPT maker, including Altman's firing and rehiring, the development of GPT-5, superintelligence, and more.

Altman also discussed the possibility of video capabilities shipping to the company's AI-powered chatbot, describing it as a top request from users. He added that this would build on the existing voice mode and image generation features.

And now, barely a month after sharing this information, OpenAI has unveiled a new text-to-video model dubbed Sora. The AI model "can generate videos up to a minute long while maintaining visual quality and adherence to the user's prompt."

It's worth noting that the model won't be available for everyone to access immediately. OpenAI is shipping the tool exclusively to "red teamers," visual artists, designers, and filmmakers who will assess potential areas for harm and risk.

Additionally, this will create an avenue for seasoned professionals in the film industry to provide feedback and suggest new ways for OpenAI to advance and improve the model.

Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt but also how those things exist in the physical world.

OpenAI

While the model ships with a deep understanding of language that allows it to interpret text prompts and generate life-like characters correctly, OpenAI admits that it also has its fair share of weaknesses. 

The company pointed out that the model may face challenges when trying to simulate the physics of a complex scene. It may also struggle to understand specific instances of cause and effect. According to an example provided by OpenAI to further explain this premise, "a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark."

Sora can also generate a video featuring multiple shots that "accurately persist characters and visual style." However, it may fall short when it comes to the spatial details of a prompt. For instance, it may struggle to tell left from right or to follow specific events that take place over time.

AI may render more professions obsolete

(Image credit: Future | Image Creator by Designer)

After tough economic times, generative AI is the next biggest factor threatening job security. AI-powered chatbots like Microsoft Copilot and ChatGPT are already claiming jobs from journalists. We've seen multiple publications lay off some of their employees in favor of these AI chatbots, and it turned out to be a hot mess. Meanwhile, Microsoft has introduced a new program designed to equip journalists with skills that will prepare them for a future newsroom with AI.


AI-powered tools like Microsoft's Image Creator from Designer (formerly Bing Image Creator) are also getting good at design work. This could potentially render architectural jobs obsolete.

Admittedly, if someone showed me the videos generated by Sora, I wouldn't have even imagined that they were AI-generated (they look that good). And while the videos are currently capped at one minute, it's only a matter of time till you can generate an entire episode of your favorite show. 

OpenAI has indicated that it is working on measures to prevent misinformation, hateful content, and bias before it ships the model to general availability.

Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. You'll also catch him occasionally contributing at iMore about Apple and AI. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

  • Ben Wilson
    I'm most interested in where they expect to get the training data for Sora, at least whenever the full version is released. You likely have to guess it's YouTube, which feels strange.