OpenAI Unveils New o3 Models!
Also: Britannica hops on the AI train
Happy Holidays from the curious folks at the Neural Frontier!
It's the season of jolliness, excitement, family, and, of course, updates from the frontier of AI technology!
OpenAI unveiled yet another model; as expected, it's been making the rounds. Equally exciting, Britannica, after more than 250 years of existence, might be going public with a valuation of about $1 billion.
Even more interesting, the company is shifting its focus to AI-centric products.
And to round us off, we have a little surprise for you.
Wondering what it is? Let's find out!
In a rush? Here's your quick byte:
OpenAI unveils new o3 models!
Britannica hops on the AI train.
The Neural Frontier's 2024 Rewind!
Source: OpenAI
On the final day of its "12 Days of Shipmas" event, OpenAI unveiled o3, its latest family of reasoning AI models.
This successor to the earlier o1 model family arrives with impressive new capabilities, claims that it approaches artificial general intelligence (AGI), and significant caveats about its limitations and ethical considerations.
Here's all you need to know:
Key Features and Updates with o3: The family has two variants: o3, the flagship reasoning model, and o3-mini, a distilled, task-specific version designed for efficiency.
Like its predecessor, o3 uses reinforcement learning to simulate "private chains of thought," breaking down tasks into logical steps before responding. In addition, users can now set compute levels (low, medium, or high), trading off speed for improved performance.
This model has also seen significant performance upgrades, including:
A score 22.8 percentage points higher than o1 on SWE-bench Verified (programming tasks).
96.7% on the 2024 American Invitational Mathematics Examination, missing just one question.
87.7% on GPQA Diamond (graduate-level STEM questions).
A record-breaking 25.2% score on FrontierMath, a benchmark for complex problem-solving.
Approaching AGI? OpenAI made bold claims that o3 approaches AGI, meaning AI capable of performing tasks beyond its training with human-like adaptability.
ARC-AGI Test: o3 scored 87.5% on the high-compute setting; even on the low-compute setting, it tripled o1's performance.
Limitations: Critics like François Chollet, co-creator of ARC-AGI, note that o3 still fails basic tasks and struggles to generalize the way humans do.
Safety and Ethical Considerations: OpenAI continues its focus on safety with measures like collaborations with red-teamers and deliberative alignment, a new technique intended to keep o3 in line with OpenAI's principles.
With the o3-mini model expected by late January 2025, we might be on the cusp of a leap forward in AI reasoning. However, these models are far from flawless.
While OpenAI claims that o3 edges closer to AGI, real-world applications will be the ultimate test of its impact on areas like coding, STEM research, and problem-solving.
Source: ChatGPT Image Generator
Britannica (formerly Encyclopaedia Britannica) has made a striking pivot into artificial intelligence. With a potential public listing that could value the company at nearly $1 billion, Britannica is betting on AI-powered education products to redefine its role in the digital age.
Here's the lowdown:
From Iconic Print to AI Pioneer: Britannica, which stopped printing its iconic encyclopedias in 2012, was the longest-running English-language encyclopedia publisher.
The company has shifted online and leveraged its well-vetted, academic knowledge base to develop AI tools. This curated approach contrasts with models like ChatGPT, which can hallucinate, in part because they are trained on less reliable data.
New AI-Powered Offerings: Britannica now focuses on online education tools designed for schools and libraries. AI features aim to personalize learning by identifying gaps in students' understanding and tailoring lessons accordingly.
Powered by more than two centuries of encyclopedic knowledge, Britannica's AI chatbot is designed to give precise answers based on vetted content.
Why Britannica Could Thrive:
Reputation Matters: Schools and institutions are willing to pay for trusted sources, especially as free tools like ChatGPT often return unreliable information.
Revenue Growth: The company expects its revenue to double from two years ago, reaching $100 million.
Britannica's legacy of accuracy and academic rigor positions it uniquely in the education AI space. As institutions seek reliable tools, Britannica's focus on quality could make it a leader as more organizations embrace the age of AI-powered learning.
The Neural Frontier's 2024 Rewind!
2024 was definitely one of the most update-filled years we've seen in the AI space. From GPT-4o to advanced reasoning models like o1 and even agentic models like Gemini 2.0, it's been a pretty lovely ride, to say the least. Don't even get us started on the wonders of image and video generation we've seen this year.
We never missed a beat, delivering you the latest and greatest in the AI space every week, and our stats show it.
To all of you who subscribed and kept sharing our newsletter, we say a big thank you, as we recorded a 40% increase in subscriber growth this year.
And, apparently, our followership is large enough to fit into Fruita, Colorado. We can't wait till we're large enough to occupy Fruita, Colorado.
As we draw the curtains on our last newsletter for 2024, every single one of the folks at the Neural Frontier says a big thank you (yes, to YOU!) for showing up, subscribing, sharing, and engaging with our newsletter and giving us the motivation to send this out every single week.
As we border on the frontier of 2025, who knows what we can expect? If this year was any indication, 2025 promises to be pretty marvelous. And, as always, we'll be right there to capture all the juicy deets!
So, stay tuned, remain curious, enjoy some much-needed family time, and we'll catch you in 2025!