The Neural Frontier
Posts
OpenAI Unveils New o3 Models 🤖!

OpenAI Unveils New o3 Models 🤖!

Also: Britannica hops on the AI train 🚀

The Neural Frontier
December 27, 2024

Happy Holidays from the curious folks at the Neural Frontier ❄️🎄!

It’s the season of jolliness, excitement, family, and, of course, updates from the frontier of AI technology 😏!

OpenAI unveiled yet another model; as expected, it’s been making the rounds. Equally exciting, Britannica—after about 200 years of existence—might be going public with a valuation of about $1 billion.

Even more interesting, the company is shifting its focus to AI-centric products.

And to round us off, we have a little surprise for you.

Wondering what it is? Let’s find out 🏃‍➡️!

In a rush? Here's your quick byte:

🤖 OpenAI unveils new o3 models!

🚀 Britannica hops on the AI train.

⏪ The Neural Frontier’s 2024 Rewind!

🤖 OpenAI unveils new o3 models!

Source: OpenAI

On the final day of its “12 Days of Shipmas” event, OpenAI unveiled o3, its latest family of reasoning AI models.

This successor to the earlier O1 model family boasts impressive new capabilities, claims of approaching artificial general intelligence (AGI), and significant caveats about its limitations and ethical considerations.

Here’s all you need to know:

🚀 Key Features and Updates with o3: This model has two variants: o3, the flagship reasoning model, and o3-mini, a distilled, task-specific version designed for efficiency.

Like its predecessor, o3 uses reinforcement learning to simulate "private chains of thought," breaking down tasks into logical steps before responding. In addition, users can now set compute levels (low, medium, or high), trading off speed for improved performance.

This model has also seen significant performance upgrades, including:

22.8% higher accuracy on SWE-Bench Verified (programming tasks).
96.7% on the 2024 American Invitational Mathematics Exam, missing just one question.
87.7% on GPQA Diamond (graduate-level STEM questions).
A record-breaking 25.2% score on Frontier Math, a benchmark for complex problem-solving.

🤖 Approaching AGI? OpenAI made bold claims that o3 approaches AGI—AI capable of performing tasks beyond its training with human-like adaptability.

ARC-AGI Test: o3 scored 87.5% on high compute settings, tripling o1’s performance on low compute settings.
Limitations: Critics like François Chollet, co-creator of ARC-AGI, note o3 still fails basic tasks and struggles with generalizing intelligence comparable to humans.

⚠️ Safety and Ethical Considerations: OpenAI continues its focus on safety with measures like collaborations with red-teamers and deliberative alignment, a new technique ensuring o3 adheres to OpenAI’s principles.

With the o3-mini model expected by late January 2025, we might be on the cusp of a leap forward in AI reasoning. However, these models are far from flawless.

While OpenAI claims that it edges closer to AGI, real-world applications will be the ultimate test of its impact on industries like coding, STEM research, and problem-solving.

🚀 Britannica hops on the AI train.

Source: ChatGPT Image Generator

Britannica (formerly Encyclopaedia Britannica) has made a striking pivot into artificial intelligence. With a potential public listing that could value the company at nearly $1 billion, Britannica is betting on AI-powered education products to redefine its role in the digital age.

Here’s the lowdown:

📚 From Iconic Print to AI Pioneer: Britannica, which stopped printing its iconic encyclopedias in 2012, was the longest-running English-language encyclopedia publisher.

The company has shifted online and leveraged its well-vetted, academic knowledge base to develop AI tools. This curated approach contrasts with models like ChatGPT, which suffer from hallucinations due to less reliable training data.

🚀 New AI-Powered Offerings: Britannica now focuses on online education tools designed for schools and libraries. AI features aim to personalize learning by identifying gaps in students’ understanding and tailoring lessons accordingly.

Powered by 200 years of encyclopedic knowledge, Britannica’s AI chatbot offers precise answers based on vetted content.

💡 Why Britannica Could Thrive:

Reputation Matters: Schools and institutions are willing to pay for trusted sources, especially as free tools like ChatGPT often return unreliable information.
Revenue Growth: The company expects its revenue to double from two years ago, reaching $100 million.

Britannica’s legacy of accuracy and academic rigor positions it uniquely in the education AI space. As institutions seek reliable tools, Britannica’s focus on quality could make it a leader as more organizations embrace the age of AI-powered learning.

⏪ The Neural Frontier’s 2024 Rewind!

2024 was definitely one of the most update-filled years we’ve seen in the AI space. From GPT-4o to advanced reasoning models like o1 and even agentic models like Gemini 2.0, it’s been a pretty lovely ride, to say the least. Don’t even get us started on the wonders of image and video generation we’ve seen this year.

We never missed a beat, delivering you the latest and greatest in the AI space every week, and our stats show it.

To all of you who subscribed and kept sharing our newsletter, we say a big thank you, as we recorded a 40% increase in subscriber growth this year.

And, apparently, our followership is large enough to fit into Fruita, Colorado 😏🚵. We can’t wait till we’re large enough to occupy Fruita, Colorado 😉.

As we draw the curtains on our last newsletter for 2024, every single one of the folks at the Neural Frontier says a big thank you (yes, to YOU!) for showing up, subscribing, sharing, and engaging with our newsletter and giving us the motivation to send this out every single week.

As we border on the frontier of 2025, who knows what we can expect? If this year was any indication, 2025 promises to be pretty marvelous. And, as always, we’ll be right there to capture all the juicy deets! 🚀

So, stay tuned, remain curious, enjoy some much-needed family time, and we’ll catch you in 2025 😉!