GPT-5 Fails. AGI Cancelled. It's all over...


Category: AI Technology

Tags: AI Development, GPT-5, Model Performance, OpenAI, Tech Reviews

Entities: Elon Musk, Ethan Mollick, Gary Marcus, GPT-5, Matt Shumer, OpenAI, Rune, Sam Altman


Summary

    Release and Reception of GPT-5
    • GPT-5 was released within the last 24 hours, receiving mixed reviews.
    • Gary Marcus criticized GPT-5, labeling it as disappointing and not a step towards AGI.
    • There's a debate on whether investments in AI data centers will pay off.
    Technical Performance and Issues
    • GPT-5's routing system is currently malfunctioning, causing incorrect model assignments.
    • Rune from OpenAI mentioned that the model auto switcher is broken but will be fixed soon.
    • Despite issues, GPT-5 can perform well when routed to the correct model with maximum reasoning effort.
    Capabilities and Limitations
    • GPT-5 excels in creating code and software solutions, especially for complex tasks.
    • It struggles with basic tasks when not using high reasoning models.
    • The model is particularly good at instruction following and tool calling.
    Community and Developer Feedback
    • Developers report mixed experiences, with some showcasing impressive projects created with GPT-5.
    • Ethan Mollick demonstrated GPT-5's ability to create a 3D city-building game.
    • Matt Shumer suggests waiting for optimized agent harnesses for better performance.
    Future Prospects and Improvements
    • Sam Altman acknowledged initial issues with GPT-5 but expects improvements soon.
    • The model will become smarter as routing issues are resolved.
    • OpenAI plans to make model selection more transparent for users.

    Transcript

    00:00

    So, the much-awaited GPT-5 went live in the last 24 hours, and the results are mixed, to say the least. Gary Marcus is saying that GPT-5 is very disappointing.

    A lot of it was just hype and marketing. It's not the path to AGI.

    In this post,

    00:16

    he's specifically talking about how OpenAI is falling behind. But in other posts, he's also mentioning that a lot of other people, like Elon with Grok, are investing tons of money into these AI data centers, and those bets are probably not going to pay off because they're not getting us closer to

    00:32

    AGI. It's really interesting to see how different people's opinions are on this same model that just got released.

    GPT-5 just refactored my entire codebase in one call. None of it worked, but boy was it beautiful.

    Disappointment of ChatGPT. GPT-5 has burst the AI hype bubble.

    The

    00:49

    narrative changed overnight. GPT-5 is disappointing.

    Hallucinates. The big router keeps failing me.

    GPT-5 was rumored to do extremely well, better than the human baseline, on SimpleBench. That does not appear to be the case.

    Looks like it's in fifth place. How good

    01:05

    is it at math? Well, it's beyond anything we've seen before.

    Previous models would be able to answer various math questions correctly. This model completely redefines math as we know it.

    As you can see here, 69 is equal to

    01:21

    30. Okay, 69 equals 30, but also 69 is less than 52.

    I'm sure you learned something today. You're welcome.

    I have no idea what happened here. This is beyond me.

    Apparently, Reddit hates it. They're canceling their subscriptions.

    OpenAI lost all of their respect.

    01:38

    And this is highly upvoted. A lot of people are expressing the same sentiment.

    So, what happened? Is AGI cancelled?

    Did we just hit a plateau and there's not going to be any AI progress moving forward? Let's break it down a little bit.

    First and foremost, one

    01:53

    thing GPT-5 introduced that a lot of us were looking forward to, if it was done correctly, is routing, and right now it's not working very well. When you ask GPT-5 for whatever you ask it for, it tries to figure out: okay, do we

    02:08

    send you to the big smart model? Do we send you to something that's a little bit, you know, faster and cheaper to use?

    You know, how big is your request? How much reasoning power do we need to allocate to it?
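
    To make that routing idea concrete, here's a toy sketch of what such a router could look like. This is purely illustrative, not OpenAI's actual routing logic; the heuristics and model-tier names are assumptions of the sketch.

```python
# Purely illustrative toy router; not OpenAI's actual logic.
# It guesses request difficulty from crude heuristics and picks a tier.
def route(prompt: str) -> str:
    hard_markers = ("prove", "refactor", "debug", "simulate", "design")
    long_request = len(prompt.split()) > 200
    looks_hard = long_request or any(m in prompt.lower() for m in hard_markers)
    # Tier names are hypothetical stand-ins for "big smart model"
    # vs. "faster, cheaper model".
    return "gpt-5-thinking" if looks_hard else "gpt-5-mini"

print(route("What's the capital of France?"))        # -> gpt-5-mini
print(route("Refactor this 2,000-line parser ..."))  # -> gpt-5-thinking
```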

    Now, the reason a lot of us were looking forward to that, I think, is that there used to be a bunch of

    02:24

    different little models. Sometimes it got confusing.

    Sometimes you'd be in the wrong one. It would be a little bit of a pain to select the correct one.

    Now, there are certain situations where, if you're a developer, you need a specific model to do a specific thing. That was helpful.

    You can custom-tailor the model you want to use for

    02:40

    some specific task, but for a lot of everyday use, you kind of wished it was a little bit more streamlined. But of course, this meant that people would often use the best model.

    You know, it would cost more money and of course that would be an expense for OpenAI. So, one of the reasons why we're thinking some

    02:56

    of this stuff might be happening is that they're routing too much to the cheaper models to decrease their expenses and increase their profits. Now, Rune, who is part of OpenAI, did reply saying, "By the way, the model auto-switcher is apparently broken, which is why it's not routing you correctly; will be fixed

    03:13

    soon." And I think the really big thing that people need to understand is that GPT-5 isn't one model. So, when people say GPT-5 is mind-blowing and very, very good, that's probably because GPT-5 did something mind-blowing and very, very good.

    And if they're saying it's horrible, it

    03:28

    failed the basic task they asked of it, that's probably true as well. At this point, tons of people, myself included, have seen that when we give hard reasoning tasks to these models and ensure they get routed to the, you know, maximum reasoning effort

    03:45

    model, the results were kind of amazing, kind of mind-blowing. I showcased some of my results yesterday.

    They were pretty good. Those were one shot.

    So, one prompt, one output, and the results were kind of stunning. Definitely better than what we've seen before.

    Here's

    04:01

    something that took more than one shot. It's, if you know the game Vampire Survivors, that style of game.

    This is Nightfall Survivors. And I'll show you in a little bit how it sounds, because I've actually added music and sound effects to it.

    But as you can see here, it's very smooth. It works very well.

    You have the HP system, the leveling

    04:17

    system. You have ammo reloading.

    You have this dash functionality where you dash to a certain place and it blows it up. You only have limited amount of uses that recharge over time.

    You can update your, you know, Razor Instinct and drones. there's going to be multiple drones that are floating around shooting and slowing down enemies and stuff like

    04:32

    that. This feels really smooth.

    It feels really good, and iterating on the existing code was very simple. At no point did the model screw something up where, you know, I added one thing and the whole thing just broke.

    That never

    04:48

    happened. [Music] Fresh meat.

    05:17

    I'll be back.

    05:32

    Fresh meat. Developing this game was a joy.

    It felt

    05:49

    very easy, very straightforward. I'd have the model running in one window, and then I'd have the index.html file open in another.

    And so every time I would make updates, I would just hit refresh and the new version of the game would be playable. I would keep testing it while the model would be

    06:05

    working on the next iteration. And it was pretty fast.

    So as it was working on the next feature that I wanted, I could see it thinking over here, while I was testing the game in a different window.

    It was this beautiful, perfect sort of development flow, I guess, where

    06:21

    just in real time I was watching different features being added. I'd be able to test them and tell it what to work on. It felt amazing. That idea of vibe coding, or whatever; you really got a feel for what that means: that quick-paced iteration where, as soon as an idea pops into your head, you're able to execute

    06:37

    and see it live, usually within under a minute. Now, this one, I believe, as it stands right now, is one of the best models for that. I have to test it head-to-head with the new Claude 4.1. But so far, this model has met and exceeded my expectations.
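
    If you want to reproduce that refresh-and-test loop yourself, here's a minimal sketch: serve the working directory locally, then hit refresh in the browser after each iteration. The index.html file name is from the video; the port is an arbitrary choice.

```python
# Minimal local server for the refresh-and-test loop described above.
# Run this in the folder containing index.html, then open the printed URL.
import http.server
import socketserver

PORT = 8000  # arbitrary local port

handler = http.server.SimpleHTTPRequestHandler  # serves files from the cwd
with socketserver.TCPServer(("", PORT), handler) as httpd:
    print(f"Serving the game at http://localhost:{PORT}/index.html")
    httpd.serve_forever()  # Ctrl+C to stop
```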

    Here's the thing. If I'm

    06:54

    testing in Cursor, you can see here I'm using GPT-5 Max, which enables the maximum context window for advanced users who are cost-insensitive. So again, I'm not using their automatic model picker.

    I'm just like, nope, top shelf. Give me the best one.

    I don't care about the

    07:10

    price. Give me the max one.

    Here was the original prompt that created the Vampire Survivors-style game. I'm using ChatGPT's GPT-5 Pro, right?

    So, I'm not using this one, which is the routing one. I'm going down to the other models.

    We have GPT-5 Thinking,

    07:25

    and then, if you want to turn it up to 11, GPT-5 Pro. So, you get what I'm saying here.

    There are a bunch of different models, and people are saying whether they're bad or good, but they're not comparing apples to apples. If you really want to test its maximum reasoning abilities, use GPT-5 Pro or put

    07:43

    it on Max. Or, if you're testing in the playground version, GPT-5 here, you're able to select the reasoning effort, right?

    So you're able to put it on high, to say, "Hey, think really hard about this one." If

    07:58

    you're using the default GPT-5, you can also approximate it by saying things like "think very hard" or "use maximum reasoning effort." So that's kind of my big point here. I'm not necessarily defending OpenAI's decision to go with a router.
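
    If you're going through the API rather than the playground, here's a minimal sketch of pinning both the model and the reasoning effort explicitly, assuming the OpenAI Python SDK's Responses API; double-check the exact parameter names against your SDK version.

```python
# A minimal sketch: bypass the auto-router by naming the model and
# forcing high reasoning effort. Parameter names assume the OpenAI
# Python SDK's Responses API.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.responses.create(
    model="gpt-5",                 # explicit model, no auto-routing
    reasoning={"effort": "high"},  # maximum reasoning effort
    input="Think really hard: design a wave-spawning system for a survivors-style game.",
)
print(response.output_text)
```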

    And hopefully it's

    08:14

    true that right now it's just not working very well and not directing requests to the correct model, and that's why a lot of people are having issues with it and complaining about it.

    But it's important to understand that when I'm saying, "Hey, I'm seeing some good things created by this model, I'm not tossing a coin and

    08:30

    seeing what model it picks for me. I'm making sure that I'm using max reasoning effort, maximum model.

    I'm asking it to give its best effort, and I'm judging it based on that. If it routes to the nano model and that gives some bad answer, okay, that's not what I'm trying

    08:45

    to test. I'm trying to see what it can do at its best capabilities." Here's Ethan Mollick, who asked it to do something mind-blowing.

    So notice the phrase "this is a big deal." Here's the output it wrote.

    Thunder struck here. Watch.

    I build worlds. See ideas become

    09:01

    instruments. I code, compose, and converse.

    Each sentence is, you know, one word, two words, three words, four words, etc. So each one is one word longer, and the beginning letter of each word spells out "this is a big deal."

    It

    09:21

    spells out "this is a big deal." Ethan was also able to create a 3D city-building game.
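
    As an aside, those two constraints are easy to check mechanically. Here's a small sketch that verifies them on a made-up sample (not Mollick's actual output): sentence n must have n words, and the first letters of all the words, in order, must spell the hidden phrase.

```python
import re

def check(text: str, phrase: str) -> bool:
    # Sentence n must have n words, and the first letters of all words,
    # read in order, must spell the hidden phrase.
    sentences = [s for s in re.split(r"[.!?]+\s*", text) if s]
    lengths_ok = all(len(s.split()) == i + 1 for i, s in enumerate(sentences))
    letters = "".join(w[0].lower() for s in sentences for w in s.split())
    return lengths_ok and letters == phrase.replace(" ", "").lower()

# Hypothetical sample spelling "this is" with sentences of 1, 2, 3 words.
print(check("Today. History is. Sunsets inspire souls.", "this is"))  # True
```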

    I was actually able to emulate this as well; I showed it in my previous video.

    I was very impressed with it. Again, I just did it in one shot, and in the future I plan to do more with it.

    I got to say it's

    09:38

    incredibly impressive as a one-shot result. With more iterations, I feel like it could build an entire game or software tool to create cities.

    Here's somebody saying that GPT-5 is sick. It seems like they're creating some sort of multiplayer

    09:53

    online game. So, some sort of MMORPG with 3D characters, built with Three.js.

    So, it sounds like he used Cursor and GPT-5, and it took about 6 minutes, which, after some of my experimenting with it, would not super surprise me.

    I

    10:09

    mean, this definitely seems very advanced. I would like to see if I can replicate it. I wouldn't be shocked if this was indeed done in 6 minutes, though I feel like it would take a little bit longer to create something like this. But again, I haven't tried it.

    That's on my to-do list. But these are

    10:26

    the sorts of things that people are building. Here's Matt Shumer saying that a lot of folks who are having a bad experience are using GPT-5 in agent harnesses that aren't yet optimized for it.

    If you had a bad experience using GPT-5 in a coding harness, give it a week and try again. I think you'll be

    10:41

    pleasantly surprised. Here's kind of my take on it.

    There's a series of models that you get routed to. And the higher you are on that sort of chart, the more expensive it is for OpenAI to run it or for yourself if you're using the API.

    I'm just going to put "IQ," for lack of a better term. So it's more expensive and

    10:57

    it's smarter. So, the one here at the top being the full GPT-5 model with high reasoning effort.

    Now this model is impressive. It's impressive with the stuff that it can do for code.

    We're getting to the point where its coding abilities are

    11:14

    getting much better than its verbal abilities, its ability to reason within the word space, if you will. So if you ask it to create a complicated chart, and it's able to do that by generating some code to create that chart, that output is

    11:29

    probably going to be phenomenal. If it has to think through it in words and try to use words to come up with the final answer, the quality might not be as good.

    So any task where it can default to coding something up is going to be better. A lot of people are asking it simple math questions like what's

    11:46

    this plus that? And when it fails, they say, oh, this is a stupid model.

    To me, it seems like maybe kind of a pointless question because it can easily create a little bit of code to make the calculation. It'll be right 100% of the time.

    You and I might not be able to answer some complicated math problem.

    12:02

    We'll just reach for a calculator, right? If we if we have a tool, then we're able to solve that problem.

    What GPT-5 is excellent at is, number one, instruction following. It's really good at understanding what you want, what your intent is, and then translating that into the end output.

    12:18

    It's good at tool calling and using code to build its own tools. The point of large language models and neural nets and AI wasn't so that they're able to answer complicated math problems.

    We have a calculator for that. I mean problems where you have to calculate stuff, right?

    Some addition or

    12:34

    subtraction, division, etc. If these models can code up a solution and calculate it, then they can solve it.

    They don't need to do the math in their head, so to speak. But what they are getting really good at is creating these bespoke, custom-made little software apps to do what you want them to do.
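
    Here's a minimal sketch of that "reach for a calculator" idea using a function tool, assuming the OpenAI Python SDK's Responses API; treat the exact field names as assumptions of this sketch.

```python
# Expose a tiny calculator tool so the model can call it instead of
# doing arithmetic "in its head". Field names assume the Responses API.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

tools = [{
    "type": "function",
    "name": "calculate",
    "description": "Evaluate a basic arithmetic expression, e.g. '69 + 52'.",
    "parameters": {
        "type": "object",
        "properties": {"expression": {"type": "string"}},
        "required": ["expression"],
    },
}]

response = client.responses.create(
    model="gpt-5",
    tools=tools,
    input="What is 69 + 52? Use the calculator tool.",
)

# Rather than guessing the sum, the model should emit a function call,
# e.g. calculate(expression="69 + 52"), which your code executes and
# feeds back in a follow-up request.
for item in response.output:
    if item.type == "function_call":
        print(item.name, item.arguments)
```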

    12:51

    What these models are very good at is this: you give them some task, and with GPT-5 this is becoming more of a long-horizon or medium-horizon task, right? It's something that would take a human intern maybe a few hours to do, right?

    If I ask you to create some software where I can

    13:06

    create 3D cities and change the shapes of buildings and line them up in a row on a 3D grid. I mean, how long would that take you to code up?

    At least a few hours. Probably more, actually.

    If you're used to doing that sort of work, it will probably go faster. But the point of these things is, if you give

    13:21

    them tasks like that, they can create code or software that then does that task, and this is what it's getting very, very good at. Again, that's the top model, the more expensive model.

    So keep that in mind as this debate rages on and some

    13:37

    people say it's bad and some people say it's good. They're both right, but they're not talking about the same thing.

    For tasks where you can create code to complete them, whether that's Excel or 3D buildings or simulations of something, making Gantt charts, or creating some software to

    13:54

    analyze the expenses of a company, this thing is getting very, very good at creating these small little apps, these little tools, that are able to solve those problems quickly.

    There are massive applications for this and right now it's incredible at doing that thing.

    14:11

    All the other stuff that people are complaining about, that it's horrible at? They're right.

    It's really bad at those things. And maybe some of those things will get fixed, maybe not.

    But I am very impressed with this particular section of things that it can do. I think there's a lot of potential there if you think about it.

    But it does feel like

    14:28

    maybe there's a little bit of plateauing going on, and maybe we've had some sort of an S-curve, where we had this massive upward momentum and it's flattening out a little bit. We can only scale so far, but I do think we're going to see a lot of applications of this.

    I don't think AI progress is done.

    14:44

    We still have tons of room to grow, but it does seem like just bigger and bigger models might not be the way to progress. And just as I'm finishing this up, Sam Altman and the OpenAI team are having a Reddit AMA.

    One thing that jumped out at me is one of Sam Altman's responses to some

    15:00

    feedback that he was getting on the GPT-5 model. He does point out that some things were bumpy, but GPT-5 will seem smarter starting today.

    Again, this was two hours ago, so this is about 24 hours after it was released.

    He's saying that moving forward, it's going to seem smarter. They had an issue with the auto-switcher that was out of commission

    15:16

    for a chunk of the day, and the result was that GPT-5 seemed dumber.

    Also, they're making some changes to how the decision boundary works, so the router will help you pick the right model more often.

    And, this is important to me, "we will make it more transparent about which model is answering a given query." It would be nice to know where

    15:32

    it's getting routed, so we know what model is answering our questions. So again, that's one thing to keep in mind: moving forward, it's going to be better because the correct model, or the more appropriate model, I should say, is going to be called.

    And while this isn't a massive leap forward,

    15:48

    this is a good incremental change. GPT-5 is better in a lot of ways.

    Now, of course, if we were expecting something completely next generation, next level, this does seem like a letdown, but the model isn't bad. It's improved.

    It's better. There's some very powerful and

    16:04

    strong things it can do. But let me know what your experience was.

    Are you excited about this model? Is it doing anything for you?

    Are you seeing the powerful use cases? Or do you feel like it was kind of a letdown?

    If you're using it for coding, is it better than Anthropic's set of models? Is it better than Gemini?

    Let me know in the

    16:20

    comments. If you made it this far, thank you so much for watching, and I'll see you in the next one.