Is GPT-4o overall worse than GPT-4 for you?

Question

In my experience GPT-4o has much weaker reasoning than GPT-4. It is a lot faster, however. What have your experiences been?

aurareturn · Accepted Answer

Small anecdote, but I asked for a SQL query and GPT-4o got it wrong. It didn't work when I ran it. I pasted the same question into GPT-4 and it got it right.

roncesvalles · Answer

Maybe it's a placebo, but I switched back to GPT-4. Something about GPT-4o's responses: they're too verbose and ramble on about generalities instead of capturing the nuance of the question, which is what I really want to know. Almost like 3.5 in that regard.

serulin · Answer

4o is complete trash. It literally won't listen, can't reason, and talks nonstop like an idiot, flooding you with useless info you didn't ask for. It's like that one kid who over-explains and thinks he's smart. It's not even that much faster, and the quality sacrifice makes it unusable.

joeythedolphin · Answer

It's nearly useless for me. GPT-4 used to be so good, 3.5 even, but I believe they have nerfed processing power per request, and the OpenAI stack is virtually useless to me. I tend to rely on Claude.

meiraleal · Answer

That's not my experience at all. I was using Claude Opus to code before and now I'm back to ChatGPT. GPT-4o is faster, doesn't generate placeholders and works way better for me because of the larger context.

speedgoose · Answer

I find it better overall, it's my default. It's also what people think in blind tests: https://arena.lmsys.org

btbuildem · Answer

Came here from a Google search with a similar sentiment. I'm getting the strong impression that 4o is significantly weaker than 4, at least for dealing with coding snippets.

runjake · Answer

I find 4o better than 4 in my experiences. Mostly doing code generation/correction in Python/JS, and asking science, business, finance, management, and other non-creative questions.

nullbio · Answer

I've found it is a lot worse in general. I use GPT-4 90% of the time now, and 4o when I need something answered quickly that has a very simple answer.

atleastoptimal · Answer

Half say 4o is better, others say it's worse. I'd wager it's probably about as good as 4T on average then.

EISENFELD · Answer

In my experience GPT-4o is better in coding. I tested it with old C and Go. Both gave me better results.

Turboblack · Answer

Maybe someone else is delighted with this, but for me it's all still the stone age.

lmiller1990 · Answer

I noticed this too, I find 4.0 is still better for giving me what I ask for.

EchoStar27 · Answer

No issues so far. Quality seems similar to 4, but it's way faster.

tikkun · Answer

4o feels worse than 4 for me.

ciprianx · Answer

It's worse.

muzani · Answer

Yeah, it's much worse for me, worse than 3.5 even. Almost at the level of GPT-3 curie at worst. I suspect it could be related to whatever it's using as language detection, because many others don't experience this. It glitches hard on language, often responding in the wrong one.

wruza · Answer

Sorry for a tangent, but is gpt-4 also better for you than 8x7b? When I return to 8x7b from gpt-4 it feels like I just shook off an unbearably boring guy and met a normal one, both very similar in knowledge (and unable to perform complex tasks).

kelsier1 · Answer

Their claim hasn't been that 4o is better than 4. Just that it's faster and cheaper. So it's better than 3.5-turbo but not as good as 4, at least from the examples I've tried out for summarization, code gen, etc.