Is GPT-4o overall worse than GPT-4 for you?
In my experience GPT-4o has much weaker reasoning than GPT-4. It is a lot faster, however. What have your experiences been?
Small anecdote but I asked an SQL query and GPT4o got it wrong. It didn't work when I ran it. Pasted the same question into GPT4 and it got it right.
Maybe it's a placebo but I switched back to GPT-4. Something about GPT-4o's responses, it's too verbose and rambles on about generalities instead of capturing the nuance of the topic of question, which is what I really want to know. Almost like 3.5 in that regard.
4o is completely trash, it literally won't listen, cant reason, talks non stop like an idiot gushing you with usless info you didnt ask for. Its like that one kid that over explains and thinks hes smart. Its not even that much faster, quality sacrifice is not usable.
It's nearly useless for me. GPT-4 used to be so good, 3.5 even, but I believe they have nerfed processing power per request, and the OpenAI stack is virtually useless to me, I tend to rely on Claude.
That's not my experience at all.
I was using Claude Opus to code before and now I'm back to ChatGPT. GPT-4o is faster, doesn't generate placeholders and works way better for me because of the larger context.
Came here from a google search with a similar sentiment..
I'm getting the strong impression that 4o is significantly weaker than 4, at least for dealing with coding snippets.
I find 4o better than 4 in my experiences. Mostly doing code generation/correction in Python/JS, and asking science, business, finance, management, and other non-creative questions.
I've found it is a lot worse in general. I use GPT-4 90% of the time now, and 4o when I need something answered quickly that has a very simple answer.
Half say 4o is better, others say it's worse. I'd wager it's probably about as good as 4T on average then.
In my experience GPT-4o is better in coding. I tested it with old C and Go. Both gave me better results.
maybe someone else is delighted with this, but for me it’s all still the stone age
I noticed this too, I find 4.0 is still better for giving me what I ask for.
No issues so far. Quality seems to be similar as 4 but way faster
4o feels worse than 4 for me.
Yeah, it's much worse for me, worse than 3.5 even. Almost at the level of GPT-3 curie at worst.
I suspect it could be related to whatever it's using as language detection, because many others don't experience this. It glitches hard on language, often responding in the wrong one.
Sorry for a tangent, but also is gpt-4 better for you than 8x7b?
When I return to 8x7b from gpt-4 it feels like I just shook off an unbearably boring guy and met a normal one, both very similar in knowledge (and unable to perform complex tasks).
Their claim hasn't been that 4o is better than 4. Just that it's faster and cheaper. So it's better than 3.5-turbo but not as good as 4, atleast from the examples I've tried out for summarization, code gen etc.