Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
mchusma
7 months ago
|
parent
|
context
|
favorite
| on:
OpenAI o3 and o4-mini
Love Sonnet but 3.7 is not obviously an improvement over 3.5 in my real world usage. Gemini 2.5 pro is great, has replaced most others for me (Grok I use for things that require realtime answers)
int_19h
7 months ago
|
next
[–]
Are you comparing it with or without thinking? I'd say it's a fairly big improvement in long thinking mode.
BriggyDwiggs42
7 months ago
|
prev
[–]
It does a lot better on philosophy questions.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: