Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For coding? Claude Opus.

For me, GPT-4-Turbo is significantly worse than even GPT-3.5: the former is much better at providing context for its answers (even erring on the too-verbose side), but then comes up with a pointless solution that it can't be dissuaded to change, even if its predecessor gets it right-ish.

Compared to both these GPT versions, Claude 3 (even though I have to use a proxy to pretend I'm in Nigeria...) is much more 'to the point' and seems more 'willing' to amend answers that don't go in the right direction, as opposed to simply backtracking and proposing a completely new solution.

But having to pare down the context of a question significantly remains a huge issue for all models, and I think this is their Achilles heel. Until you can feed a model your entire project, including any dependencies, and it can answer any questions in the that full context, the work required to retrofit useful answers is just too much to justify the expense.



> you can feed a model your entire project, including any dependencies, and it can answer any questions in the that full context,

I've tried Gemini 1.5 with ~1M tokens and it took >90sec to answer anything.


How did you get access to Gemini 1.5, I thought it wasn't available for general access yet?


There's a waiting list for people who want to try it for free.


Ah, should have probably googled before asking. Thanks!


> I have to use a proxy to pretend I'm in Nigeria

Could you explain more?


I'm located in the EU, where Claude is unavailable: https://www.anthropic.com/claude-ai-locations.

So, since I have access to some IPs in Nigeria, I used those to (brazenly, possibly illegally!) evaluate their services (and no, the recent sea cable cuts don't help, but don't seem to affect my African upstreams too badly).


+1 I am genuinely confused




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: