For me, GPT-4-Turbo is significantly worse than even GPT-3.5: the former is much better at providing context for its answers (even erring on the too-verbose side), but then comes up with a pointless solution that it can't be dissuaded to change, even if its predecessor gets it right-ish.
Compared to both these GPT versions, Claude 3 (even though I have to use a proxy to pretend I'm in Nigeria...) is much more 'to the point' and seems more 'willing' to amend answers that don't go in the right direction, as opposed to simply backtracking and proposing a completely new solution.
But having to pare down the context of a question significantly remains a huge issue for all models, and I think this is their Achilles heel. Until you can feed a model your entire project, including any dependencies, and it can answer any questions in the that full context, the work required to retrofit useful answers is just too much to justify the expense.
So, since I have access to some IPs in Nigeria, I used those to (brazenly, possibly illegally!) evaluate their services (and no, the recent sea cable cuts don't help, but don't seem to affect my African upstreams too badly).
For me, GPT-4-Turbo is significantly worse than even GPT-3.5: the former is much better at providing context for its answers (even erring on the too-verbose side), but then comes up with a pointless solution that it can't be dissuaded to change, even if its predecessor gets it right-ish.
Compared to both these GPT versions, Claude 3 (even though I have to use a proxy to pretend I'm in Nigeria...) is much more 'to the point' and seems more 'willing' to amend answers that don't go in the right direction, as opposed to simply backtracking and proposing a completely new solution.
But having to pare down the context of a question significantly remains a huge issue for all models, and I think this is their Achilles heel. Until you can feed a model your entire project, including any dependencies, and it can answer any questions in the that full context, the work required to retrofit useful answers is just too much to justify the expense.