Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Also worth checking out was codestral... I think that had a 256k context and used Mamba even if it is slightly older model now... it had worked great for a Text2SQL use case we worked on.


Magistral 2509 just came out. It super slows down when you go over 40,000 context. It's quite a fantastic model.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: