Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Why did you stop training shy of the frontier models? From the log plot it seems like you would only need ~50% more compute to reach frontier capability


We did a lot of internal testing and thought this model was already quite useful for release.


Makes sense! I like that you guys are more open about it. The other labs just drop stuff from the ivory tower. I think your style matches better with engineers who are used to datasheets etc. and usually don't like poking a black box


Thanks! I do like the labs blog posts as well though, OpenAI and Anthropic have some classics.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: