
Ah. Thanks for posting - this makes a lot of sense.

I can totally see how they're able to pre-train models with no problem, but are having trouble with the "noticeably better" part.

Thanks!


