Hacker News

Can multimodal LLMs read the PDF file format directly, extracting its text components as well as its graphical ones? That seems to me like the best way to go.

