Hacker News

It's not just force-inserting a word. Reasoning is integrated into the model's training process.


Not into the core foundation model. The foundation model still only predicts the next token in a static way. The reasoning is tacked onto the InstructGPT-style finetuning step, and it's done through prompt engineering, which is the shittiest way a model like this could have been done, and it shows.
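To make the claim concrete: "reasoning via prompt engineering" means nothing about the model's weights changes; a chain-of-thought template is just prepended to the input before ordinary next-token prediction. A minimal sketch, where `next_token_model` is a hypothetical stand-in for a frozen foundation model, not a real API:

```python
# Chain-of-thought template applied at inference time, not learned in training.
COT_TEMPLATE = "Let's think step by step.\n\nQuestion: {q}\nReasoning:"

def next_token_model(prompt: str) -> str:
    # Hypothetical stand-in for a frozen model that only predicts next tokens:
    # it just appends generated text to whatever prompt it receives.
    return prompt + " <generated tokens>"

def prompt_engineered_reasoning(question: str) -> str:
    # The "reasoning" is tacked on by rewriting the input; the model itself
    # is unchanged. Contrast with training reasoning into the weights.
    return next_token_model(COT_TEMPLATE.format(q=question))
```

The parent's counter-claim is that o1-style models instead optimize the reasoning trace during training (e.g. with reinforcement learning), so the behavior lives in the weights rather than in a template like the one above.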