"Starting from a single base LLM" Ok, zero data, except the data used in the tea... | Hacker News


Iv 3 months ago | on: R-Zero: Self-Evolving Reasoning LLM from Zero Data

"Starting from a single base LLM" Ok, zero data, except the data used in the teacher model.

nickpsecurity 3 months ago

Only 1-15 TB of data, processed at a cost of $10k-$100M depending on model size. Then this shaves a few hundred to a few thousand dollars off fine-tuning. I mean, we're still saving money, at least.
