Anthropic is actually a good point to focus on since Claude is very good proof that it's not about the scaling. We are not quite there yet but we are "programming" through how we shape and filter the input data for training it seems. With time, we'll understand the methods to better represent.
Current situation doesn't sound too good for "scaling hypothesis" itself.
Current situation doesn't sound too good for "scaling hypothesis" itself.