> The real issue is they tested on data in their training set. Hm, no. They trai...

_carltg · 2025-09-30T21:51:18 1759269078

Yes, but due to it being derived from the same underlying source dataset, it is effectively evaluating on the training dataset, not an independent validation/ test dataset.

The difference is subtle but important. If we expect the model to truly outperform a general model, it should generalize to a completely independent set.

bangaladore · 2025-09-30T19:24:57 1759260297

Thanks, rereading it makes it clear that you are correct.