Hacker Newsnew | past | comments | ask | show | jobs | submit | evc123's commentslogin

> Whether scaling laws hold or not, is up for debate.

The scaling laws have broken: https://arxiv.org/abs/2210.14891


But what about "Broken Neural Scaling Laws" (https://arxiv.org/abs/2210.14891)?


I think my ignorance is showing here, but that paper's tldr to me seems to be: neural network performance is not a monotonic function of network width. Like the conclusion and the problem statement seem to be trivially equivalent.

They admit that this law is only useful if you already know where the 'breaks' are: "If an additional break of sufficient sharpness happens at a scale that is sufficiently larger than the maximum (along the x-axis) of the points used for fitting, there does not (currently) exist a way to extrapolate the scaling behavior after that additional break."



What is the business model of HuggingFace?


What about benefitting non-human animals? Hopefully the benefits are distributed to all creatures and not just humanity.


Nah, fchollet just doesn't want pytorch to minimize keras:

https://twitter.com/jekbradbury/status/976612114260357120



ONNX plans to support TensorFlow as well: https://github.com/onnx/onnx/issues/3


What are "something(s) not on our radar"?


The paper is the documentation.


The paper makes no mention of any of the files in the repo, nor any of the classes defined inside them. I'm not really sure what you mean.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: