More

evc123 · on June 4, 2023

> Whether scaling laws hold or not, is up for debate.

The scaling laws have broken: https://arxiv.org/abs/2210.14891

evc123 · on April 21, 2023

But what about "Broken Neural Scaling Laws" (https://arxiv.org/abs/2210.14891)?

loonginthetooth · on April 21, 2023

I think my ignorance is showing here, but that paper's tldr to me seems to be: neural network performance is not a monotonic function of network width. Like the conclusion and the problem statement seem to be trivially equivalent.

They admit that this law is only useful if you already know where the 'breaks' are: "If an additional break of sufficient sharpness happens at a scale that is sufficiently larger than the maximum (along the x-axis) of the points used for fitting, there does not (currently) exist a way to extrapolate the scaling behavior after that additional break."

evc123 · on Feb 4, 2023

https://arxiv.org/abs/2210.14891

evc123 · on July 20, 2020

What is the business model of HuggingFace?

evc123 · on April 9, 2018

What about benefitting non-human animals? Hopefully the benefits are distributed to all creatures and not just humanity.

evc123 · on March 22, 2018

Nah, fchollet just doesn't want pytorch to minimize keras:

https://twitter.com/jekbradbury/status/976612114260357120

evc123 · on Feb 22, 2018

https://www.reddit.com/r/MachineLearning/comments/7ftxow/d_e...

evc123 · on Sept 8, 2017

ONNX plans to support TensorFlow as well: https://github.com/onnx/onnx/issues/3

evc123 · on Sept 4, 2017

What are "something(s) not on our radar"?

evc123 · on Aug 19, 2017

The paper is the documentation.

windowshopping · on Aug 19, 2017

The paper makes no mention of any of the files in the repo, nor any of the classes defined inside them. I'm not really sure what you mean.