Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
lumost
on Feb 22, 2024
|
parent
|
context
|
favorite
| on:
Things I don't know about AI
For small values of N, the linear terms of the transformer dominate. At the end of the day, a double layer of 764*2048 is still north of 3.1 MM flops/token/layer.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: