Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
polynomial
15 days ago
|
parent
|
context
|
favorite
| on:
Attention at Constant Cost per Token via Symmetry-...
Right, not to "defend" the paper's claims, but it seems to be more like tuning
how
the leaky bucket leaks, using lossy compression to try to preserve some measure of coherency? Seems to turn on the fixed size summary.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: