To borrow a concept from cloud server hosting, there's also the factor of overselling. Most open-source LLM hosts probably oversell quite a bit: they don't scale up resources as fast as OpenAI/Anthropic do when requests increase. I notice many OpenRouter providers are noticeably faster during off hours.
In other words, it's not just the model size; it's also the concurrent load and how many GPUs you keep spun up at any given time. I bet the big players' cost is quite a bit higher than the numbers on OpenRouter, even for comparable parameter counts.
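A back-of-envelope sketch of that claim, in Python with every number invented: the same GPU serving tokens at lower average utilization (headroom kept for demand spikes) costs more per token, so a provider that oversells looks cheaper per token than one that scales up aggressively.

  # Back-of-envelope sketch (every number here is invented) of why
  # utilization, not just model size, drives serving cost per token.
  gpu_hour_cost = 2.00      # $/hr for one GPU, assumed
  tokens_per_sec = 1000     # aggregate throughput at full batch, assumed

  for utilization in (0.9, 0.3):  # oversold vs. headroom kept for spikes
      tokens_per_hour = tokens_per_sec * utilization * 3600
      cost_per_million = gpu_hour_cost / tokens_per_hour * 1e6
      print(f"{utilization:.0%} utilization -> ${cost_per_million:.2f} per 1M tokens")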
This is something you can credibly say only if you've never used a popular new non-mainline editor. Sublime Text was "buggy as hell" too; all new editors are. Editors are incredibly difficult to do well, and Zed is doing it on hard mode, cross-platform.
I have this DVD set in my basement. Technically, there are still methods for estimating the probability of unseen n-grams. Backoff (falling back to lower-order n-gram estimates) is an option. You can also impose prior distributions, Bayesian-style, so that you can make "rational" guesses (see the sketch below).
N-grams are surprisingly powerful for how little computation they require. They can be trained in seconds, even with tons of data.
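To make both points concrete, here's a minimal Python sketch; the toy corpus, the add-k value, and the 0.4 backoff weight are all made up for illustration. Training is just counting (hence the speed), add-k smoothing plays the role of a Dirichlet prior for unseen words, and stupid backoff handles unseen bigrams (it yields scores rather than a normalized distribution):

  from collections import Counter

  corpus = "the cat sat on the mat the cat ate".split()
  vocab_size = len(set(corpus))

  # "Training" is just counting -- this is why n-grams fit in seconds.
  unigrams = Counter(corpus)
  bigrams = Counter(zip(corpus, corpus[1:]))

  def p_unigram(w, k=1.0):
      # Add-k smoothing: equivalent to a symmetric Dirichlet prior,
      # so even unseen words get a "rational" nonzero probability.
      return (unigrams[w] + k) / (len(corpus) + k * vocab_size)

  def score_bigram(prev, w, alpha=0.4):
      # Stupid backoff: use the bigram estimate when the pair was seen,
      # otherwise back off to the discounted unigram estimate.
      if bigrams[(prev, w)] > 0:
          return bigrams[(prev, w)] / unigrams[prev]
      return alpha * p_unigram(w)

  print(score_bigram("the", "cat"))  # seen: plain relative frequency
  print(score_bigram("the", "dog"))  # unseen: backed off, still > 0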