The worst vice is superfluous faux-eloquence that meanders without meaning: employing linguistic devices for their own sake without ever actually making a point.
I was trying to figure out why my SD card wasn't mounting and asked ChatGPT. It said:
> Your kernel is actually being very polite here. It sees the USB reader, shakes its hand, reads its name tag… and then nothing further happens. That tells us something important. Let’s walk this like a methodical gremlin.
It's so sickly sweet. I hate it.
Some other quotes:
> Let’s sketch a plan that treats your precious network bandwidth like a fragile desert flower and leans on ZFS to become your staging area.
> But before that, a quick philosophical aside: ZFS is a magnificent beast, but it is also picky.
> Ending thought: the database itself is probably tiny compared to your ebooks, and yet the logging machinery went full dragon-hoard. Once you tame binlogs, Booklore should stop trying to cosplay as a backup solution.
> Nice, progress! Login working is half the battle; now we just have to convince the CSS goblins to show up.
> Hyprland on Manjaro is a bit like running a spaceship engine in a treehouse: entirely possible, but the defaults are not tailored for you, so you have to wire a few things yourself.
> The universe has gifted you one of those delightfully cryptic systemd messages: “Failed to enable… already exists.” Despite the ominous tone, this is usually systemd’s way of saying: “Friend, the thing you’re trying to enable is already enabled.”
Did you put something weird in your prompt? That's not the style of writing I get from my ChatGPT; I run without memory and with the default prompt.
Yours tries to make a metaphor in every single response.
The developers aren't pro players either; the cutting edge in anti-cheat still requires that non-cheaters play with cheaters for months. I would not be shocked if a simple vote-kick outperformed every anti-cheat on the market.
- If you were too good on some server, you'd get banned.
- If the admin didn't know cheating well, he could tolerate something that was obviously cheating.
- Cheaters could just change servers often.
Yes, it used to be easy to just ban people, and it was just as easy for cheaters to switch servers.
Plus, most competitive games today have custom lobbies, which do exactly what you want, and there's a reason only a minority of players use them.
Custom lobbies don't meet the same need. That's for playing with your friends, or at least people you vet yourself. Community servers are sub-communities in and of themselves: people tend to play on the same servers regularly, allowing you to build rapport and community norms, and to have substantially more direct moderation than company-run servers.
Yes, sometimes you run into power-tripping moderators. That comes with the territory of having moderators. But the upsides of being embedded in a usefully sized community with nearly constant human moderation, not to mention the whole "stop killing games" of it all, far outweigh the need to shop around a bit for a good server.
I think the ideal middle ground is something like Squad's server system: The developers offer a contract to server owners, establishing basic standards that must be met to be a recommended server. Rules forbidding the crazy bigotry that milsims tend to attract, minimum server specs to ensure smooth gameplay, an effective appeals process. If a server meets those requirements, and signs the agreement to keep meeting those standards, they get put on a "recommended" server list (which 90%+ of the playerbase exclusively use). Other servers go on the "custom" server list, which can be modded, or spun up for certain events, or whatever.
Two or three months ago I played a game that does exactly what you propose: V Rising. It has a server browser, and I played for a week with a friend on a busy server.
Then the server was gone for two weeks. When it came back, most of the bases were gone due to inactivity.
That's the kind of thing that was common too; maybe you forgot about it.
All the multiplayer games I play today are either community server based, or I exclusively interact with private lobbies.
My negative experiences with community servers represent a pretty short list. Sometimes servers die, but games die sometimes, too. That's obviously only an issue with persistent-state games, like Minecraft, but it's unfortunate when it happens. Can't say it was so frequent that it impacted my enjoyment of any games as a whole.
All true, but of course you're missing the player agency component that renders those issues moot. If any of the above happens, you can simply find another server.
Private games (now called "custom lobbies") were available back then too; they're not equivalent to a public server browser.
They are functionally equivalent for the player.
The problem with player-hosted servers is that it was very hard to get a fair and balanced competitive match, whereas now that's extremely common with matchmaking on servers hosted by the game company.
Back then at least you could do something about it. Now if there's an obvious cheater you just kinda sit there and take your L, and ask people to make reports.
If you were playing on a server you owned or for which you had ban permissions, you could do something about it. Otherwise, you had to hope that an admin was online to ban the cheater. If no one was around to take action, your option was to... sit there, take your L, and ask people to make reports (to the admins). You had the option to hop around between servers until you found one that didn't have cheaters, but is that all that different from just quitting back to matchmaking and hoping you find a match without cheaters?
Edit to add: I'm not disputing that kernel-level anticheat is bad; I agree that it is. I don't think it helps to try and hearken back to a golden age of PC gaming that didn't really exist. Maybe it was easier for server admins to manage because player populations were smaller back then, but that's about all that would have made things "better."
This is drudging up some formative memories. In the counter-strike / TF2 communities you'd have servers that would grant vote kick rights with more playtime and some of those regulars would then apply for mod rights. It worked quite well.
It still doesn't solve the unfair vote-kick problem. People with more playtime don't necessarily have the ability or the tools to judge whether someone is cheating.
Take a look at the Trackmania community: some cheaters are caught years later because they played it smart.
Some cheating can only be detected by looking at statistics, or with hard proof that a cheat was being run.
It's a pub. It doesn't matter as long as it's not obvious aim bots and people are having fun. Besides when it's a 32 player instant respawn death match server you have like 200-300 regulars. That type of cheating was never an issue in those because the servers were always full during peak times and everyone kinda knows each other.
They are not functionally equivalent, unless there are games I'm not familiar with where custom lobbies are published in a list for strangers to join. Normally a custom lobby implies invite only.
Not everyone is interested in a "fair and balanced competitive match" where you're guaranteed to win no more and no less than 50% of the time. I actually find that intolerably boring.
> They are not functionally equivalent, unless there are games I'm not familiar with where custom lobbies are published in a list for strangers to join.
Lots of the most-played competitive games have that, or third-party websites/Discords with links to custom lobbies.
> I have to conclude you're unfamiliar with what multiplayer gaming was like when servers were the norm.
Did you even play a single game competitively? The fact that you keep pushing for server browsers tells me no; you need communities built on something else.
You likely forgot what a hassle server browsers were, and that lots of games didn't have one.
LFG communities were important, and excluding them shows you were only playing casually and have forgotten all the problems server browsers had.
Do you even remember that you could get malware by joining servers from a server list?!
No, I used to play multiplayer games for fun, which was the norm until that option was removed and replaced with derisive "casual" and "competitive" modes.
99% of people who played CS1.x/tf/Q3A/bf1942/cod/etc booted up the game, found a server in the browser with low ping to play on, and if they liked it they favorited it. They came back the next day, and the next, and started to recognize other players. That is the server browser experience.
If you were in the tiny minority of players trying to be "competitive" back then, you're right I don't know what it looked like for you. Sounds like it sucked, honestly, and maybe competitive matchmaking solved some of those problems, but in the bargain we lost a lot of what made those games fun for "casuals" as you smugly call us.
> ...you needed to sink in a lot of time to get the few quality time you wrote about.
Sounds like you've got a skill issue. That doesn't match my experience, like, at all.
But a really, really easy shortcut was to find servers that indicated that they were furry-friendly. This all but guaranteed that
1) The folks on there would be fairly even-keeled and reasonable, and folks who weren't would be rapidly banned forever.
2) The folks on there would generally be good at the game. [0]
3) If you're lucky -and the game is one that permits custom "sprays" (as HL1 and Source engine did)- you might get to see some high-quality-but-thumbnail-sized furry porn.
[0] Seriously, at least back when both server browsers and user-hosted game servers were commonplace, I found a 1:1 correlation between "Are they a furry?" and "Are they particularly good at the game?". It was wild.
> The problem with player hosted servers is that it was very hard to get a fair and balanced competitive match
Playing against overwhelming odds has its own kind of charm. I once spent days just sabotaging the top players on some gun game servers, only winning once or twice myself. Games against friends with various fun handicaps, and flat-out abuse of any knowledge you could gain from playing against the same people repeatedly: what good is a hiding spot when everyone knows you will be there 50% of the time?
"Fair and balanced" games against completely random people are just missing something for me.
This is something matchmaking games totally miss which keeps them from being truly competitive in the way sports or old games were: a competitive community. You need other players with known identities to compare yourself against on a consistent basis.
Of course, classic competitive institutions had problems as well ("he's very competitive" is not necessarily a nice description of a person!), but they seemed more enjoyable than this matchmaking stuff.
I did indeed play in the era LanceH is talking about, and I agree with them! We had many thriving communities with no serious cheating problems because of community moderation.
Yes, there were poorly moderated servers, but you could simply leave and try a different community until you found one that clicked for you. When you require equal moderation everywhere, you throw the baby out with the bath water.
Initially, until you found the right community-run ones? I don't see the issue. Today is worse, especially when there is no server browser, just a black box that drops you into a random match.
I have no ambient lighting.
I have my window open or the CO2 level gets bad.
If I turn on lights, all the fucking insects in the forest will come into my room.
Or I can get a fresh breeze while being on my PC in the evening.
Hmm, wouldn't it sacrifice a better answer in some cases (not sure how many though)?
I'd be surprised if they hadn't specifically trained for structured "correct" output for this, in addition to picking the next token following the structure.
In my experience (I've put hundreds of billions of tokens through structured outputs over the last 18 months), I think the answer is yes, but only in edge cases.
It generally happens when the grammar is highly constrained, for example if a boolean is expected next.
If the model assigns a low probability to both true and false coming next, then the sampling strategy will pick whichever one happens to score highest. Most tokens have very similar probabilities close to 0 most of the time, and if you're picking between two of these then the result will often feel random.
It's always the result of a bad prompt, though: if you improve the prompt so that the model understands the task better, there will be a clear difference in the scores the tokens get, and the result seems less random.
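A toy sketch of the failure mode described above (not any particular library's API; the function and scores are made up): grammar-constrained decoding masks out every token the grammar forbids and picks from what remains, so when the model scores both allowed tokens near zero probability, the "winner" is effectively arbitrary.

```python
def constrained_pick(scores, allowed):
    """Pick the highest-scoring token among those the grammar allows."""
    return max(allowed, key=lambda tok: scores.get(tok, float("-inf")))

# The model "wants" to continue with prose ("The" scores highest overall),
# but the grammar demands a JSON boolean, so we choose between two tokens
# the model considers almost equally unlikely.
scores = {"The": 4.10, "true": -6.21, "false": -6.23, "maybe": 1.00}
print(constrained_pick(scores, {"true", "false"}))  # prints "true"
```

With a better prompt the gap between `true` and `false` widens, and the constrained choice stops feeling like a coin flip.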
It's not just the prompt that matters, it's also field order (and a bunch of other things).
Imagine you're asking your model to give you a list of tasks mentioned in a meeting, along with a boolean indicating whether the task is done. If you put the boolean first, the model must decide both what the task is and whether it is done at the same time. If you put the task description first, the model can separate that work into two distinct steps.
There are more tricks like this. It's really worth thinking about which calculations you delegate to the model and which you do in code, and how you integrate the two.
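A minimal sketch of the field-order point, assuming (as many structured-output implementations do) that the model emits object fields in the order the schema declares them. The schemas and field names here are hypothetical, just to illustrate the meeting-tasks example:

```python
import json

# Boolean first: the model must commit to `done` before it has even
# written out what the task is.
bool_first = {
    "type": "object",
    "properties": {
        "done": {"type": "boolean"},
        "task": {"type": "string"},
    },
}

# Task first: writing the description acts as an intermediate reasoning
# step, so `done` is decided with the task already spelled out.
task_first = {
    "type": "object",
    "properties": {
        "task": {"type": "string"},
        "done": {"type": "boolean"},
    },
}

print(json.dumps(list(task_first["properties"])))  # prints ["task", "done"]
```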
Grammars work best when aligned with the prompt. That is, if your prompt gives you the right format of answer 80% of the time, the grammar will take you to 100%. If it gives you the right answer 1% of the time, the grammar will give you syntactically correct garbage.
Sampling is already constrained with temperature, top_k, top_p, top_a, typical_p, min_p, entropy_penalty, smoothing, etc. Filtering tokens to the ones valid according to a grammar is just yet another alternative. It makes sense and can be used for producing programming-language output as well: what's the point in generating, or bothering with, output that is known up front to be invalid? Better to filter it out and allow valid completions only.
No, that's a rumor lots of people have been taking at face value.
If you do the math, inference is very lucrative.
Here, someone deployed a big model; the cost is $0.20 per 1M tokens:
https://lmsys.org/blog/2025-05-05-large-scale-ep/
The article Zitron links says Cursor has single-digit millions of cash burn with about $1B in the bank (as of August). Assuming that is true, they are losing money but have a long runway.
That article says "Anysphere runs pretty lean with around 150 employees and has a single digit monthly cash burn, a source tells me." That would be total cash burn, i.e., net losses. If their AWS bill is bigger than that it's because they are making up for part of it with revenue.
Ed's mentioned ARR in previous articles and it's not a "generally accepted accounting principle". They cherry pick the highest monthly revenue number and multiply that by 12, but that's not their actual annual revenue.
"Cherry pick the highest" is misleading. If your revenue is growing 10% a month for a year straight and is not seasonal, picking any other than the most recent month to annualize would make no sense.
If a company's revenue in January is $100 and it grows by 10% every month, the December revenue is $285. The year's revenue would be about $2,138, but ARR in December would be $3,423. That's 1.6x the actual revenue.
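The arithmetic above can be checked directly ($100 in January, 10% compounding growth each month):

```python
# Monthly revenue for Jan..Dec: 100 * 1.1^0 .. 100 * 1.1^11
monthly = [100 * 1.1**n for n in range(12)]

december = monthly[-1]   # December revenue
annual = sum(monthly)    # actual revenue for the calendar year
arr = december * 12      # ARR as computed from the December figure

print(round(december), round(annual), round(arr))  # prints 285 2138 3424
```

So annualizing the latest month of a fast-growing company yields a figure about 1.6x the trailing twelve months of actual revenue.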
ARR could be a useful tool to help predict future revenue, but why not simply report actual revenue and suggest it might increase next year? I have found most articles to be unclear about what ARR actually represents.
Why is the calendar year the relevant unit? If you insist on years, then if you consider the year from June to June, $2,138 would be misleading small.
The point of ARR is to give an up to date measure on a rapidly changing number. If you only report projected calendar year revenue, then on January 1 you switch from reporting 2025 annual revenue to 2026 projected revenue, a huge and confusing jump. Why not just report ARR every month? It's basically just a way of reporting monthly revenue — take the number you get and divide it by 12.
I am really skeptical that people are being bamboozled by this in some significant way. Zitron does far more confusing things with numbers in the name of critique.
You're correct that ARR can be both misleading and computed over any 12-month period (I just chose a calendar year to simplify), but the problem is that AI companies tend to release only their latest ARR, and only selectively, which I believe is misleading in the opposite direction.
The "annual" just means that the unit of time is a year. It doesn't mean that it is recurring annually. You can call it Annualized Monthly Recurring Revenue if it makes you feel better.
Well people like Sam Altman have not been entirely honest and there's a reason they're not sharing their actual revenue numbers. If they could show they were growing 10% every month they would.
Eh, when you have a company that’s growing, picking the highest and annualizing it is sensible. If we had a mature company with highly seasonal revenue it would be dishonest.
I mean I think there are instances where OpenAI's revenue is seasonal. Lots of students using it during the school year and cancelling it during summer.
I think you missed the forest for the trees. I am sure the student population has some dropoff during the summer months, but the point is that for businesses that are growing month over month, which most of these have been since creation, you take the highest (latest) number and annualize it.
I am also willing to bet that the student dropoff is not pronounced. I'm thinking more of a business that sells beach umbrellas: they make a lot of sales in the summer months and then next to nothing in the winter. Annualizing that would be dishonest.
I thought a human would be a considerable step up in complexity but I asked it first for a pelican[0] and then for a rat [1] to get out of the bird world and it did a great job on both.
But just for thrills I also asked for a "punk rocker"[2], and the result, while not perfect, is leaps and bounds above anything from the last generation.
0 -- ok, here's the first hurdle! It's giving me "something went wrong" when I try to get a share link on any of my artifacts. So for now it'll have to be a "trust me bro" and I'll try to edit this comment soon.
I never understood the point of the pelican-on-a-bicycle exercise:
LLM coding agents don't have any way to see the output.
It means the only thing this test is testing is the LLMs' ability to memorize.
Because it exercises thinking about a pelican riding a bike (not common) and then describing it using SVG. It's quite nice imho and seems to scale with the power of the model. I'm sure Simon has some actual reasons, though.
I wouldn't say any LLMs are good at it. But it doesn't really matter, it's not a serious thing. It's the equivalent of "hello world" - or whatever your personal "hello world" is - whenever you get your hands on a new language.
The coordinates and shapes of the elements used to form a pelican.
If you think about how LLMs ingest their data, they have no way to know how to form a pelican in SVG.
I bet their ability to form a pelican results purely from someone having already done it before.
> If you think about how LLMs ingest their data, they have no way to know how to form a pelican in SVG.
It's called generalization and yes, they do. I bet you could find plenty of examples of it working on something that truly isn't "present in the training data".
It's funny: you're so convinced that it's not possible without direct memorization, but you forgot to account for emergent behaviors (which are frankly all over the place in LLMs; where have you been?).
At any rate, the pelican thing from simonw is clearly just for fun at this point.