Doesn't the Unix Shell implementation break the "threading" constraint, in that each program in the pipeline has its own main thread, and all the programs run concurrently?
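For concreteness, here is a minimal Rust sketch of what the shell sets up (the filename is made up, and the stages could be any programs): two separate processes, each with its own main thread, connected by a pipe and running concurrently.

    use std::process::{Command, Stdio};

    fn main() -> std::io::Result<()> {
        // Equivalent of `cat input.txt | wc -w`: two processes,
        // each with its own main thread, connected by a pipe.
        let cat = Command::new("cat")
            .arg("input.txt")
            .stdout(Stdio::piped())
            .spawn()?;
        let mut wc = Command::new("wc")
            .arg("-w")
            .stdin(Stdio::from(cat.stdout.unwrap()))
            .spawn()?;
        wc.wait()?;
        Ok(())
    }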
Or should the "threading" constraint really have been stated as "you can use multiple threads, but only if they all have their own address space"?
Alternatively, given the multi-core capabilities of even budget/mobile/small systems these days (even the Raspberry Pi 4 has a quad-core CPU), isn't restricting the implementation to a single thread a weird artificial limitation in 2022? Also, a GPU-based implementation would be interesting, and GPUs get most of their power from their massively parallel core architecture.
If I were doing the benchmark, I would just run it on a VM with a single physical core. There are good reasons for comparing single-threaded performance, since it may not be necessary or desirable to scale that way. For example, suppose it's a wc API or something; limiting each request to a single thread can be better since you get fairer scheduling and less data sharing, which is almost always better for multiprocessing. Even shared immutable data can cause problems, since the CPU may not realize the data is actually immutable.
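To make that last point concrete, here's a minimal Rust sketch (the workload is made up): the payload is never mutated, but every Arc::clone still writes the shared reference count, so the cache line holding that count bounces between cores anyway.

    use std::sync::Arc;
    use std::thread;

    fn main() {
        // Immutable payload shared across threads.
        let data: Arc<Vec<u8>> = Arc::new(vec![0u8; 1 << 20]);
        let handles: Vec<_> = (0..4)
            .map(|_| {
                let data = Arc::clone(&data);
                thread::spawn(move || {
                    // Made-up per-request work: clone the shared handle
                    // the way a thread-per-request server might. Each
                    // clone/drop writes the shared refcount, causing
                    // cache-coherence traffic despite the immutable data.
                    for _ in 0..1_000_000 {
                        let tmp = Arc::clone(&data);
                        drop(tmp);
                    }
                })
            })
            .collect();
        for h in handles {
            h.join().unwrap();
        }
    }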
I agree GPUs would probably do well on this problem, though it depends on how expensive the memory copying to/from the GPU ends up being. However, if you are going for all-out performance, it seems like you could use SIMD, like memchr or something.
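As a sketch of that SIMD option, assuming the Rust memchr crate (whose scanning is SIMD-accelerated on platforms that support it), counting lines the way wc -l does:

    // Assumes memchr = "2" in Cargo.toml.
    use memchr::memchr_iter;

    // Count lines the way `wc -l` does: one per newline byte.
    fn count_lines(data: &[u8]) -> usize {
        memchr_iter(b'\n', data).count()
    }

    fn main() {
        let data = std::fs::read("input.txt").expect("read input");
        println!("{}", count_lines(&data));
    }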
> If I were doing the benchmark, I would just run it on a VM with a single physical core.
I think that's definitely an environment worth benchmarking on - but I don't think it should be the only environment to benchmark on.
Also, I don't think it's a good reason to limit implementations to a single thread, even if that is your benchmark environment. It can be worth seeing how well an implementation that's capable of taking advantage of multiple cores/CPUs does when it's only given one core to work with.
It's probably worth optimising for and benchmarking different threading strategies too - does your implementation create one thread per "work unit" and let the scheduler work them all out, or does it create a thread pool with one thread per core (or maybe one thread per core, plus one) and assign work units to each thread until they're done? And how do those strategies work if they're only given a single core?
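Here's a rough Rust sketch of the one-thread-per-core strategy (newline counting stands in for the real work; real chunk boundaries for word counting would need more care):

    use std::thread;

    // One chunk per core; each scoped thread counts its own chunk.
    fn count_newlines_parallel(data: &[u8]) -> usize {
        let cores = thread::available_parallelism()
            .map(|n| n.get())
            .unwrap_or(1);
        // Round up so every byte lands in some chunk.
        let chunk = ((data.len() + cores - 1) / cores).max(1);
        thread::scope(|s| {
            data.chunks(chunk)
                .map(|c| s.spawn(move || c.iter().filter(|&&b| b == b'\n').count()))
                .collect::<Vec<_>>()
                .into_iter()
                .map(|h| h.join().unwrap())
                .sum()
        })
    }

On a single core this degenerates gracefully: the threads just run one after another, which is part of what makes it worth benchmarking both ways.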
The single core case is definitely worth testing, but it seems odd to limit implementations to a single thread because of it. If you think you can go faster with a threaded implementation, you should be able to try that out.
> … worth seeing how well an implementation that's capable of taking advantage of multiple cores/CPUs does when it's only given one core to work with.
I did something like that: programs written for multi-core were forced onto one core, alongside programs not written for multi-core.
IIRC, that difference wasn't something anyone ever expressed interest in.