They're using it for data transformation.They're long time dbt users, but are switching to SQLMesh because it's extremely efficient, provides a better development experience, and can help them become warehouse agnostic.
> In short, the main reasons why MPIRE is faster are:
When fork is available we can make use of copy-on-write shared objects, which reduces the need to copy objects that need to be shared over child processes
Workers can hold state over multiple tasks. Therefore you can choose to load a big file or send resources over only once per worker
Automatic task chunking
Yea, I am a struggling to figure out what the secret sauce of this library and if that sauce is introducing foot guns down the line.
Multiprocessing std uses fork in linux distros already. I once ran a multiprocess code on Linux and Windows and there was a significant improvement in performance when running Linux.
There's a lot of hype with ruff, but I've been doing fine with black and autoflake. I have a pretty sizeable project and have never thought to myself it's problematic because it's slow.
Monorepos at scale only really works if you have the tooling and infra to support it. Otherwise, you're going to be miserable with slow builds, pushes, pulls, impossible merge conflicts.
You can read some case studies here https://tobikodata.com/harness.html or join Slack to meet with folks to learn more about their experiences.