An important thing omitted in this post, which makes work-stealing less attracti...

CyberDildonics · on Oct 6, 2023

If a CPU is being cooled enough to not throttle, it is much more time and energy efficient to use all the cores you can rather than have another core run at a slightly higher frequency.

Higher frequencies have diminishing returns and exponential heat loss.

You might as well work-steal across machines at that point

Shared memory is extremely fast, it crushes using local loopback networking, let alone using actual networking.

jeffbee · on Oct 6, 2023

You can practice energy-aware scheduling at higher levels, too. If you have to send an RPC and you can choose between multiple peers, choose the one with the coldest CPU temperature.