• 0 Posts
  • 14 Comments
Joined 2 years ago
cake
Cake day: June 12th, 2023

help-circle







  • Because the kernel doesn’t like you spawning 100k threads.

    Why do you say this?

    Your RAM doesn’t, either

    Not if your stacks per thread are small.

    Even all the stacks aside, the kernel needs to record everything in data structures which now are bigger and need longer to traverse.

    These data structures must exist either in userland or the kernel. Moving them to the kernel won’t help anything. Also, many of these data structures scale at log(n). Splitting have the elements to userland and keeping the other half gives you two structures with log(n/2) so 2log(n/2) = log(n^2/4). Clearly that’s worse.

    Each thread is a process which could e.g. be sent a signal, requiring keeping stuff around that rust definitely doesn’t keep around (async functions get compiled to tight state machines).

    If signals were the reason async worked better, then the correct solution is to enable threads that opt-out of signals. Anything that slows down threads that isn’t present in an async design should be opt-out-able. The state-machines that async compiles to, do not appear inherently superior to multiple less stateful threads managed by a fast scheduler.

    Specifically with io_uring: You can fire off quite a number of requests, not incurring a context switch …

    As described here you would still need to do a switch to kernel mode and back for the syscalls. The extra work required from assuming processes are hostile to each other should be easy to avoid among threads known to have a common process as they are obviously not hostile to each other and share memory space anyway. The synchronization required to handle multiple tasks should be the same regardless if they are being run on the same thread by a user land scheduler or if they are running on multiple threads with an os scheduler.

    Anyhow, your mode of inquiry is fundamentally wrong in the first place: …

    I’m not interested in saying that async is the best because it appears to work well currently. That’s not the right way to decide the future of how to do things. That’s just a statement of how things are. I agree, if your only goal is get the fastest thing now with no critical thought, then it does appear that async is faster. I am unconvinced it must fundamentally be the case.





  • MantisWaffle@lemmy.worldtoProgrammer Humor@lemmy.mlSTOP DOING ASYNC
    link
    fedilink
    arrow-up
    1
    arrow-down
    3
    ·
    edit-2
    11 months ago

    The only way I have heard threads are expensive, in the context of handling many io requests, is stack usage. You can tell the os to give less memory (statically determined stack size) to the thread when it’s spawned, so this is not a fundamental issue to threads.

    Go ahead and spin up a web worker and transfer a bunch of data to it and tell us how long you had to wait.

    Time to transfer data to one thread is related to io speed. Why would this have anything to do with concurrency model?