
Chroot builds are slow #179

Closed · edolstra opened this issue Dec 2, 2013 · 100 comments
@edolstra (Member) commented Dec 2, 2013

Chroot builds have a significant overhead. For instance, this expression:

with import <nixpkgs> {};
with lib;
let deps = map (n: runCommand "depM-${toString n}" {} "touch $out") (range 1 100);
in runCommand "foo" { x = deps; } "touch $out"

(i.e. a trivial build with 100 trivial dependencies) takes 4.7s to build on my laptop without chroot, but 39.6s with chroot.
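A comparison like this can be reproduced roughly as follows. This is only a sketch: chroot-test.nix is a hypothetical file name for the expression above, and it assumes a Nix version where the setting is still called build-use-chroot (newer releases renamed it to sandbox); if you build through the nix-daemon, the option may need to go into nix.conf instead of on the command line.

time nix-build chroot-test.nix --option build-use-chroot false
rm result && nix-collect-garbage   # drop the outputs so the next run rebuilds everything
time nix-build chroot-test.nix --option build-use-chroot true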

The main overhead seems to be in setting up the chroot (i.e. all the bind mounts), but the various CLONE_* flags also have significant overhead.

Unfortunately, there is not much we can do about this since it's all in the kernel, but it does mean we can't enable chroot builds by default on NixOS.

This is on Linux 3.4.70.
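To make the cost concrete, the per-build setup is roughly equivalent to the shell below. This is a simplified sketch, not Nix's actual code: the store paths, chroot directory and builder script are placeholders, and the real implementation calls clone(2) and mount(2) directly (as root) rather than going through the unshare, mount and chroot utilities.

unshare --mount --uts --ipc --net --pid --fork /bin/sh -e -c '
  chroot_dir=/tmp/nix-build-example/chroot                   # placeholder chroot location
  for p in /nix/store/aaaa-dep-1 /nix/store/bbbb-dep-2; do   # one bind mount per input store path
    mkdir -p "$chroot_dir$p"
    mount --bind "$p" "$chroot_dir$p"
  done
  mkdir -p "$chroot_dir/proc"
  mount -t proc proc "$chroot_dir/proc"
  exec chroot "$chroot_dir" /bin/sh /build/builder.sh        # placeholder builder script
'

With 100 dependencies that is well over a hundred serialized mount(2) calls plus the namespace creation, repeated for every single derivation.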

@vcunat (Member) commented Dec 2, 2013

Oh, not good. Are there any other standard sandboxing options (besides LD_PRELOADing some libc hooks)?

@edolstra (Member, Author) commented Dec 2, 2013

I can imagine a cheaper chroot that just bind-mounts the entire Nix store. Maybe we could even put an ACL on /nix/store to deny "r" but not "x" permission to nixbld users. That way builds can only access store paths that they already know.

Also, maybe it's faster on newer kernels.
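A sketch of that ACL idea, assuming the nixbld group that NixOS conventionally uses for build users (this only illustrates the intent; it is not something Nix does today):

setfacl -m g:nixbld:x /nix/store   # members of nixbld may traverse (x) the store but not list (r) it
getfacl /nix/store                 # inspect the resulting ACL

Since a matching named-group ACL entry is evaluated before the "other" permission bits, nixbld users should end up with traversal but no listing even though /nix/store stays world-readable for everyone else.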

@vcunat (Member) commented Dec 2, 2013

Well, I don't think packages try to find things by listing /nix/store. In general, maybe we could deny "r" on it for everyone, but I fail to see any significant gain.

Maybe providing some cheap variant of chroot by default could be a good compromise (with the possibility of switching to stronger guarantees).

@peti (Member) commented Dec 16, 2013

The benefits of chrooted builds far outweigh the performance cost. Chroot builds should definitely be enabled by default on NixOS!

@alexanderkjeldaas commented

Are the bind mounts done in parallel?

@edolstra (Member, Author) commented Mar 5, 2014

No.

@domenkozar (Member) commented

@edolstra I'd still prefer purity/determinism over performance and enable chroots on Linux by default.

@domenkozar (Member) commented

On my machine (SSD, running kernel 3.14):

real 0m27.129s
user 0m0.139s
sys 0m0.038s

@vcunat (Member) commented Oct 30, 2014

@iElectric: are you sure your measurement is correct? It shows mostly waiting and no real work. Or is that because the work is in fact done in another process?

@wmertens (Contributor) commented

👍 for a mini chroot that has all of the Nix store. This could be reused, no? Same chroot for all builds?

@domenkozar (Member) commented

@vcunat I'd say most of the time is spent waiting on nix-daemon I/O.

@aristidb (Contributor) commented Jan 7, 2015

Computers have become faster in the past 2 years. We should re-evaluate whether the speed is really worth the significant impurities.

Note that the default NixOS Hydra not using chroot leads to packages "randomly" failing to build locally for those who do use it.

So at least Hydra should enable it.

@edolstra (Member, Author) commented Jan 7, 2015

Hydra does use it. It's the other way around: users like me might not have it enabled and think that a package builds properly when it doesn't. (Happened today with a PHP update, which turned out to do a download during its build.)

@domenkozar (Member) commented

Yes, setting aside our determinism promise for the sake of some small overhead.

@benley (Member) commented Jan 7, 2015

When nix sets up chroots, is most of the time spent setting up bind mounts? Or does it do a lot of file copying too? If the latter, have you considered using something like Cowdancer (http://www.netfort.gr.jp/~dancer/software/cowdancer.html.en) to get copy-on-write bind mounts? It's low-overhead and fast to set up. Debian uses it in cowbuilder/pbuilder, which makes for an excellent ephemeral-chrooted build system.

@vcunat (Member) commented Jan 7, 2015

@benley: COW isn't needed, as everything accessible in the chroot is read-only anyway. From the comments it seems no one has analyzed precisely what the main cost is, but the bind mounts are suspected (and they probably were never meant to be used this massively).

@copumpkin (Member) commented

Has anyone looked into proot for this purpose?

@edolstra (Member, Author) commented

"PRoot uses the CPU emulator QEMU user-mode to execute transparently guest programs." I doubt that's faster than bind mounts :-)

@vcunat (Member) commented Jan 19, 2015

Yeah, PRoot might be faster to set up, but it sounds like it would be significantly slower for longer builds (which happen a lot). Various other preloading solutions might also slow down system calls, although probably not as much.

@copumpkin (Member) commented

@edolstra oh sorry, my understanding was that it only used QEMU when the guest was of a different architecture

@benley (Member) commented Jan 19, 2015

I believe proot only uses qemu when it's running binaries from a non-native architecture. The proot website is fairly clear about that, unless I'm badly misinterpreting it: http://proot.me/

@benley (Member) commented Jan 19, 2015

It does still intercept system calls in userland, and it's going to have some unavoidable speed overhead.

@alexanderkjeldaas commented

Isn't it documented to use ptrace? If so, it will signal the controlling process and wait for a command on every syscall that is intercepted.


@wmertens (Contributor) commented

My sophisticated web searches (i.e. "proot benchmark") didn't turn up anything. Has anybody tried it yet?


@cedric-vincent commented

Hello all,

I confirm that PRoot uses QEMU to run non-native binaries only, and that it is currently based on ptrace, which is known to cause a significant slowdown. However, in order to reduce this slowdown as much as possible, PRoot uses process_vm_{readv/writev} (available on Linux 3.2+) and seccomp mode 2 (available on Linux 3.5+). For reference, here are the figures I published when I enabled seccomp mode 2 in PRoot:

https://github.com/cedric-vincent/PRoot/blob/v5.1.0/doc/proot/changelog.txt#L510

My suggestion is to give PRoot a try if your kernel version is 3.5 or greater, and if it's not too difficult to replace the calls to "chroot" and "mount --bind" in your scripts with a call to "proot". If PRoot is not fast enough, that will likely be fixed in the future using kernel namespaces (available on Linux 3.8+).

Regards,
Cedric.
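For anyone who wants to try that substitution, it looks roughly like the sketch below; the chroot directory and builder script are made-up placeholders. PRoot's -r sets the guest root and -b adds a binding, its counterparts to chroot and mount --bind.

# privileged chroot plus bind mount:
mount --bind /nix/store "$chroot_dir/nix/store"
chroot "$chroot_dir" /bin/sh /build/builder.sh

# unprivileged PRoot equivalent, no mount namespace or root required:
proot -r "$chroot_dir" -b /nix/store:/nix/store /bin/sh /build/builder.sh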

@Ericson2314 (Member) commented

Seems like using Linux namespaces would dovetail with the pure Darwin stdenv work. All the better if they are faster than chroots.

@copumpkin (Member) commented

They actually already use Linux namespaces. chroot is a bad name for them.

@benley (Member) commented Sep 29, 2015

Heh, in that case NixOS should call them Containers and pick up some buzzword publicity points. "Build all the things in containers!" containers containers containers containers containers. ;-)

@zimbatm (Member) commented Jun 15, 2018

It would be good to look at how Bazel does it as they are facing similar problems.

@nh2 (Contributor) commented Jun 15, 2018

Thanks for the explanations!

Maybe we should use a selective approach until Linux namespaces are very fast. While sandboxing every derivation is certainly desirable, it would already be a huge benefit if we could, for starters, sandbox "the average build" of nixpkgs libraries and applications. For example, I'd be very happy to pay a 24 ms overhead if in turn my 5-hour Chromium build is guaranteed to be pure. But right now it's full sandboxing or no sandboxing.

Another point: the nsenter benchmark at #179 (comment) measures a 4 ms mean time. However, that is already so close to Linux's process startup overhead that the number is probably not very meaningful. For example, just printing the help text with time nsenter --help > /dev/null takes anything between 1 and 4 ms on my computer.

We should probably benchmark whatever nsenter does in a loop in C to get meaningful numbers for that.
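Short of writing that C loop, a rough way to cancel out most of the exec noise is to time many iterations with and without the namespace setup and compare the totals. A sketch only: the -U user namespace is there just so the command runs unprivileged (Nix itself does not do this), and the flag set mirrors what unshare(1) offers rather than the exact clone flags Nix passes.

time for i in $(seq 1000); do env true; done                            # baseline: fork + exec only
time for i in $(seq 1000); do unshare -U -m -u -i -n -p -f true; done   # same, plus namespace creation

Dividing the difference between the two totals by 1000 gives a per-invocation estimate of the namespace cost with most of the process-startup overhead subtracted out.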

@ryantrinkle commented

FWIW, here are the results on my machine (the same one I used for the prior benchmarks), for nsenter --help >/dev/null:

» nix run -f channel:nixos-unstable bench -c bench "nsenter --help >/dev/null" -o unshare.html 
[4 copied (3.8 MiB), 11.5 MiB DL]
benchmarking nsenter --help >/dev/null
time                 3.108 ms   (3.088 ms .. 3.126 ms)
                     0.999 R²   (0.999 R² .. 1.000 R²)
mean                 3.222 ms   (3.188 ms .. 3.289 ms)
std dev              161.1 μs   (93.24 μs .. 275.8 μs)
variance introduced by outliers: 31% (moderately inflated)

And here it is for true:

benchmarking true
time                 2.470 ms   (2.452 ms .. 2.492 ms)
                     0.999 R²   (0.999 R² .. 1.000 R²)
mean                 2.456 ms   (2.445 ms .. 2.469 ms)
std dev              37.67 μs   (31.60 μs .. 46.29 μs)

So sandboxing is about an order of magnitude slower than running a minimal command. I definitely agree that this amount of time is not important for most use cases today.

@zimbatm (Member) commented Jun 16, 2018

And building a stdenv.mkDerivation is also going to execute bash, which stat(2)s rc and profile files all over the place.

@edolstra (Member, Author) commented Aug 2, 2018

@zimbatm Yes, but Nix does not require the use of stdenv.mkDerivation.

BTW on Linux 4.17 I get a 37% slowdown in the test mentioned in #179 (comment). That's a big improvement over the 742% slowdown in 2013...

@copumpkin (Member) commented

Any idea of a good threshold for acceptable? I doubt it'll ever be zero-cost, but purity-by-default is a big win IMO and I'd be willing to pay a slight cost for it. Especially since the benchmark you're citing mostly affects tiny derivations, not big builds. Even one smallish build will completely eclipse a ton of small slowdowns on unit files and NixOS configuration files.

@zimbatm (Member) commented Aug 3, 2018

Having sandboxing turned on by default would be great. It would reduce the number of nixpkgs submissions that don't compile and the number of user reports, and we'd be able to trim the PR and issue templates. That being said, if Nix is running inside a Docker container it won't work, as Docker containers don't support cgroups by default.

Back on the subject of sandboxing, is it possible to re-use sandboxes between runs? If sandboxes could be re-used, they could also be pooled, with the pool size equal to maxJobs.

@Ericson2314 (Member) commented

We must be sound now. We must compete with the likes of Bazel on granularity soon. That's how I see it.

@edolstra (Member, Author) commented Aug 3, 2018

@copumpkin I think the 37% slowdown is okay-ish, though obviously not ideal.

@zimbatm No, I don't think sandboxes can be reused. The main overhead seems to be setting up the mount namespace, which is necessarily different for each build (since they have different input closures). Of course, you could bind-mount all of /nix/store, but that reduces security a bit (since even if it has -wx permission, builders would be able to access path contents if they know the store path).

@7c6f434c (Member) commented Aug 3, 2018 via email

@ryantrinkle commented

@7c6f434c Good question! I think we would need to benchmark bind mounting to see.

@zimbatm (Member) commented Aug 3, 2018

Another motivation to enforce sandboxing is that we could get rid of the nixbld\d+ users. Each sandbox gets its own PID namespace, so they could all run with the same UID/GID. That would be great for limiting the footprint Nix has on non-NixOS systems.

@7c6f434c (Member) commented Aug 3, 2018 via email

@edolstra (Member, Author) commented Aug 3, 2018

In ad1c827 I implemented automatic UID allocation from a range. You would still want to ensure that the UID range doesn't clash with any existing accounts, though it's unlikely people have UIDs in the range 872415232+...

@7c6f434c (Member) commented Aug 4, 2018

Well, if the range is configurable it should be easy to move outside the ranges used by other tools; definitely simpler than listing eight build users in global passwd. Thanks.

@domenkozar (Member) commented

@edolstra I wanted to implement sandboxing to be on for Docker after @garbas's talk, but really that road leads back to Nix doing it by default for an overall good experience. Given that we're at the okay-ish threshold now, and kernel 4.19 (the next LTS) has been released, can we turn sandboxing on by default? :)
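For anyone who doesn't want to wait for the default to change, sandboxing can already be enabled per machine. A sketch for a non-NixOS install with a systemd-managed daemon; it assumes Nix 2.x, where the nix.conf setting is called sandbox (older releases call it build-use-sandbox or build-use-chroot), and on NixOS the nix.useSandbox module option does the same thing.

echo "sandbox = true" | sudo tee -a /etc/nix/nix.conf   # enable sandboxed builds
sudo systemctl restart nix-daemon                       # make the daemon pick up the new setting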

@copumpkin (Member) commented

Why Docker? I missed the talk, but intuitively it feels like a step backwards.

@domenkozar (Member) commented

@copumpkin just to enable sandboxing for https://github.com/NixOS/docker, since it helps sandbox networking during Nix builds.

@zimbatm (Member) commented Nov 2, 2018

Does the Nix sandboxing work inside of Docker now?

@dtzWill (Member) commented Nov 3, 2018 via email

@copumpkin (Member) commented

Oh, I misread and thought you wanted to change our sandboxing mechanism to use Docker, rather than get Docker to work from inside one of our sandboxes 😄 sorry!

@dtzWill (Member) commented Nov 3, 2018 via email

@copumpkin (Member) commented

oh, I see, thanks!

@domenkozar (Member) commented

🎉

@Ericson2314 (Member) commented

#2759 is a wildly different idea for maybe-even-faster sandboxing.

@maisiliym commented

The solution to this is eBPF-based sandboxing, which would be essentially "free" as long as the builder doesn't "try anything funny".
