This repository was archived by the owner on Jan 27, 2019. It is now read-only.

Conversation

@Villemoes
Contributor

While working toward allowing async do_fetch and other Python functions, I've realized I'm going to need some low-level helpers (lockfiles, some signal handling), and that stuff is tricky enough that I want to be able to test it directly. Our makedirs helper also turned out to be racy, so the first test case is rather complex.

While at it, move the tiny existing self-tests to 'oe test' so that they can be run without odd Python invocations. I'm also using this opportunity to add a small performance improvement to do_fetch; being able to write a test for the new helper in just a few lines makes me much more confident that the new code really is equivalent to the old.

I also plan to add some sort of unit tests of our various task functions (for example, to allow refactoring do_split while being confident that the semantics don't change, and if they do, that it's deliberate and documented in the commit message why), but that is likely to be in the form of special-purpose recipes.

Rasmus Villemoes and others added 11 commits January 30, 2017 23:30
I've tried to add small unit tests here and there, especially when
low-level OS stuff is involved. This is an attempt at creating a place
to put all these, allowing them to be run without weird and ad hoc
command lines.

The first test case I'm adding is a bit complex, but it's the simplest I
could come up with to provoke the race in our makedirs helper. That race
may show its ugly face when we do stuff in parallel processes and be
very hard to debug ("what do you mean EEXIST, the code obviously tests
for that before calling os.makedirs!"). Before fixing any race one needs
a test case that reliably reproduces it.
The stdlib utility function os.makedirs raises EEXIST if the leaf
directory already exists. Our makedirs helper tries to avoid that by
checking for that condition upfront.

However, when doing tasks in parallel, two different processes may call
makedirs concurrently; both may initially see that the directory does
not exist, one will successfully call os.makedirs(), while the other
will get an EEXIST exception. Debugging that would be no fun, since the
code clearly checks whether the path exists... This reimplements the
helper to avoid such races.

The previous implementation would also silently return success in the
case where 'path' exists and is a regular file, which would in all
likelihood just lead to ENOTDIR later when the caller tries to create
'path + "/foo"'. It's better to fail early, and this is also consistent
with 'mkdir -p' failing if the target exists but is not a directory.
Maybe overkill, but it's nice to know what one can expect.
This gives it a better chance of actually getting run once in a while.

Should we ever go for Py3 (3.2+,
https://docs.python.org/3.2/library/subprocess.html),
oelite.signal.restore_defaults becomes redundant but harmless.
We get the sha1 or md5 digest of files in a couple of places by doing

  m = hashlib.md5()
  with open("foo") as f:
    m.update(f.read())
  h = m.hexdigest()

But that .read() call can be rather expensive if the file in question is
a 100M tarball (opencv, I'm looking at you); it temporarily bumps our
memory usage quite a lot, and it's also terribly inefficient (lots of
page faults and completely blowing the cpu cache and TLB first by
populating the gigantic buffer, then again when reading through it).

So create a generator which simply yields a given file in chunks. Then
the above would be:

  m = hashlib.md5()
  for chunk in iter_chunks("foo"):
    m.update(chunk)
  h = m.hexdigest()

but since we do that middle part a lot, create another helper that does
that and returns the passed-in hasher (we sometimes feed more than just
one file to the same hasher, so we don't finish up by calling
.hexdigest). Then the above could be simply

  h = oelite.util.hash_file(hashlib.md5(), "foo").hexdigest()
This makes signature verification (hence do_fetch) somewhat cheaper for
large ingredients tarballs (I've measured about 10x for the largest
ones).
The last calls of the write_checksum and verify_checksum methods
vanished with 6f3677d (class/fetch, lib/oelite: Improve fetch
support). That commit also introduced the cache() method, but as far as
I can tell, that has never had a caller; moreover, no fetcher has ever
implemented it, so it's not really clear what its semantics should be.
As further prep for parallel do_fetch / do_unpack, don't open-code the
old and racy version of oelite.util.makedirs.
