in #metabrainz

12:36 PM
Mineo

alastairp: if you remove images before removing containers, docker should not remove the images (and intermediate layers) that are still used by the containers, maybe that's good enough?
12:37 PM
alastairp

Mineo: oh, that's a neat idea
12:37 PM
though, I'm not sure if it'll work.
12:38 PM
imagine if I have 4 layers, the last of which churns a lot because of code files changing. we have lots of different services running tests, so ideally I want to remove images based on its name
12:38 PM
Mineo

it definitely won't work if you have images for more than one project on the docker host, yeah
12:40 PM
alastairp

so, what actually happens is we use the jenkins project name _and_ build number as part of the container/image name
12:41 PM
so I think I can say "all images whose name includes the project name, and are older than the image that I just built now"
12:41 PM
because we'll actually keep the labels for previous images
12:41 PM
yvanzo

maybe that would be more clear with examples (?)
12:42 PM
alastairp

yes, I'm just running some experiments now have clear what happens
12:48 PM
https://www.irccloud.com/pastebin/jz1NVkNF/
12:49 PM
https://www.irccloud.com/pastebin/aLTNMTtJ/
12:51 PM
see that we have 2 labeled images here from build1 and build2. when build2 runs, we can use it to remove all images that match the reference and are older than build2. When build3 runs, it will remove build2. This will untag the image, we will still need to run a periodic image prune to remove dangling intermediate images
12:51 PM
the filter=reference= could also be done with a label, we don't have labels in most of the python apps, but it seems like a good idea to copy what yvanzo does in some dockerfiles
12:52 PM
yvanzo

alastairp: IMHO, docker images should have labels that allow filtering them, see counter-example: docker inspect listenbrainzunittestjenkinsbuildjenkinslistenbrainzunit259_listenbrainz
12:52 PM
alastairp

yes, I agree that adding labels here will make it a lot easier to filter
12:55 PM
http://livegrep.metabrainz.org/search/livegrep?...
12:57 PM
reosarevok

yvanzo: what's the point of https://github.com/metabrainz/musicbrainz-serve... ? :)
12:59 PM
yvanzo

reosarevok: it is self-explanatory :)
12:59 PM
reosarevok

I mean, sure, but was it not needed before and now it is? I'm just confused about why :D
13:00 PM
alastairp

maybe he wanted to re-trigger testing?
13:00 PM
yvanzo

sorry, yes, I just triggered a build.
13:01 PM
alastairp

yvanzo: "@brainzbot retest this please" will do the same for jenkins, but I don't know about circleci
13:01 PM
yvanzo

that is right, but I keep forgetting the exact syntax of this command
13:01 PM
alastairp

me too. there are many commands that I only remember by looking in jenkins config. maybe we should document them somewhere
13:08 PM
Darkloke has quit
13:11 PM
yvanzo: do you have specific labels when building MBS for development?
13:11 PM
yvanzo

alastairp: does listenbrainz really need that many images for each build?
13:11 PM
alastairp

are you asking about jenkins?
13:11 PM
yvanzo

alastairp: we have only one base image to run all tests for musicbrainz-server.
13:12 PM
currently musicbrainz-tests:v-2020-11
13:12 PM
alastairp

there are a few things. one is that we currently run 3 separate jenkins jobs for unit tests, integration tests, and js test. I'm working on combining those
13:13 PM
zas

yvanzo: test-keydb-A and test-keydb-B are test containers I made to evaluate keydb (replacement for redis), please keep for now
13:14 PM
alastairp

within a test we may have multiple services (although I see in https://github.com/metabrainz/listenbrainz-serv... we appear to reuse the same image in both services, so that's fine)
13:15 PM
how does your musicbrainz-tests:v-2020-11 image work? Is that a single image built in november to run all subsequent tests?
13:15 PM
what is/isn't in that image?
13:15 PM
yvanzo

alastairp: for mbs tests, we have one job in jenkins and one in circleci, services are not ran using separate images/containers.
13:16 PM
alastairp: this image contains dependencies and services
13:23 PM
alastairp

so what happens if you change dependencies? Do you have to add a new test image?
13:23 PM
how do you include updated sourcecode into the image? Do you build a new one?
13:25 PM
yvanzo

Dependencies are just pre-installed, but they are updated by each build.
13:27 PM
alastairp

are the updated during a docker build stage, or in the container before tests are started?
13:27 PM
yvanzo

We need to build a new image either when dependencies cannot be updated: breaking changes in dependencies installation method, or breaking change in services dependencies.
13:30 PM
alastairp: in the container
13:31 PM
alastairp: sourcecode is copied to the container too
13:32 PM
alastairp

ah, I see now. so in https://github.com/metabrainz/musicbrainz-serve... you specify the base image to run, and circleci runs that image and then copies the code into the container, and then runs the `steps` ?
13:32 PM
yvanzo

yes, it's the same for jenkins
13:33 PM
alastairp

this is very different to the docker-compose-based system that we currently have. In my view it adds significant complexity for little value
13:34 PM
I think that it's still best to continue using our current system but ensure that we delete images after each build
13:35 PM
yvanzo

The musicbrainz-tests approach is more lightweight but not necessarily as flexible as using docker-compose.
13:36 PM
alastairp: at least that explains why we don't face the same issues regarding jenkins :)
13:36 PM
alastairp

yes, absolutely
13:36 PM
OK, I am going to add labels to ListenBrainz for testing
13:37 PM
yvanzo

+1
13:37 PM
alastairp

however, during development we shouldn't update these labels each time a user builds, because if the build date variable changes it will invalidate the entire cache
13:37 PM
does that make sense to you?
13:43 PM
yvanzo

do you mean creating a LABEL named BUILD_DATE?
13:43 PM
alastairp

yes
13:43 PM
yvanzo

It's not needed as this is already part of image's metadata.
13:43 PM
alastairp

ah. so perhaps https://github.com/metabrainz/docker-irccat/blo... should be updated?
13:44 PM
yvanzo

right
13:44 PM
alastairp

ok, thanks. how about VCS_REF?
13:45 PM
sumedh_p joined the channel
13:45 PM
yvanzo

hmm, not really
13:45 PM
sumedh has quit
13:46 PM
alastairp

the main point here is that local developers use the same Dockerfile that we use for building the production image
13:46 PM
yvanzo

BUILD_DATE is unneeded for irccat but it can be useful when tagging
13:46 PM
I think that was a lazy solution.
13:47 PM
Build date can be extracted from image as in https://github.com/metabrainz/docker-python/blo...
13:49 PM
VCS_REF seems to be more useful at least :)
13:49 PM
bitmap

reosarevok: yvanzo: orderingattribute ultimately wasn't intended to be searchable, but I think was left in on accident or in case it was used in the future
13:50 PM
alastairp

yvanzo: is the only way to view labels to use `docker inspect` from the host?
13:50 PM
can a process in a container see the label?
13:50 PM
bitmap

originally series were designed to support different ordering attributes but we decided to just have one after the schema change iirc
13:51 PM
(or maybe before but it was too late to change the sql)
13:51 PM
https://github.com/metabrainz/musicbrainz-serve...
13:52 PM
yvanzo

alastairp: not sure, I think host is the only way
13:52 PM
alastairp

yvanzo: for example we do this in LB push.sh: https://github.com/metabrainz/listenbrainz-serv...
13:52 PM
this allows us to report the git hash from the app itself
13:53 PM
yvanzo

bitmap: it always looked weird to me to have a field name 'number' that actually is free text.
13:53 PM
bitmap

see https://chatlogs.metabrainz.org/brainzbot/music... too
13:54 PM
well it was intended to have its meaning determined by the series type, I guess, so it needed a general name
13:55 PM
and usually it's a volume number, catalog number, or part number...
13:56 PM
yvanzo

alastairp: it would require to break out of the containement
13:56 PM
bitmap

I do think adding a searchable 'number' field makes sense, just not sure about reusing orderingattribute for that. afaict it wasn't intended to be added or at least not publicly documented
13:57 PM
alastairp

yvanzo: yes, I thought so. no problem then
13:57 PM
I have a working solution now, based on your and Mineo's suggestions. just preparing PR
13:57 PM
yvanzo

bitmap: do you mean a list of "numbers" for items of this series?
13:58 PM
"orderingnumbers" maybe?
13:58 PM
bitmap

yes
13:59 PM
yvanzo

Alright, would it be fine to drop the useless (and unused) orderingattribute for now?
14:00 PM
bitmap

that sounds fine to me
14:01 PM
yvanzo

Ok, thanks, I understand the situation better now. :)
14:02 PM
alastairp: but you could create an ENV variable similar to these labels.
14:02 PM
it would be available from containers.
14:03 PM
alastairp

yes, either an ENV or the file that we add. you're right
14:16 PM
BrainzGit

[listenbrainz-server] MonkeyDo opened pull request #1255 (master…jenkins-js-reporter): Use checkstyle eslint reporter format for Jenkins https://github.com/metabrainz/listenbrainz-serv...
14:17 PM
Mr_Monkey

alastairp: drumroll ^
14:20 PM
Yeah, the report looks really nice !
14:20 PM
alastairp

https://ci.metabrainz.org/job/listenbrainz-js/2...
14:20 PM
great!
14:20 PM
Mr_Monkey

(on top of reporting errors/warnings separately)
14:20 PM
alastairp

I'll slowly update other jobs to use warnings-ngs instead of regular old warnings too!
14:22 PM
so now we need to decide what difference there is between stable, unstable, and failed:
14:22 PM
https://usercontent.irccloud-cdn.com/file/6PtQO...
14:23 PM
Mr_Monkey

I think we'll be happy with 1 error = fail
14:23 PM
alastairp

so we can differentiate between failed (tests fail) and unstable (tests pass, but there are 2 warnings)
14:23 PM
right, so errors -> fail, warnings -> unstable ?
14:23 PM
or warnings -> pass ?
14:23 PM
Mr_Monkey

And unstable if there are +5 new warnings?
14:24 PM
How about under 5 new warnings, pass, over that mark as unstable. Any error = fail.
14:24 PM
alastairp

https://usercontent.irccloud-cdn.com/file/c4Roa...
14:24 PM
oh yeah, cool. you can choose based on total number, or new ones
14:25 PM
OK, I tried to configure that.
14:25 PM
re-running tests
14:28 PM
[ESlint] Creating SCM blamer to obtain author and commit information for affected files
14:28 PM
[ESlint] -> No blamer installed yet. You need to install the 'git-forensics' plugin to enable blaming for Git.
14:28 PM
that looks scary
14:29 PM
Mr_Monkey

Huh
14:29 PM
alastairp

Mr_Monkey: cool! the test has failed, because there are errors
14:29 PM
so I guess it can say "error introduced by alastairp"
14:29 PM
Mr_Monkey

Great
14:29 PM
alastairp

can you fix those 10 errors?
14:29 PM
Mr_Monkey

uh, "introduced by alastairp"?
14:29 PM
Yeah, that's the next step. I'll do it in the same PR I suppose
14:30 PM
alastairp

yeah, because then we should see the state of that check change from failed to passed
14:30 PM
-> class for the afternoon. have a good day
14:32 PM
Mr_Monkey

Thanks for setting that up alastairp ! g'luck !
14:34 PM
zas

bitmap, yvanzo: can you have a look at trille, something is creating load spikes, badly impacting mb containers and rabbitmq performance, unsure what to do. I see critiquebrainz runs there too.
14:34 PM
this is why we often get response times alerts for trille
14:36 PM
basically users served by trille got very slow responses if they are unlucky enough to hit a load spike
14:36 PM
d4rkie joined the channel
14:37 PM
there's also critiquebrainz there
14:37 PM
yvanzo

and mbspotify
14:39 PM
D4RK-PH0_ has quit
14:43 PM
BrainzGit

[listenbrainz-server] alastair opened pull request #1256 (master…jenkins-image-cleanup): Jenkins image cleanup https://github.com/metabrainz/listenbrainz-serv...
15:12 PM
yvanzo

alastairp: critiquebrainz-prod logs are probably too large: 100MB in just 24h
15:12 PM
alastairp

ok thank. possibly related to transient errors during yesterday's outage. I can put this on my list to check tomorrow
15:13 PM
sorry, no time this afternoon
15:13 PM
Mr_Monkey

alastairp: Now without errors but with warnings, the build correctly end in an unstable state. However on the PR on GH that seems to be interpreted as a failed check.Not sure if there's a config change needed on Jenkins to send a non-failure status for unstable builds;
15:13 PM
Alternatively we could just want to pass the builds in case of only ESLint warnings.
15:13 PM
(No rush of course)
15:13 PM
sumedh_p has quit
15:13 PM
alastairp

Mr_Monkey: interesting. I don't know if github has the concept of 3 states
15:13 PM
Mr_Monkey

I don't recall ever seeing that (other than "pending" as a transient state)
15:16 PM
alastairp

it looks like there is an option in the github runner to tell it to treat unstable as success:
15:16 PM
https://usercontent.irccloud-cdn.com/file/SU0uO...
15:16 PM
Mr_Monkey

Oooh, juicy :)
15:16 PM
Let's do that then.
15:17 PM
please-and-thank-you