#metabrainz

      • ruaok
        build fixed?
      • Mr_Monkey
        ruaok: Yes. I just pushed an updated and rebuilt image of PR 1390 for test.lb if you want to deploy it.
      • I'm now looking at front-end tests that I borked during a merge.
      • alastairp
        _lucifer: yeah, look at every other dockerfile in metabrainz that uses a pre-defined ARG in a LABEL, and you'll see that yvanzo follows this exact pattern
      • it's... weird. but apparently the way that it has to work
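        (For illustration, the pattern in question looks roughly like this; the image name and label key here are assumptions, not copied from a real metabrainz dockerfile:)

            ARG PYTHON_BASE_IMAGE_VERSION=3.8
            FROM metabrainz/python:$PYTHON_BASE_IMAGE_VERSION
            # an ARG declared before FROM goes out of scope at the FROM line,
            # so it must be re-declared inside the stage before a LABEL can use it
            ARG PYTHON_BASE_IMAGE_VERSION
            LABEL org.metabrainz.based-on-image="metabrainz/python:$PYTHON_BASE_IMAGE_VERSION"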
      • oh, did _lucifer fix the babel thing??!?
      • !m _lucifer (and Mr_Monkey, I guess)
      • BrainzBot
        You're doing good work, _lucifer (and Mr_Monkey, I guess)!
      • Mr_Monkey
        I fixed it in another way that added a drawback, so not as good :p
      • alastairp
        weiiird
      • rebasing and testing 1424, then
      • Mr_Monkey: oh, I ran into something testing on my other computer. new docker uses this thing called "buildkit", and it doesn't leave behind intermediate images like old "docker build" does, so this issue that we had about having old LABELs that don't make sense isn't an issue anymore
      • because those intermediate layers appear to no longer exist
      • Mr_Monkey
        Huh
      • But there are no ill effects to setting it to blank?
      • alastairp
        no. I've left the "setting it to blank" in place until I can work out how buildkit fits into the grand scheme of things
      • It'll probably become a default in future versions of docker, but until then let's keep the workaround in place
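        (For example, with an illustrative tag; DOCKER_BUILDKIT=1 opts in on docker versions where buildkit is not yet the default:)

            DOCKER_BUILDKIT=1 docker build -t listenbrainz:test .
            # the classic builder leaves intermediate <none>:<none> images visible here;
            # under buildkit the list stays clean
            docker images -a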
      • BrainzGit
        [listenbrainz-server] alastair merged pull request #1424 (master…add-git-hash-label-last): Don't add GIT_COMMIT_SHA build to the label at the beginning of the build https://github.com/metabrainz/listenbrainz-serv...
      • alastairp
        _lucifer: ^ fixed - can you please add the tricks for the duplicate ARG to the spark PR?
      • there's now a conflict in https://github.com/metabrainz/listenbrainz-serv..., I'm fixing it
      • ShivamAwasthi joined the channel
      • _lucifer
        alastairp: thanks! i'll add the label thing to spark PR as well
      • BrainzGit
        [listenbrainz-server] alastair merged pull request #1430 (master…gh-prod): LB-882: Build production image in Github actions https://github.com/metabrainz/listenbrainz-serv...
      • ruaok
        PR 1390 is built with the 'test' tag, Mr_Monkey?
      • Mr_Monkey
        Correct
      • ruaok
        thx!
      • ShivamAwasthi has quit
      • alastairp: you about?
      • _lucifer
        ruaok: alastairp: can you please take a look at this script to copy emails from MB db to LB? https://github.com/metabrainz/listenbrainz-serv...
      • alastairp
        _lucifer: in what context will this be used? daily updates? one-off? is it associated with a PR?
      • ruaok: hi
      • _lucifer
        alastairp: this will be run by the cron job to update users' emails daily
      • ruaok
        looks good. but, I do wonder if this should run in a transaction.
      • in case something biffs, you can try again without making a mess.
      • alastairp
        do we think it's OK to do it in bulk on all users each time? Or only the users whose email address has changed?
      • ruaok
        ok today, in a few years? prolly not.
      • but if this script goes away once we have centralized meb accounts, then this ought to be fine.
      • alastairp
        good point
      • _lucifer
        yeah, this should go away at that time.
      • ruaok
        add a transaction, then I'd be happy in that case.
      • alastairp
        _lucifer: you don't need ; to terminate sql statements in python
      • _lucifer
        alastairp, yeah i know. i added it to separate the quotes.
      • alastairp
        ah, right. maybe a space?
      • or quote the string with ''
      • _lucifer
        sure, will do that.
      • ruaok
        I have a puzzle for you two, _lucifer & alastairp.
      • alastairp
        I've not used UPDATE .. SET FROM before
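        (A rough sketch of UPDATE ... FROM wrapped in a transaction, in sqlalchemy; the connection string, table, and column names are illustrative, not the actual script:)

            import sqlalchemy

            engine = sqlalchemy.create_engine("postgresql://lb@localhost/listenbrainz")

            # engine.begin() opens a transaction that commits on success and
            # rolls back if anything raises, so a failed run leaves no mess
            with engine.begin() as connection:
                connection.execute(sqlalchemy.text("""
                    UPDATE "user" lb
                       SET email = mb.email
                      FROM musicbrainz.editor mb
                     WHERE lb.musicbrainz_row_id = mb.id
                """))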
      • ruaok
        there are no keys in redis. but test.lb.org is clearly pulling items from the cache.
      • I checked IP and port and they match.
      • but NO keys?
      • alastairp
        test.lb? first thing to check, is it the same redis instance?
      • _lucifer
        maybe it's treating * as the keyname
      • yeah try it here, it returns nil
      • alastairp
        isn't there a KEYS command too to get the keys?
      • ruaok
        yes. test. I checked that it connects to the right version.
      • keys works. but not get.
      • ruaok wonders if it used to.
      • redis has been running for weeks.
      • ruaok shrugs
      • _lucifer
        it should be KEYS *
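        (The distinction, sketched with redis-py; connection details assumed:)

            import redis

            r = redis.Redis(host="localhost", port=6379)

            r.keys("*")   # KEYS * : "*" is a glob pattern, matches every key name
            r.get("*")    # GET *  : "*" is taken as the literal key named "*", returns None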
      • ruaok
        sorry, then. just being stupid. as usual.
      • _lucifer
        happens to me all the time :)
      • Mr_Monkey
        Yeah, how can you not perfectly remember the million commands from a hundred languages and syntaxes?
      • ruaok
        I can't even remember them from week to week.
      • but strncpy(dest, src, num) will NEVER fall out of my brain.
      • I'll be in the old folks home, not remembering shit... Rob: Arguments to strncpy, go!
      • des, src, num
      • who am i?
      • _lucifer
        lol, i always get confused whether it's dest, src or src, dest
      • Mr_Monkey
        "Is my name desrcnum"?
      • ruaok
        dest src. matches x86 assembler convention.
      • shivam-kapila
        Assemblers. Ah shit here we go again
      • ruaok
        WOW.
      • ts 2.2.0 IS a lot faster.
      • let me remove all the cache keys for the new setup (they are all stale now)
      • _lucifer
        alastairp: my ide is issuing a warning if i leave the org.label-schema.vcs-ref= empty. safe to ignore?
      • alastairp
        _lucifer: good question. the build works, so 🤷...
      • maybe we can set it to "" ?
      • _lucifer
        yeah that quietens it
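        (i.e. roughly:)

            # an explicit empty value instead of a bare org.label-schema.vcs-ref=
            LABEL org.label-schema.vcs-ref=""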
      • alastairp
        ruaok: at least you're not using strcpy
      • ruaok
        there lie daemons.
      • alastairp
        great to know that 2.2 is faster! possible that we can remove the redis keys?
      • ruaok
        with their ttys properly detached and stdin/stdout closed.
      • keeping the keys will still be more future proof. but recovery from losing them might not be as dire.
      • _lucifer
        attempt to store cache got ratelimited
      • alastairp
        the post-run logging looks interesting
      • _lucifer
        alastairp, updated https://github.com/metabrainz/listenbrainz-serv... with the label, want to take a look again or should i merge
      • alastairp
        Unable to reserve cache with key layer-docker-layer-caching-ListenBrainz Unit Tests-d261fe9be9433846e3ce7055d0a028c5a11ad8e664d1ce7e81b5e00730bb7eea, another job may be creating this cache.
      • _lucifer
        ah my bad
      • alastairp
        you need to add `ARG PYTHON_BASE_IMAGE_VERSION` after the FROM line but before LABEL
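        (i.e. something like this; the surrounding lines are assumed:)

            FROM metabrainz/python:$PYTHON_BASE_IMAGE_VERSION
            # re-declared here so the LABEL below can expand it
            ARG PYTHON_BASE_IMAGE_VERSION
            LABEL org.label-schema.vcs-ref="" \
                  org.metabrainz.based-on-image="metabrainz/python:$PYTHON_BASE_IMAGE_VERSION"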
      • _lucifer
        one sec
      • alastairp
        maybe we should also add org.label-schema.vcs-ref at the end of the build as well? to be consistent with the other LB images
      • _lucifer
        it's added only to the prod image, to avoid invalidating the entire test setup.
      • alastairp
        right, nice
      • one thing - I've encountered an issue before in docker multistage builds where in a file like this, if you ask it to build test then it'll also do the items in prod
      • I don't know if this is something that still happens. did you see anything like it?
      • (sorry, I missed the fact that this was a multi-stage file)
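        (For reference, building a single stage; the stage and tag names are assumed:)

            # the classic builder runs every stage that appears before `test` in the
            # file, prod included; buildkit only builds stages that `test` depends on
            docker build --target test -t listenbrainz-spark:test .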
      • _lucifer
        yeah, i noticed that. but it's only one step so i let it slide.
      • alastairp
        run me through - why does the prod version not need java, but the test one does?
      • _lucifer
        the prod version has java, spark and hadoop installed directly on the node
      • we only use docker to manage python dependencies
      • but i think i might be able to get that done without the docker image soon; at that point the entire prod setup will be dockerless
      • alastairp
        and you build a docker image with the dependencies and then copy them out of the docker image into the spark environment?
      • yeah, I think it makes sense to either run everything in docker, or nothing. but we should also think about consistency - if local dev uses docker and production doesn't, is there a risk of things working in one and not the other?
      • _lucifer
        no, the spark installation is mounted into the docker container so it runs inside the container
      • yeah, the current setup is a bit of a hack.
      • alastairp
        I see. what was the reason for that?
      • _lucifer
        the prod is set up as a hadoop yarn cluster. every node has hadoop, yarn and spark configured.
      • on the leader node (the driver) we run the spark-request-consumer inside a docker container.
      • one of the reasons for that is historically everything was run inside docker.
      • every node needs python deps to run the code. the script i shared above handles distributing them to the worker nodes.
      • L12-L16 install all the python deps we need in a virtual env and distribute it to the workers as a zip file
      • however, it does not distribute this file to the driver
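        (The general shape of that pattern; this uses venv-pack as in the spark docs, and the actual script may differ:)

            # pack the virtual env and ship it to every worker alongside the job
            pip install venv-pack
            venv-pack -o pyspark_venv.tar.gz
            export PYSPARK_PYTHON=./environment/bin/python
            spark-submit --archives pyspark_venv.tar.gz#environment spark_manage.py request_consumer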
      • alastairp
        right, and there's not an easy way of running each node as a docker container which includes the source and the dependencies?
      • I don't know anything about hadoop, spark, or yarn, so I'm just making stuff up. If this is the way it has to work then that's fine
      • _lucifer
        that's intentional, we do not want to run inside docker due to security concerns
      • zas or ruaok can explain those better than me. but in the end we want to restrict docker to a minimum on the cluster.
      • the odd one out is the spark-request-consumer, i want to get docker out of the equation there as well
      • there shouldn't be docker here at all.
      • alastairp
        how does the cluster work currently with new code / new dependencies?
      • you build a new image from Dockerfile.spark and do something with that start script?
      • what would the new way be to do it?
      • _lucifer
        exactly the same, just without the docker run line
      • the driver also gets the deps as the zip file or from the venv
      • alastairp
        but how does the updated code get into the cluster?
      • _lucifer
        it's also packaged as a zip; on the leader node we have a clone of the lb repo
      • alastairp
        right. so ssh leader; git pull; run some command ?
      • _lucifer
        yup, ssh, git pull, the script above, done.
      • alastairp
        so the only reason that there's a dockerfile is so that we can run `spark_manage.py request_consumer`?
      • and an alternative would be to just run that from within the pyspark_venv env?
      • _lucifer
        yes
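        (i.e. roughly, in place of the docker run line:)

            source pyspark_venv/bin/activate
            python spark_manage.py request_consumer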
      • ruaok
      • alastairp
        yeah, especially because there's a (small) risk that the env in the docker image and in the packed venv could be different, I think that's a good idea
      • ruaok
        much much much better with ts 2.2
      • alastairp
        and then we'd still have a dockerfile-based setup for local development?
      • _lucifer
        yeah, right.
      • alastairp
      • cool, sounds useful. thanks for your work!
      • _lucifer
        spark is agnostic to the underlying setup so the code does not need to change.
      • alastairp, if you are available, let's test the AB on bono? i tried to bring up the instance but was unable to do it.
      • alastairp
        _lucifer: right - sorry, this was a bit messy because I always thought it was going to be temporary
      • there's a bono.sh file, this uses docker/docker-compose.bono.yml to build, and then docker-compose.yml to start nginx + uwsgi
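        (Roughly what that amounts to; the exact invocation inside bono.sh is assumed:)

            docker-compose -f docker/docker-compose.bono.yml build
            docker-compose -f docker/docker-compose.yml up -d nginx uwsgi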
      • _lucifer
        right, i saw that. it says acousticbrainz-server in the image but there's no such image methinks.