so yes, it called start_registrator, which rm's the running container and starts a new one
alastairp
this is how I've always done it. is there a better way that doesn't try and restart all services?
zas
I just copy the run script (cp cage.sh cage_tmp.sh), remove unneeded lines to keep only the concerned service, and run it. if all is ok, I copy the changes to the real scripts, remove the tmp script, commit changes, push
but we could seriously improve this... ;)
alastairp
right
I guess we could also source services.sh in the shell and just run the start_ method too
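A minimal sketch of that idea, not the actual MetaBrainz scripts: the `services.sh` written here is a stand-in created in a temp directory, its function bodies are hypothetical, and `start_jenkins` is an invented name used only for illustration (only `start_registrator` is mentioned in the chat). It shows that sourcing the file defines the functions in the current shell, so a single service can be started without touching the others.

```shell
# Stand-in for the real services.sh; these function bodies are hypothetical.
tmpdir=$(mktemp -d)
cat > "$tmpdir/services.sh" <<'EOF'
start_registrator() { echo "starting registrator"; }
start_jenkins() { echo "starting jenkins"; }
EOF

# Source the definitions into the current shell (nothing runs yet),
# then call only the one start_ function that is needed.
. "$tmpdir/services.sh"
result=$(start_registrator)
echo "$result"

# Clean up the stand-in file.
rm -rf "$tmpdir"
```

This only works if sourcing the real file has no side effects beyond defining functions, which is an assumption here.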
[brainzutils-python] alastair merged pull request #45 (master…version-ranges): Use open-ended versions so that downstream projects can pin exact versions https://github.com/metabrainz/brainzutils-pytho...
do you want to test LB and CB again with the latest master (check that it installs, check that stuff like CB database lookups work), and confirm with me?
then we can merge the LB change, I'll schedule a CB merge party for Friday
_lucifer
CPU usage on cage spiked again
alastairp: yes that's for BU. sure, i'll check again with LB and then later for CB as well
zas: ^^
alastairp
let me turn jenkins workers down to 3
ruaok
and with CPU spikes come MB service degradations. boo.
zas: do you have a machine at Hetzner dedicated to "processing" tasks that are not impacting services with sensitive response times?
zas
ruaok: best choice atm would be paco
gateways-redis isn't used anymore (but I keep it for now in case keydb has issues)
ruaok looks
and there's pg-williams (<-- why???)
ruaok
lol.
yes, that used to be on.. williams. but the service was never renamed.
zas
services shouldn't be named after machines
ruaok
agreed.
zas
but nvm
ruaok
ok, looking at the weekly load graph, I see that paco would be pretty good.
there are two load spikes that go to 2. and no time sensitive services.
alastairp: what is needed to move the service from cage to paco? just editing the nodes file and stopping/starting services or more than that?
ruaok: we need to copy the jenkins-data volume too: /var/lib/docker/volumes/jenkins-data/_data
ruaok
want me to do that?
alastairp
yes please, I don't really have any time left today to look at this
ruaok
ok.
what is the procedure for migrating a volume?
alastairp
rsync :)
ruaok
take containers down, create new volume, rsync?
alastairp
note that ci.metabrainz.org is magic based on the service existing, so you'll have to shut down on cage before starting up on paco, then it should just magically work
at least that's what happened when I migrated from williams
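The steps discussed above (take the containers down, create the new volume, rsync) could be sketched roughly as follows. This is a dry run that only prints the commands. It assumes it would be run on cage with ssh and root access to paco, and the container name `jenkins` is a hypothetical placeholder; in practice the service would presumably be stopped and started through the existing run scripts.

```shell
# Dry-run sketch: prints the migration commands instead of executing them.
VOLUME_PATH=/var/lib/docker/volumes/jenkins-data/_data
TARGET_HOST=paco
CONTAINER=jenkins   # hypothetical container name, not confirmed in the chat

plan_migration() {
    # 1. Stop the service on the old host so the data stops changing
    #    (and so ci.metabrainz.org is free to follow the service to paco).
    echo "docker stop $CONTAINER"
    # 2. Create the empty named volume on the new host.
    echo "ssh $TARGET_HOST docker volume create jenkins-data"
    # 3. Copy the volume contents; the trailing slashes copy the directory
    #    contents, and --numeric-ids keeps uid/gid numbers unchanged.
    echo "rsync -a --numeric-ids $VOLUME_PATH/ $TARGET_HOST:$VOLUME_PATH/"
}

plan=$(plan_migration)
echo "$plan"
```

Once the copy is verified, the service can be started on paco.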
ruaok
k
zas
I wonder why CPU load on cage affects all mbs services
zas had shared the link and it was open in a tab. i was closing tabs when it just spiked, alastairp :)
alastairp
but there were also LB js tests running. oddly, `ps` hung while printing info about js processes :/
I hope jest doesn't spawn a million threads for whatever reason
Mr_Monkey: when my time frees up next month, I agree that we should sit down and try and improve python style guides. This should include instructions to make vscode do the right thing when you press return, consolidation of the tools that we have locally, in jenkins and squaking, and removal of stupid warnings that we don't want