looks like some time last year Docker-the-company sold their deployment division to another company, which includes docker swarm
2020-04-08 09949, 2020
alastairp
apparently they're now "focused on developer tools"
2020-04-08 09926, 2020
alastairp
this gives me bad feelings about how much longer swarm is going to be around for. I give it 2 years
2020-04-08 09952, 2020
iliekcomputers
Moin
2020-04-08 09940, 2020
zas
alastairp: I'm there
2020-04-08 09942, 2020
yvanzo
iirc we rather talked about kubernetes at the latest summit, which might be more reliable regarding these concerns.
2020-04-08 09932, 2020
zas
yvanzo: yes, we should rather think about how to migrate to kubernetes. The move to docker was just a first step, and our current setup was mainly targeted at making the move possible. Now most devs are more familiar with docker, and our apps are, at least partially, ready for a step further.
"Supporting Kubernetes applications is more challenging than Docker Swarm. Kubernetes provides a more flexible architecture, at the cost of increased complexity."
ubuntu specific? I have it on some debians. will check today, thanks
2020-04-08 09922, 2020
alastairp
I have some questions about volumes. from having a look at the docker-server-configs scripts it looks like almost all external data is stored on named volumes
2020-04-08 09953, 2020
alastairp
what's the process for backing up this data? e.g. in AcousticBrainz we have data models created by people, which result in data files. Ideally we should back this up
2020-04-08 09959, 2020
alastairp
from looking at other systems, it seems like you have for example `start_sshd_musicbrainz_json_dumps_incremental`, which starts an sshd with a volume mounted. Is this so that another process can get into it and copy content out?
basically, you set up client side on the node (if not already), and add your paths to the create script
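The pattern being described can be sketched roughly like this; the function name, image, port, and volume name here are assumptions for illustration, not the actual docker-server-configs values:

```shell
# Hypothetical sketch of the sshd-over-a-volume pattern: start a
# throwaway sshd container with the named volume mounted read-only,
# so an external backup process can ssh in and copy the data out.
start_sshd_for_volume() {
    local volume="$1"
    docker run -d \
        --name "sshd_${volume}" \
        -v "${volume}:/data:ro" \
        -p 2222:22 \
        linuxserver/openssh-server
}

# e.g. start_sshd_for_volume musicbrainz-json-dumps
```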
2020-04-08 09941, 2020
alastairp
don't have permission to that repo
2020-04-08 09947, 2020
zas
ah, let me check
2020-04-08 09957, 2020
zas
test now
2020-04-08 09952, 2020
alastairp
I see it
2020-04-08 09912, 2020
alastairp
and does that copy text straight out of the volume location on disk, or does it start a container with the volume mounted and copy it out of there?
2020-04-08 09935, 2020
alastairp
`/var/lib/docker/volumes/jenkins-data` looks like straight from disk? I don't know much about the local volume driver. is this safe?
2020-04-08 09950, 2020
zas
yes, afaik
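For reference, data for the `local` volume driver really does live under `/var/lib/docker/volumes/<name>/_data` on the host, so reading it from disk works. A common alternative that avoids depending on Docker's on-disk layout is to copy through a short-lived container; a sketch, with the helper name and output path assumed:

```shell
# Sketch: archive a named volume without touching
# /var/lib/docker/volumes directly, by mounting it read-only into a
# throwaway container and tarring it out to the current directory.
backup_volume() {
    local volume="$1"
    docker run --rm \
        -v "${volume}:/volume:ro" \
        -v "${PWD}:/backup" \
        alpine tar czf "/backup/${volume}.tar.gz" -C /volume .
}

# e.g. backup_volume jenkins-data   # would write ./jenkins-data.tar.gz
```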
2020-04-08 09919, 2020
alastairp
ok, great. I'll add to my todo list that we have to make backups for some files, and will ask you if I have any other questions
2020-04-08 09952, 2020
zas
check if the node you want to backup from has borg setup already
2020-04-08 09948, 2020
alastairp
boingo. how do I do that? see if there's a borg container?
2020-04-08 09934, 2020
alastairp
ah, no node file in borg-backup repo. I guess that's a no
2020-04-08 09948, 2020
zas
it depends, it can use "default" config
2020-04-08 09955, 2020
zas
but systemctl list-timers mb-backup.timer
2020-04-08 09902, 2020
zas
should show a timer
2020-04-08 09914, 2020
zas
I don't think there's one on boingo yet
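The check zas describes boils down to something like this; the unit name is from the conversation, but the output parsing is an assumption about `systemctl list-timers`, which prints "0 timers listed." (and no unit name) when the timer is absent:

```shell
# Returns success if the mb-backup borg timer is installed on this
# node, by looking for the unit name in the list-timers output.
has_backup_timer() {
    systemctl list-timers mb-backup.timer | grep -q "mb-backup.timer"
}
```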
2020-04-08 09942, 2020
alastairp
0 timers. what's the process here? Open a ticket for you to install it?
2020-04-08 09911, 2020
zas
I'll do it right now
2020-04-08 09919, 2020
alastairp
thanks!
2020-04-08 09925, 2020
alastairp
one more question, about creating volumes
2020-04-08 09950, 2020
alastairp
I had a look - it doesn't seem like there's a function in services.sh or similar for generic "create a volume". is that right?
2020-04-08 09900, 2020
alastairp
every place that I see just calls `docker volume create` when it's needed
2020-04-08 09954, 2020
alastairp
for AB, we have a volume for data shared between 3 different services. This means that we need to create it once before bringing up the services
2020-04-08 09936, 2020
alastairp
https://github.com/metabrainz/docker-server-confi… here, iliekcomputers just runs it in boingo.sh before bringing up services, however it seems a bit wrong to me to put a command like this in a node script, as all other commands call generic start_ functions
2020-04-08 09947, 2020
alastairp
the alternative is to just run this command anyway at the beginning of _all_ `start_` scripts that require it, because if the volume already exists the command just completes without performing any action. However, this also seems dangerous to me, because there's a risk of adding a new service and forgetting to add this command
2020-04-08 09901, 2020
Gazooo has quit
2020-04-08 09949, 2020
Gazooo joined the channel
2020-04-08 09957, 2020
Chinmay3199 joined the channel
2020-04-08 09958, 2020
zas
you can just ensure the volume exists, and create it in start_* commands if needed. Those scripts are rather hacky; we don't have any dependency management or even priorities. Another reason to move to kubernetes or the like
then tell me when done, so I can test it runs properly
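Since `docker volume create` is idempotent (creating a volume that already exists succeeds without changing anything), the ensure-it-exists approach can live in a tiny shared helper rather than in each node script; the function and volume names here are made up for illustration:

```shell
# Hypothetical helper for services.sh: create the volume if it does
# not exist yet. `docker volume create` is a no-op for an existing
# volume, so every start_* function that mounts a shared volume can
# call this safely before starting its container.
ensure_volume() {
    docker volume create "$1" > /dev/null
}

# e.g. at the top of each AB start_* function:
#   ensure_volume acousticbrainz-data
```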
2020-04-08 09907, 2020
zas
by default backups happen once a day
2020-04-08 09922, 2020
zas
and target is the machine with RAID1 drives at the office
2020-04-08 09945, 2020
zas
everything is encrypted, compressed, and underlying protocol is rsync
2020-04-08 09941, 2020
shivam-kapila joined the channel
2020-04-08 09924, 2020
alastairp
Today I learned that there are 4 trimesters in a year. the tri- defines the number of months, not the number of divisions, so there are 4 3-month trimesters, instead of 3 of 4 months. similarly, in semester, the se- is from Latin for 6; I always related it to the number 2, because I counted it as 2 divisions of the year
2020-04-08 09945, 2020
yvanzo
We are so proud of you! Might it be because uni was open only 3 trimesters in a year? ;)
2020-04-08 09955, 2020
alastairp
yeah, exactly!
2020-04-08 09947, 2020
alastairp
I guess semestre in French ties a lot more to 6
2020-04-08 09934, 2020
yvanzo
Not really, it is quite the same, and I'd always been confused about trimesters until filling out my tax declaration.
2020-04-08 09908, 2020
ruaok
moooin!
2020-04-08 09919, 2020
ruaok
> trimestral tax declarations. I was confused because there are 4 of them in a year!
2020-04-08 09923, 2020
ruaok
yep, I've done that. :)
2020-04-08 09942, 2020
ruaok
iliekcomputers: thanks for moving the branch along -- it was really good to go offline for the evening...
2020-04-08 09902, 2020
iliekcomputers
happy to help!
2020-04-08 09956, 2020
iliekcomputers
i think shivam-kapila has all the tests fixed, although the travis build is still borked
2020-04-08 09929, 2020
ruaok
I need to tend to business stuff, then I'll finish surgically removing influx... which should fix the rest of the tests.
2020-04-08 09947, 2020
ruaok
my brain didn't pick a good stopping point to melt down yesterday.
2020-04-08 09956, 2020
iliekcomputers
ruaok: we will have to run both influx and timescale simultaneously for some time tho, right?
2020-04-08 09934, 2020
ruaok
I hope for that to be measured in hours, not days.
2020-04-08 09958, 2020
ruaok
testing will happen on the timescale instance on gaga.
2020-04-08 09949, 2020
ruaok
once we're happy with the timescale code, then clean the incoming queue for timescale and stop the timescale_writer and let listens pile up.
2020-04-08 09913, 2020
ruaok
a few minutes after that, we will trigger an LB full dump. I'll take the full dump and run my import/cleanup scripts.
2020-04-08 09945, 2020
ruaok
we'll import the data completely and then start the timescale_writer. all duplicate listens will be ignored and the new listens will be inserted.
2020-04-08 09903, 2020
ruaok
and then we should be consistent between influx and timescale.
2020-04-08 09913, 2020
ruaok
then we can decide when to cut over to timescale in production.
2020-04-08 09921, 2020
ruaok
that's the plan I've hashed out.
2020-04-08 09911, 2020
iliekcomputers
that makes sense to me.
2020-04-08 09903, 2020
shivam-kapila
Hi :)
2020-04-08 09935, 2020
ruaok
hi shivam-kapila
2020-04-08 09946, 2020
Mr_Monkey
alastairp: When I was learning Latin at school, I didn't necessarily believe them when they said it would be useful. I've since come to agree with the teachers !
2020-04-08 09949, 2020
ruaok
iliekcomputers: I'm glad. timescale and its rock solid dups handling makes it easy.
2020-04-08 09912, 2020
ruaok
iliekcomputers: I'm also going to remove dups and fuzzy last.fm dupes in the re-import process.
2020-04-08 09939, 2020
iliekcomputers
yeah, that sounds like a good idea
2020-04-08 09941, 2020
ruaok
e.g. two listens that are identical in a 2 second window will be considered dupes
2020-04-08 09952, 2020
iliekcomputers
i wonder if there are more things in the data that we should fix while we're at it
2020-04-08 09956, 2020
ruaok
identical save for the timestamp.
2020-04-08 09904, 2020
iliekcomputers
i'm pretty sure there are, i'll look over it once
2020-04-08 09910, 2020
ruaok
please do.
2020-04-08 09922, 2020
ruaok
I know those two are easy goals....
2020-04-08 09945, 2020
ruaok
remember that my process sorts all of the listens into one file. (shudder). and then that file is sorted in a massive sort operation.
2020-04-08 09915, 2020
ruaok
then it is imported in sorted order, so anything that we can run over a narrow window of listens, we can do in the import.
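Because the file is already sorted, a fuzzy-dedup pass like the 2-second rule only ever needs to look at the previous line. A toy sketch of that idea, with an assumed field layout (tab-separated `listened_at`, `user`, `track` — not the real dump format):

```shell
# Toy sketch: drop a listen if it matches the previous line except
# for a listened_at within 2 seconds of it. Assumes input already
# sorted by user then timestamp, one tab-separated
# "listened_at user track" record per line.
dedup_listens() {
    awk -F'\t' '
        # keep the listen unless user+track match the previous line
        # and the timestamps differ by 2 seconds or less
        !(user == $2 && track == $3 && $1 - ts <= 2) { print }
        { ts = $1; user = $2; track = $3 }
    '
}
```

For example, feeding it two `alice`/`song1` listens at timestamps 100 and 101 keeps only the first; a later `song2` listen passes through untouched.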
2020-04-08 09933, 2020
iliekcomputers
this logic is in the import function in timescale_listenstore?
i've been releasing small diffs over the week and it's definitely a much better process.
2020-04-08 09931, 2020
ruaok
iliekcomputers: agreed, I'm keeping that in mind.
2020-04-08 09910, 2020
shivam-kapila
ruaok: When you get time, please take a look at the change I made in the Spark dumps to make them consistent with Influx. I changed the timestamp used to check for unwritten listens to be based on listened_at rather than created, because created is NULL in some cases. I haven't made a PR yet; it's on my fork. Please ping me when you want the link.
2020-04-08 09946, 2020
ruaok
ok, will do. this afternoon.
2020-04-08 09925, 2020
shivam-kapila
The tests are done as far as I know, and I have moved on to modifying the Timescale writer