#metabrainz

/

1:15 AM
vogen joined the channel

2021-04-15 10524, 2021

1:45 AM
vogen has quit

2021-04-15 10551, 2021

1:53 AM
MusicbrainzB0T has quit

2021-04-15 10508, 2021

1:54 AM
MusicbrainzB0T joined the channel

2021-04-15 10508, 2021

2:54 AM
thomasross has quit

2021-04-15 10531, 2021

5:04 AM
reosarevok

CatQuest: yeah, that's all it does. Now that's all sortname guess does too :p

2021-04-15 10517, 2021

5:27 AM
_lucifer

ruaok, i tried to test again on michael but it seems not to be picking up requests from the queue again.

2021-04-15 10538, 2021

5:27 AM
_lucifer

however pinging prince on the 62673 succeeds.

2021-04-15 10555, 2021

5:42 AM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #2066 (beta…MBS-11583): MBS-11583: Use sanitized context in hydrated component https://github.com/metabrainz/musicbrainz-server/…

2021-04-15 10500, 2021

6:00 AM
BrainzGit

[musicbrainz-server] reosarevok merged pull request #2066 (beta…MBS-11583): MBS-11583: Use sanitized context in hydrated component https://github.com/metabrainz/musicbrainz-server/…

2021-04-15 10509, 2021

6:08 AM
BrainzGit

[musicbrainz-server] reosarevok merged pull request #2033 (master…MBS-11542): MBS-11542 / MBS-11552: Allow and cleanup new Classical Archives links + add validation https://github.com/metabrainz/musicbrainz-server/…

2021-04-15 10548, 2021

6:38 AM
_lucifer

ok i figured due to some reason yarn is trying to connect to marlon instead of worker-marlon. i think that might be the case if its tries public ip instead of internal ip.

2021-04-15 10558, 2021

6:39 AM
_lucifer

interesting hdfs cli works as expected though reports the internal ips and worker-*

2021-04-15 10558, 2021

7:29 AM
_lucifer

ok. the ip thing is fixed now but it now fails with user application exited with exit code 1.

2021-04-15 10535, 2021

7:30 AM
alastairp

CatQuest: thanks! blank spaces is OK, the only problem we might have is an empty string. Anything that is actually represented by characters is fine 👍

2021-04-15 10511, 2021

7:32 AM
sumedh joined the channel

2021-04-15 10532, 2021

7:44 AM
reosarevok

bitmap, yvanzo: for MBS-1658, I was thinking at least one of the places to add a comment to the entry should be from the list itself

2021-04-15 10533, 2021

7:44 AM
BrainzBot

MBS-1658: My Collection: add free text comment field https://tickets.metabrainz.org/browse/MBS-1658

2021-04-15 10544, 2021

7:44 AM
reosarevok

https://usercontent.irccloud-cdn.com/file/pajfOoK…

2021-04-15 10553, 2021

7:46 AM
CatQuest

alastairp: iirc there was an issue with an artist credit that included https://beta.musicbrainz.org/artist/3f0bdf7f-3f40… some time back..

2021-04-15 10529, 2021

7:48 AM
CatQuest

https://beta.musicbrainz.org/edit/17556643

2021-04-15 10551, 2021

7:48 AM
reosarevok

bitmap, yvanzo: So that last column should have like an edit icon somehow, and I guess allow inline editing that would get sent to the DB? Do you know if we do anything like that anywhere else?

2021-04-15 10503, 2021

7:49 AM
reosarevok

Or if you think that's a bad idea, how would you do it?

2021-04-15 10508, 2021

7:49 AM
alastairp

CatQuest: hah, nice

2021-04-15 10521, 2021

7:49 AM
alastairp

that's not a problem either though, but I can see how it could be a problem

2021-04-15 10534, 2021

7:49 AM
CatQuest

it was a nice bobby tables thing

2021-04-15 10541, 2021

7:49 AM
CatQuest

.. niche

2021-04-15 10550, 2021

7:49 AM
CatQuest

damn englich

2021-04-15 10526, 2021

7:57 AM
sumedh has quit

2021-04-15 10551, 2021

8:19 AM
D4RK joined the channel

2021-04-15 10541, 2021

8:21 AM
D4RK-PH0ENiX has quit

2021-04-15 10538, 2021

8:34 AM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #2067 (master…MBS-10899): MBS-10899: Report for releases with catnos that look like ISRCs https://github.com/metabrainz/musicbrainz-server/…

2021-04-15 10529, 2021

8:59 AM
ruaok

moooin!

2021-04-15 10554, 2021

9:01 AM
ruaok

Mr_Monkey: alastairp : so after more experimenting last night, I'm able to get rid of the min/max ts cont aggs by simply creating a 5 days cont agg with compound index on user/listened_at. Thats 18M rows less for starters.

2021-04-15 10517, 2021

9:02 AM
ruaok

and I think we can replace those with month and year cont aggs for the graphs you two would like.

2021-04-15 10519, 2021

9:02 AM
alastairp

right. get all data from the same table?

2021-04-15 10541, 2021

9:02 AM
ruaok

yeah, it was already there. just the index was missing to make it faster.

2021-04-15 10543, 2021

9:02 AM
alastairp

sweet, if a month and year aggregate is possible then that sounds like it should be perfect

2021-04-15 10544, 2021

9:02 AM
alastairp

great

2021-04-15 10551, 2021

9:04 AM
ruaok

basically we swapped doing a table scan on the DB with an index scan. not sure we can do much better than that -- but with increased cache times, this should work well.

2021-04-15 10517, 2021

9:06 AM
alastairp

_lucifer: ^ remember how I told you to add indexes to tables where you want to select some data?

2021-04-15 10521, 2021

9:11 AM
_lucifer

yup, i'll keep it in mind :D

2021-04-15 10555, 2021

9:11 AM
ruaok

its the rookie mistake that keeps on giving. #going25yearsstrong

2021-04-15 10521, 2021

9:15 AM
sumedh joined the channel

2021-04-15 10557, 2021

9:29 AM
ruaok

woah. https://restofworld.org/2021/the-rise-and-fall-of…

2021-04-15 10522, 2021

9:37 AM
sumedh has quit

2021-04-15 10555, 2021

9:45 AM
_lucifer

ruaok, apparently `0.0.0.0` is causing hostname resolution errors. 0.0.0.0 is resolving to the server's name michael instead of leader.

2021-04-15 10500, 2021

9:46 AM
_lucifer

https://www.irccloud.com/pastebin/Uuitqvql/

2021-04-15 10538, 2021

9:46 AM
_lucifer

the same error was happening on workers leading to resolving as tito instead of worker-tito so on.

2021-04-15 10506, 2021

9:47 AM
sumedh joined the channel

2021-04-15 10540, 2021

9:47 AM
_lucifer

i fixed that by changing the configurations of various files here https://github.com/metabrainz/hadoop-cluster-dock…

2021-04-15 10543, 2021

9:48 AM
_lucifer

but for michael it seems it is picking up some default we didn't use to define in the earlier setup. any guesses which one it could be?

2021-04-15 10504, 2021

10:00 AM
ruaok

_lucifer: I suspect that is because the canonical name of the machine is michael and has its reverse DNS set like that.

2021-04-15 10537, 2021

10:00 AM
ruaok

so, for config purposes you should always use michael. leader is just a shorthand/convention for us to log into the cluster.

2021-04-15 10522, 2021

10:01 AM
_lucifer

i think what is happening is that michael is used when it tries to bind an interface on 0.0.0.0.

2021-04-15 10529, 2021

10:01 AM
ruaok

I would change the /etc/hosts and change leader to michael.

2021-04-15 10526, 2021

10:02 AM
_lucifer

makes sense. i'll try that.

2021-04-15 10547, 2021

10:02 AM
ruaok

not 100% sure that will work. the last paste -- which container is that from? is it up?

2021-04-15 10509, 2021

10:03 AM
_lucifer

no it goes down after that.

2021-04-15 10515, 2021

10:03 AM
ruaok

from the inside of that container can you ping michael:31171 ?

2021-04-15 10544, 2021

10:03 AM
ruaok

try the bash trick again, get the container up and then see what you can or cannot connect to.

2021-04-15 10511, 2021

10:08 AM
_lucifer

just tried that fails with unknown host

2021-04-15 10534, 2021

10:08 AM
_lucifer

ping michael works but with the port doesn't

2021-04-15 10500, 2021

10:09 AM
ruaok

sorry wget michael:31171

2021-04-15 10504, 2021

10:12 AM
_lucifer

Aaannd it worked!

2021-04-15 10512, 2021

10:12 AM
ruaok

!m _lucifer

2021-04-15 10512, 2021

10:12 AM
BrainzBot

You're doing good work, _lucifer!

2021-04-15 10513, 2021

10:12 AM
_lucifer

https://www.irccloud.com/pastebin/9mkggfTg/

2021-04-15 10521, 2021

10:12 AM
ruaok

yisss!

2021-04-15 10533, 2021

10:12 AM
_lucifer

changing to michael didn't work

2021-04-15 10543, 2021

10:12 AM
_lucifer

but changing a spark default to the vlan ip did

2021-04-15 10543, 2021

10:12 AM
ruaok

so, lets do all the loading of data (mapping, incrementals) then we can fire off some jobs.

2021-04-15 10555, 2021

10:12 AM
ruaok

that makes sense.

2021-04-15 10559, 2021

10:12 AM
ruaok

very very good.

2021-04-15 10537, 2021

10:13 AM
_lucifer

two things left to do. one is define memory defaults and second is update the new configuration in syswiki.

2021-04-15 10508, 2021

10:21 AM
_lucifer

monitoring this cluster is easier than the docker one. one tunnel is sufficient

2021-04-15 10512, 2021

10:22 AM
ruaok

that was exactly the goal.

2021-04-15 10529, 2021

10:22 AM
ruaok

and each server is being monitored by all of zas' magic.

2021-04-15 10548, 2021

10:22 AM
_lucifer

:D

2021-04-15 10519, 2021

11:14 AM
_lucifer

alastairp: available to talk about the GH actions PR?

2021-04-15 10522, 2021

12:00 PM
_lucifer

ruaok, zas: do you know if any service we run on j5 might listen on port 5666?

2021-04-15 10548, 2021

12:00 PM
zas

nagios

2021-04-15 10554, 2021

12:00 PM
zas

well, its agent

2021-04-15 10544, 2021

12:01 PM
_lucifer

👍 thanks!

2021-04-15 10537, 2021

12:03 PM
sumedh has quit

2021-04-15 10518, 2021

12:06 PM
sumedh joined the channel

2021-04-15 10519, 2021

12:10 PM
_lucifer

zas, i saw a commit in syswiki renaming germaine to jermaine so wanted to let you know that i noticed /etc/hosts on jermaine still contains has a couple of entries referring germaine.

2021-04-15 10519, 2021

12:12 PM
zas

oh, ok, I'll fix it

2021-04-15 10539, 2021

12:21 PM
zas

https://data.musicbrainz.org (ftp / williams over http(s|2))

2021-04-15 10523, 2021

12:22 PM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #2068 (master…MBS-10711): MBS-10711: Convert report lists to react-table [WIP] https://github.com/metabrainz/musicbrainz-server/…

2021-04-15 10540, 2021

12:24 PM
reosarevok

bitmap, yvanzo ^ would really appreciate some feedback on whether the way I'm approaching this seems sensible, improvements etc, before I keep working on other lists

2021-04-15 10529, 2021

12:44 PM
zas

https://blog.metabrainz.org/2021/04/15/picard-2-6…

2021-04-15 10504, 2021

13:19 PM
BrainzGit

[bookbrainz-site] akashgp09 opened pull request #601 (master…browser-compatibility): FIX(BB-615): Copy/Paste annotation text in FireFox <= 60 https://github.com/bookbrainz/bookbrainz-site/pul…

2021-04-15 10517, 2021

13:51 PM
scory joined the channel

2021-04-15 10518, 2021

13:58 PM
scory

Hello everyone. I would like to ask about Lucene Search syntax of the musicbrainz database. What kind of instance is it running on? If i would like to have a musicbrainz database (mbdata) with ElasticSearch instance, how do you recommend to integrate these two. I just started looking into ElasticSearch but what I found out i would need some kind of

2021-04-15 10518, 2021

13:58 PM
scory

data set to import it to ElasticSearch (*.json for example). Do you have some kind of method to import musicbrainz database into a Lucene Search intance, a data set I can use, or i should generate it myself? How do you keep it updated?

2021-04-15 10547, 2021

13:58 PM
ruaok

hi scory!

2021-04-15 10553, 2021

13:58 PM
ruaok

why must is be elasticsearch?

2021-04-15 10528, 2021

13:59 PM
ruaok

because we have a perfectly working search infrastructure that you can use without having to reinvent the wheel.

2021-04-15 10537, 2021

14:04 PM
scory

That infrastructure currently doesn't support what I need, last time I was here that was the conclusion for me. That's the reason I am currently running a mbdata server locally, and can run graphql queries against it, with batching. But I would like to implement an ElasticSearch instance on graphql. But currently i am just investigating.

2021-04-15 10551, 2021

14:05 PM
ruaok

you could look at the denormalized JSON dumps we have: ftp://ftp.eu.metabrainz.org/pub/musicbrainz/data/…

2021-04-15 10500, 2021

14:06 PM
ruaok

those fit for importing into a document store.

2021-04-15 10511, 2021

14:07 PM
scory

Thank you very much.

2021-04-15 10519, 2021

14:08 PM
scory has quit

2021-04-15 10553, 2021

14:24 PM
sumedh has quit

2021-04-15 10533, 2021

14:54 PM
bitmap

reosarevok: I can't think of anything else like that offhand, but doesn't seem like a bad idea. we could add a small endpoint to /ws/js for it

2021-04-15 10505, 2021

15:03 PM
sumedh joined the channel

2021-04-15 10526, 2021

15:52 PM
vardan has quit

2021-04-15 10558, 2021

16:17 PM
adhi001 joined the channel

2021-04-15 10558, 2021

16:20 PM
adhi001

Sorry ruaok , I was sick the last week and was not able to submit a proposal for GSoC. Still part of the community :)

2021-04-15 10520, 2021

16:21 PM
ruaok

oh, bummer. that sucks. at least you're better, right?

2021-04-15 10531, 2021

16:21 PM
adhi001

yeah

2021-04-15 10508, 2021

16:23 PM
adhi001

Thank you for your concern

2021-04-15 10542, 2021

16:24 PM
alastairp

_lucifer: hi, sorry - had a hectic day. still around?

2021-04-15 10553, 2021

16:25 PM
_lucifer

alastairp: hi! no worries. yup, i am available.

2021-04-15 10506, 2021

16:26 PM
alastairp

so I was suggesting using test.sh in the actions?

2021-04-15 10516, 2021

16:26 PM
_lucifer

yes

2021-04-15 10542, 2021

16:26 PM
alastairp

so we already have things like `./test.sh -b` to build, and `test.sh -u` to bring up containers

2021-04-15 10548, 2021

16:26 PM
alastairp

test.sh fe to run frontend tests

2021-04-15 10527, 2021

16:27 PM
_lucifer

yes there's also test.sh spark

2021-04-15 10549, 2021

16:28 PM
alastairp

great, so it sounds like it's probably a good fit that we can use the actions files for specifying the orders in which to run things, but reusing test.sh for the actual commands allows us to share 100% test code between local developement and CI, right?

2021-04-15 10505, 2021

16:29 PM
_lucifer

should we use separate build steps? like there's a ./test.sh -u to just bring up supporting containers. or should we just to do ./test.sh which does it in one go

2021-04-15 10500, 2021

16:30 PM
alastairp

but we need to separate pull / cache / build / run, in CI, right?

2021-04-15 10506, 2021

16:30 PM
_lucifer

yes, mostly. we'll still have to pull manually

2021-04-15 10527, 2021

16:30 PM
_lucifer

that step won't change

2021-04-15 10558, 2021

16:31 PM
alastairp

one question - if a test generates files (e.g. the junit xml), will it be cached? Or does the cache action only cache docker layers?

2021-04-15 10539, 2021

16:32 PM
_lucifer

i'll need to check that.

2021-04-15 10515, 2021

16:33 PM
_lucifer

i expect docker layers only but we can confirm it by generating some files and looking at the actions output

2021-04-15 10536, 2021

16:33 PM
alastairp

it seems that satackey/action-docker-layer-caching works explicitly on layers (looking at the output, it uses docker commands to generate the archives)

2021-04-15 10505, 2021

16:34 PM
alastairp

so yeah, ./test.sh pull; restore cache; run test.sh; save cache

2021-04-15 10512, 2021

16:34 PM
alastairp

if that works, then I'm all for it!

2021-04-15 10521, 2021

16:34 PM
_lucifer

makes sense. i'll try that.

2021-04-15 10528, 2021

16:34 PM
_lucifer

i also opened https://github.com/metabrainz/brainzutils-python/…

2021-04-15 10555, 2021

16:34 PM
_lucifer

the junit action works fine but as i mentioned it might comment excessively

2021-04-15 10513, 2021

16:35 PM
alastairp

neat. did you see what happens if one fails? does it only update the comment or does it also add an annotation to the failing test?

2021-04-15 10518, 2021

16:36 PM
alastairp

and it'll make a new comment on every push (i.e. even if they all pass?)

2021-04-15 10519, 2021

16:36 PM
_lucifer

no i haven't, let me do that right now.

2021-04-15 10524, 2021

16:36 PM
_lucifer

yes

2021-04-15 10525, 2021

16:36 PM
alastairp

or only if the results of the test run change?

2021-04-15 10529, 2021

16:36 PM
alastairp

interesting

2021-04-15 10535, 2021

16:36 PM
alastairp

you're right that this could get a bit annoying

2021-04-15 10543, 2021

16:36 PM
_lucifer

it'll hide the existing one and add a new one

2021-04-15 10502, 2021

16:37 PM
_lucifer

for LB there are going to be 4 comments on each push

2021-04-15 10538, 2021

16:37 PM
alastairp

oh, that's quite annoying. merging tests together would help (e.g. get it down to 2), but I suspect that this might be too much

2021-04-15 10551, 2021

16:37 PM
alastairp

one thing that ruaok was suggesting back on jenkins was that it seems stupid to run tests on _every_ push, perhaps there could be a way to run them less often. once a day? on request based on a comment? just before merge?

2021-04-15 10520, 2021

16:38 PM
ruaok

anything, really.

2021-04-15 10533, 2021

16:38 PM
alastairp

let's not spend too much more time on this, but perhaps there is an action or a flag for `on:` that lets us decide to run them less often

2021-04-15 10537, 2021

16:38 PM
_lucifer

i'll look into that should be possible i think

2021-04-15 10553, 2021

16:38 PM
alastairp

on: comment: contains: "test please"

2021-04-15 10556, 2021

16:38 PM
alastairp

that'd be great