in #metabrainz

1:36 AM
d4rkie joined the channel
1:54 AM
kepstin_ joined the channel
1:58 AM
kepstin_ has quit
2:24 AM
d4rkie has quit
2:41 AM
Sophist_UK joined the channel
2:42 AM
Sophist-UK has quit
2:46 AM
d4rkie joined the channel
2:48 AM
akashgp09 joined the channel
4:48 AM
peterhil joined the channel
7:27 AM
Etua joined the channel
7:42 AM
ruaok

moin moin!
7:43 AM
lucifer: that does sound quite promising.
7:46 AM
d4rkie has quit
7:48 AM
d4rkie joined the channel
8:21 AM
Etua has quit
8:21 AM
Etua joined the channel
8:22 AM
Etua has quit
8:55 AM
Freso

https://listenbrainz.org/similar-users suggests that the two users listed have identical listening histories, but that doesn’t seem apparent from looking at their profiles, and also their MB profiles don’t seem to hint at them being the same person. Any chance someone with more direct access to the data can give them a look and give me some more insight in what is or might be going on?
9:04 AM
lucifer

Freso: that page is currently bugged. i'll look into fixing it soon.
9:04 AM
Freso

lucifer: 👍
9:05 AM
lucifer: Is there a ticket I can watch? :)
9:05 AM
lucifer

Freso: none that i know of. but i'll ping you once its fixed.
9:06 AM
ruaok: hi! as you can see from my later comments, it worked but showed another problem. to fix the issue completely we also need to offload the spark invocation to a different thread. i'll probably take the MBID mapping writer route for that.
9:08 AM
ruaok: alastairp: let me know when you have some time. need to discuss about artist credit id -> artist mbid in LB stats.
9:09 AM
ruaok

I have a few mins. (On mobile)
9:09 AM
What is the q?
9:10 AM
I suspect that you need a mapping available in spark, yes?
9:11 AM
lucifer

lb stats now return artist_credit_id and artist name, but frontend needs a artist mbid so we need to add a translation step somewhere
9:11 AM
ruaok

That should be easy to create when you have an mb db connection
9:12 AM
lucifer

yeah so doing on LB side is probably better.
9:12 AM
ruaok

Look at the mb schema and perhaps get reosarevok to help you with this query, if it isn't obvious.
9:12 AM
lucifer

when LB receives stats from spark, replace artist_credit_ids with artist_mbids, right?
9:12 AM
ruaok

Then write the data to a pandas dataframe then parquet, then import.
9:13 AM
lucifer

ah ok, understood
9:13 AM
ruaok

A longer term solution night be to have them included in the mbid mapping.
9:13 AM
Export ac id and array[mbids]
9:14 AM
It would make the mapping dump a bit larger...
9:14 AM
lucifer

makes sense
9:14 AM
we are going to need artist mbid almost always though, artist credit are not very useful to expose to users.
9:15 AM
*artisy credit ids are not
9:17 AM
thanks, i'll try the separate dump for now. we can look into the longer term solution later
10:19 AM
peterhil has quit
10:22 AM
yvanzo: hi! i am investigating some rabbitmq issues in LB. i see sir is having those too according to rabbitmq-clash logs. mentioning in case its relevant.
10:22 AM
https://www.irccloud.com/pastebin/1EjDPWrK/
10:24 AM
looks like the original channel got closed and then it was opened again and tried to ack a message that the old channel received.
10:40 AM
peterhil joined the channel
11:14 AM
peterhil has quit
11:29 AM
peterhil joined the channel
11:59 AM
alastairp

lucifer: this is interesting: https://github.com/metabrainz/acousticbrainz-se...
11:59 AM
annoy fails to compile on the prod build action, I just tried it locally from scratch and it worked fine. Perhaps something related to a cached layer that's incorrect?
12:01 PM
lucifer

possible or maybe something do with buildx?
12:02 PM
alastairp

https://github.com/spotify/annoy/issues/402
12:02 PM
I need a way to check the version of gcc in the build container, I'll check
12:03 PM
but I've just had a thought - It could be something weird like the the build farm hardware advertising that it has a particular avx extension
12:03 PM
which could cause problems if we try and build on one architecture and then deploy on another :(
12:05 PM
lucifer

yeah :/, we'll need to configure according to our machine's hardware
12:09 PM
alastairp

they have a fix for it which I think will help for now - because we're compiling on a lower version of gcc it'll skip the extension, but this is definitely something to check and fix when we upgrade gcc
12:09 PM
https://github.com/spotify/annoy/pull/430/files...
12:09 PM
actually, there we go. they used to have a specific -no-avx flag
12:14 PM
they have the issue reported already: https://github.com/spotify/annoy/issues/472
12:15 PM
looks like a flag to say "only use ssse3 even if you have avx512 available" would be good
12:16 PM
lucifer

oh! nice.
12:17 PM
fwiw we have avx, avx2 and sse(%d) on clash
12:18 PM
alastairp

correct. avx-512 is a new intel extension, looks like it won't be supported in AMD until the next generation: https://www.techpowerup.com/279129/amd-zen-4-mi...
12:18 PM
https://github.com/spotify/annoy/blob/master/sr...
12:19 PM
unfortunately, you can only say "disable vectorisation", you can't say "disable avx512 but use sse3"
12:20 PM
there is this gcc flag -mtune=native, but from reading the docs it's not clear to me if this means that it'll use sse3: https://gcc.gnu.org/onlinedocs/gcc/x86-Options....
12:22 PM
lucifer

we can write a few simple .c files and print out the flags and test maybe?
12:22 PM
the -mtune will probably just define the relevant headres
12:22 PM
s/headrs/macros
12:25 PM
btw i had rerun the job and it built fine this time https://github.com/metabrainz/acousticbrainz-se...
12:25 PM
alastairp

OK, I just worked out what mtune=native does. it means "use all of the flags that are available on this specific build machine", which isn't what we want
12:25 PM
https://wiki.gentoo.org/wiki/Ryzen#GCC
12:26 PM
in newer versions of gcc there are zen flags that we can use
12:26 PM
lucifer

nice.
12:26 PM
alastairp

oh amazing... I guess you don't know which machine you're going to run on!
12:27 PM
let me open another PR to upgrade the version of annoy. at least we'll use the right extensions when using gcc 5. We can address it again when we upgrade to a new baseimage
12:27 PM
lucifer

+1
12:31 PM
yeah it just says it can run on any of these processors https://docs.microsoft.com/en-in/azure/virtual-...
12:32 PM
peterhil has quit
12:36 PM
peterhil joined the channel
12:54 PM
alastairp: can you take a look at https://github.com/metabrainz/listenbrainz-serv... next? i have been looking int orabbitmq and have quite a few discrepancies in how LB connects to rabbitmq.
12:55 PM
you can ignore the latest commits, i am trying to come up with a threading solution but haven't reached there yet.
12:56 PM
i have written down the issue i found in context of the request consumer but these issues exist elsewhere as well. in api, spark reader so on. i didn't check ts writer yet so don't know about that one.
12:58 PM
peterhil has quit
13:04 PM
peterhil joined the channel
13:12 PM
yvanzo

hi lucifer: thanks, it's likely relevant, btw you would be very welcome to look at sir again after you are done with your current project.
13:13 PM
lucifer

sure :D
13:13 PM
yvanzo

yyoung: thanks, did not reply the other thread yet, will check your PR as well.
13:31 PM
BrainzGit joined the channel
13:34 PM
alastairp

lucifer: sure, I'll look at it tomororw when I'm on LB tasks
13:44 PM
lucifer

awesome, thanks!
13:47 PM
CatQuest

ok yvanzo I'm here now
13:56 PM
BrainzGit

[acousticbrainz-server] 14alastair merged pull request #409 (03master…annoy-valid-extensions): Upgrade annoy, force a CPU architecture to avoid unavailable extensions https://github.com/metabrainz/acousticbrainz-se...
13:57 PM
lucifer

alastairp: https://github.com/actions/runner/issues/1069, unless this changed in last couple of months. i am not sure how avx512 caused that build issue
13:58 PM
alastairp

shrug. it was clearly the same issue reported in the annoy repo
13:58 PM
the solution I just merged seems to work fine though
14:01 PM
lucifer

ah cool then.
14:07 PM
BrainzGit

[acousticbrainz-server] 14alastair opened pull request #410 (03master…similarity-n-jobs): Specify number of threads to use when building annoy index https://github.com/metabrainz/acousticbrainz-se...
14:07 PM
KassOtsimine has quit
14:07 PM
KassOtsimine joined the channel
14:11 PM
alastairp

lucifer: could you take a look at that + docker-server-configs? If it looks good I'll deploy on beta again to verify that the new index works
14:23 PM
lucifer

on it
14:29 PM
done
14:29 PM
alastairp

nice
14:46 PM
yvanzo

CatQuest: ok thanks for having settled that
14:47 PM
peterhil has quit
14:47 PM
peterhil joined the channel
15:04 PM
wargreen joined the channel
15:38 PM
akshaaatt[m]

Hi lucifer ! I was planning to release the wordpress blog I wrote a while back.
15:39 PM
lucifer

akshaaatt[m]: hi! let's wait till the app update is released and published.
15:39 PM
akshaaatt[m]

Sure! I was about to say that :)
15:39 PM
lucifer

i have reviewd two of the PRs and two are left.
15:39 PM
akshaaatt[m]

Sounds great!
15:40 PM
lucifer

i'll get to the other two probably tonight or early tomorrow.
15:42 PM
ritiek joined the channel
15:50 PM
peterhil has quit
15:59 PM
BrainzGit

[acousticbrainz-server] 14alastair merged pull request #410 (03master…similarity-n-jobs): Specify number of threads to use when building annoy index https://github.com/metabrainz/acousticbrainz-se...
15:59 PM
[acousticbrainz-server] release 03v-2021-07-26.1 has been published by 14github-actions[bot]: https://github.com/metabrainz/acousticbrainz-se...
16:02 PM
[bookbrainz-site] 14MonkeyDo merged pull request #676 (03master…fix-achievement): FIX(BB-627): set badges in achievement section https://github.com/bookbrainz/bookbrainz-site/p...
16:04 PM
akshaaatt[m]

Thanks a lot lucifer . I have written a medium article as well recently regarding the GSoC experience. Will share it when we have the meeting in an hour :D
16:43 PM
alastairp

zas: I need to rsync 50gb of data between servers. should it be faster through public IP, or private network, or should they be the same? I'm seeing only 3-6MB/sec transfer, which seems low. Are you imposing any speed restrictions?
16:44 PM
(frank to clash)
16:45 PM
zas

nope no speed restriction, it should be faster over private network
16:46 PM
ruaok

Freso: slow week for me, just some bits and bobs, was on semi vacation am on vacation now.
16:46 PM
ruaok runs off to dinner
16:47 PM
https://usercontent.irccloud-cdn.com/file/hx85D...
16:49 PM
zas

enjoy!
16:59 PM
peterhil joined the channel
17:00 PM
Freso

<BANG>
17:00 PM
Estas Esperanta Lundo!
17:00 PM
https://www.youtube.com/watch?v=4dNRtJ9ZEGY
17:00 PM
Mi ricevis tri recenzojn senditajn al mi:
17:00 PM
yyoung diras:
17:00 PM
Freso has quit
17:00 PM
Freso joined the channel
17:00 PM
R.I.P. :(
17:01 PM
"""
17:01 PM
- I made the last changes to PR #2151 and communicated with editors on the forum.
17:01 PM
- Experimented a way to implement MBS-9902 and opened a PR for review.
17:01 PM
- Made a UI prototype for MBS-3774 according to reosarevok 's suggestion and posted it on the forum for feedback.
17:01 PM
- That's all, thank you.
17:01 PM
"""
17:01 PM
BrainzBot

MBS-9902: Support auto-select/cleanup/validation of more than one relationship type for external links https://tickets.metabrainz.org/browse/MBS-9902
17:01 PM
MBS-3774: Add URL relationship with begin and end dates https://tickets.metabrainz.org/browse/MBS-3774
17:01 PM
Freso

monkey diras:
17:01 PM
"""
17:01 PM
Hi everyone!
17:01 PM
Last week I mostly worked on some improvements for the ListenBrainz player. Some stability improvements and the addition of new features, namely native notifications and media control. Associated PRs are #1539 and #1561 which have more detailed descriptions.
17:01 PM
You can test this all out on test.listenbrainz.org <http://test.listenbrainz.org/>;, and feedback would be very appreciated if you find anything that looks broken or unintended (or if you’re happily surprised too).