lucifer: damselfish didn't get daily jams today. :(
zas
response time seems to degrade, but I'm not sure why yet (this is the response time as measured by openresty and stored in the logs). We need to investigate this before moving more services to the new gateways. For now, I have no explanation.
hmmm, something wrong with data representation
yup, I know what's happening (more or less). To get correct values (well, almost, since that's a mean of means), be sure to select only rex+rudi in the host selector
when comparing this way, no significant difference (and that's expected)
(well, roughly, because the DNS TTL is 5 minutes, so the traffic switch is progressive)
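(To make the mean-of-means point concrete, a small Python sketch with made-up per-host numbers; only the host names match the discussion above.)

```python
# Hypothetical per-host request counts and mean response times (ms),
# illustrating why averaging per-host means is misleading when one
# host (kiki) handles far more traffic than the others.
hosts = {
    "kiki": {"requests": 90_000, "mean_ms": 40.0},
    "rex":  {"requests": 3_000,  "mean_ms": 25.0},
    "rudi": {"requests": 3_000,  "mean_ms": 25.0},
}

# Naive "mean of means": every host counts equally, regardless of traffic.
mean_of_means = sum(h["mean_ms"] for h in hosts.values()) / len(hosts)

# Request-weighted mean: what an individual request actually experienced.
total_requests = sum(h["requests"] for h in hosts.values())
weighted_mean = sum(h["requests"] * h["mean_ms"] for h in hosts.values()) / total_requests

print(f"mean of means: {mean_of_means:.1f} ms")   # 30.0 ms
print(f"weighted mean: {weighted_mean:.1f} ms")   # ~39.1 ms
```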
atj
OK, so mean request time on kiki was ~40ms, and on rex/rudi it seems to have settled at ~25ms
but kiki was handling much more traffic
zas
yes
traffic is very low on the switched websites; we'll have better figures when we switch coverartarchive (and mb ofc)
atj
will be interesting to see given we have an extra layer on the new gateways
zas
according to my measurements, the extra layers should have minimal impact: we have the load balancer, then haproxy, then openresty, and they use the PROXY protocol (which means an extra step and a lower MTU)
but all this is very fast (a few ms)
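(A rough sketch of the hand-off zas describes, assuming haproxy forwards to openresty with the PROXY protocol so the client IP survives the extra hop; addresses, ports and file layout are made up.)

```
# haproxy.cfg (hypothetical): forward to openresty and announce the
# original client address via PROXY protocol v2
backend openresty_http
    server openresty 127.0.0.1:8080 send-proxy-v2

# openresty/nginx.conf (hypothetical): accept PROXY protocol and
# restore the real client IP for logging/rate limiting
server {
    listen 8080 proxy_protocol;
    set_real_ip_from 127.0.0.1;      # trust the PROXY header only from haproxy
    real_ip_header proxy_protocol;

    location / {
        proxy_pass http://127.0.0.1:9000;   # hypothetical app backend
    }
}
```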
atj
and the servers are faster
zas
yes, and much more scalable
lucifer
mayhem: i see why that happened. jukevox, alastair etc. also didn't get daily jams for the same reason 🤦
zas
on https traffic (which is CPU intensive) the new setup should be much better overall
One thing still missing is moving keydb to rex/rudi (they still use the kiki/herb instances)
atj
can we start putting docker volumes in /srv or something going forward?
zas
yes, especially openresty
atj
too many containers with volumes in /home/zas ;)
zas
yup, I agree
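(A minimal sketch of what volumes under /srv could look like in a compose file; the service, image and paths are illustrative only.)

```yaml
# docker-compose.yml (hypothetical service, illustrative paths only)
services:
  openresty:
    image: openresty/openresty:latest
    volumes:
      # bind mounts under /srv instead of a user's home directory
      - /srv/openresty/conf:/usr/local/openresty/nginx/conf:ro
      - /srv/openresty/logs:/usr/local/openresty/nginx/logs
```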
Lotheric has quit
Lotheric joined the channel
I'll not switch more domains until we set up the missing parts; the idea here was to get some real traffic in order to detect potential issues
atj
it's a good idea to do it this way
I'm not sure about minio, stuff like this gives me pause: "MinIO strongly recommends production clusters consist of a minimum of 4 minio server nodes in a Server Pool for proper high availability and durability guarantees."
but then the alternatives look worse
building a proper MinIO cluster is going to cost €€€
mayhem
alastairp: monkey: can you please reply to the last.fm rec/UX doodle soon?
monkey
Yus
zas
atj: yes, I'm not sure either
monkey
I tried opening it yesterday but it wasn't working for me, will try again
lucifer
mayhem: i am around now as well to discuss similar recordings.
mayhem
I was playing with the similarities last night and noticed a few things.
First off, the 7 day data is pretty bad.
lucifer
i see
mayhem
which makes little sense, because the first dataset you had a few days ago was even smaller and it was... kinda good.
so, I can't explain that. might've been a fluke.
next, I had an insight, but that insight hasn't been tested yet.
namely, that if we train on too much data, our data set tends towards noise.
I don't know if that is actually true.
lucifer
we can train on 180-365 days to test that?
mayhem
so, I would like to train on these data sets: 30d, 90d, 180d, 1 year, 3 year, 10 year, and all time.
lucifer
sounds good
but we'll have to use appropriate thresholds to keep lookups fast
mayhem
and then my goal is to pick a number of tracks as seed tracks and study them.
yes, that is a good point. can we make the thresholds relative to the amount of data?
so that it picks itself in a way?
lucifer
hmm not sure. at max 100 or 200 similar recordings for each track maybe?
mayhem
do you do any pruning of the data while you are generating it or is the pruning one of the last steps?
lucifer
last step.
mayhem
then yes, let's say we pick max 200 tracks for the next round.
and then we remove the threshold concept?
well, swap these concepts?
I think total count is a better way of doing this.
lucifer
i am thinking of using both: 200 max, but only if the track meets the threshold, say 5 or 10.
mayhem
can we please try without threshold first?
I would like to see what that is like. it might work better.
because right now I want to see what the long tail for some of this data is.
lucifer
sure. can try.
so 200 tracks per recording without threshold?
mayhem
yes.
lucifer
👍
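(A rough plain-Python sketch of the agreed pruning step, keeping the top 200 similar recordings per recording with no count threshold; the data layout is hypothetical and the real pipeline runs in Spark.)

```python
from collections import defaultdict

def prune_similar_recordings(pairs, max_per_recording=200, min_threshold=None):
    """Keep at most `max_per_recording` similar recordings per recording.

    `pairs` is assumed to be an iterable of (recording_a, recording_b, count)
    tuples from the similarity computation. `min_threshold` is optional; per
    the discussion above we try None first (no threshold) and rely purely on
    the per-recording cap.
    """
    by_recording = defaultdict(list)
    for rec_a, rec_b, count in pairs:
        if min_threshold is not None and count < min_threshold:
            continue
        by_recording[rec_a].append((rec_b, count))

    pruned = {}
    for rec, similars in by_recording.items():
        # keep the highest-count neighbours, drop the long tail past the cap
        similars.sort(key=lambda x: x[1], reverse=True)
        pruned[rec] = similars[:max_per_recording]
    return pruned
```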
mayhem
great.
another q: are hated tracks filtered post CF data generation?
lucifer
nope currently we don't do anything with loved/hated tracks
mayhem
but that would be the right place to do this, no?
lucifer
yes right.
mayhem
let me put that on your todo list as something to do when you have some time.
at least the hated bit -- that doesn't involve another round of CF tuning.
lucifer
where should we do it? spark, LB ingest, LB query time?
3 degrees of responsiveness
mayhem
I was thinking post CF generation. take all the generated tracks from CF and remove all the ones that appear in the user's hated list.
but I am not married to that approach.
I think a more comprehensive approach is to tune the inputs to CF so that CF knows all about this.
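(A minimal sketch of the post-CF filtering idea, assuming recommendations carry a recording_mbid and the user's hated recordings are available as a set of MBIDs; names are illustrative, not the actual LB code.)

```python
def filter_hated_tracks(recommendations, hated_recording_mbids):
    """Drop CF-recommended recordings the user has marked as hated.

    `recommendations` is assumed to be a list of dicts with a
    'recording_mbid' key; `hated_recording_mbids` is a set of MBIDs taken
    from the user's feedback (score == -1 in LB's love/hate feedback).
    """
    hated = set(hated_recording_mbids)
    return [r for r in recommendations if r["recording_mbid"] not in hated]
```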
lucifer
yes makes sense, let's start there and see where to go from there.
mayhem
but I am hesitant to get into a round of CF filtering right now. it seems to be working and I feel like focusing on other things.
great!
alastairp
mayhem: done. I put a maybe on tues because it's a holiday here, not sure what I'll be doing
mayhem
seems that we're leaning towards thursday anyway.
thanks
yep, thursday it is. will send meeting invite later.
alastairp
ok
mayhem
alastairp: lucifer: right now the cover art stuff is in a separate repo (initial goals were unclear) and is being served from a different URL.
these endpoints might get a lot of traffic in the future -- not sure.
should this code be integrated into the LB codebase and simply be made a new endpoint or two, or should we start a new docker container?
integrate now, move later?
zas: atj_mb: you two about for this question?
alastairp
I think integrate now/move later is better
is the python code fast?
lucifer
sounds good to put it in LB for now. when we want to publicize it or show it on lb.org etc., move it
mayhem
it's a question about future growth and I wonder how the new gateways impact this.
alastairp
keeping it in the same codebase but starting a new container for it should be straightforward too
mayhem
alastairp: there isn't much slow stuff in the code, but it does need to make requests to the DB to resolve IDs.
and with the new gateways it should be easy to split off one endpoint and have it go to a new server, or?
I'll proceed with a PR into the codebase for now. loads easier.
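(A sketch of what "integrate now" could look like, assuming the cover art code is mounted as a Flask blueprint inside the LB webserver; the route and names are hypothetical.)

```python
from flask import Blueprint, jsonify

# Hypothetical blueprint; the real routes would come from the cover art code.
cover_art_bp = Blueprint("cover_art", __name__)

@cover_art_bp.route("/1/cover-art/grid/<uuid:release_group_mbid>")
def cover_art_grid(release_group_mbid):
    # Resolve IDs via the DB (the only potentially slow part mentioned above),
    # then build and return the grid. Placeholder response shown here.
    return jsonify({"release_group_mbid": str(release_group_mbid)})

# In the app factory:
#     app.register_blueprint(cover_art_bp)
# Moving it to its own container later only changes where the blueprint
# (or a small app wrapping it) is deployed, not the endpoint URLs.
```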
lucifer
mayhem, i created daily jams now for all users who didn't get jams yesterday.
mayhem
thank you!
atj
o/
"cover art stuff" being the code to create the grids?
alastairp
yeah, we already send specific URLs/domains to different containers in the gateways (like websockets)
Hellow1 joined the channel
mayhem
atj: yes.
atj: this is a general traffic growth question.
does it ever make sense for us to move heavy-traffic things to a separate sub-domain, or should we always assume that we can do per-URL routing to back-ends?
atj
right, well the new gateways should be able to handle a lot more traffic; however, I don't think the gateways are generally the bottleneck
mayhem
can we?
agreed -- this convo is more about the user's perspective and not having to move/redirect to new subdomains when traffic grows.
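(To illustrate the per-URL routing being discussed, a hypothetical openresty/nginx snippet that peels one path off to its own backend while the same domain keeps serving everything else; upstreams, addresses and paths are made up, and TLS termination is omitted from this sketch.)

```nginx
# Hypothetical upstreams; only the routing pattern matters here.
upstream lb_web    { server 10.0.0.10:8000; }
upstream cover_art { server 10.0.0.20:8000; }

server {
    listen 80;
    server_name listenbrainz.org;

    # Heavy-traffic endpoint routed to its own backend, same domain.
    location /1/cover-art/ {
        proxy_pass http://cover_art;
    }

    # Everything else goes to the main app.
    location / {
        proxy_pass http://lb_web;
    }
}
```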