in #metabrainz

1:41 AM
rdswift has quit
1:41 AM
rdswift_ is now known as rdswift
2:31 AM
noet_all joined the channel
2:32 AM
noet_all has left the channel
2:43 AM
opal joined the channel
2:48 AM
relaxoMob has quit
2:51 AM
relaxoMob joined the channel
3:16 AM
relaxoMob has quit
3:34 AM
relaxoMob joined the channel
4:45 AM
relaxoMob has quit
4:51 AM
relaxoMob joined the channel
5:13 AM
relaxoMob has quit
5:20 AM
relaxoMob joined the channel
5:29 AM
bitmap

zas: there's not much info in pg logs other than that checkpoints were occurring too frequently (usually indicating heavy writes)
5:30 AM
however the load averages on wolf and the spark cluster nodes were higher than usual during the same time frame, so perhaps something heavy was being processed there
6:14 AM
relaxoMob has quit
6:21 AM
relaxoMob joined the channel
6:28 AM
BrainzGit

[docker-python] 14mwiencek opened pull request #19 (03master…update-3.11): [WIP] Update 3.11 image https://github.com/metabrainz/docker-python/pul...
6:29 AM
yvanzo

bitmap, reosarevok: Updating all MB background task containers for reports and admin/run.
6:29 AM
bitmap

ok
6:41 AM
yvanzo

Actually subscriptions are still being processed, will do this afternoon.
7:51 AM
FichteFoll has quit
7:57 AM
FichteFoll joined the channel
8:42 AM
zas

bitmap: there was something running on gaga matching time floyd was under load
8:42 AM
bitmap

ah
8:43 AM
zas

https://stats.metabrainz.org/d/000000068/hetzne...
8:44 AM
bitmap

mbid mapper I guess?
8:56 AM
mayhem

this RO DB server could be a really good idea
8:56 AM
zas

around 16UTC when floyd WALs started to accumulate
8:56 AM
https://www.irccloud.com/pastebin/HRJRjMPx/
8:58 AM
mayhem

ironically that is the MB metadata cache, designed to keep the load on MB low. 🤭
8:58 AM
lucifer

mayhem: we write data to a few tables too so we need write access as well.
8:58 AM
zas

btw, the same thing happened on 7 sep, at same time
8:58 AM
mayhem

yes, we can write back to the RW server.
8:59 AM
zas

https://stats.metabrainz.org/d/000000067/postgr...
8:59 AM
mayhem

zas: its a cron job, so yes.
8:59 AM
zas

look at WAL graphs at the bottom
8:59 AM
mayhem

what peaks? I dont see any giant GB large peaks. do you?
9:00 AM
😒
9:00 AM
fletchto99 has quit
9:02 AM
lucifer: this is something for us to discuss in person. but I think we should make more use of couchdb.
9:02 AM
zas

actually I think it starts a bit before 16UTC, perhaps that's the previous cron job starting the issue
9:02 AM
Oct 5 15:00:01 gaga CRON[6412]: (listenbrainz) CMD ([ $(date +%d) -le 7 ] && /usr/local/bin/python /code/mapper/manage.py cron-build-mb-metadata-cache >> /code/mapper/cron-mb-metadata-cache.log 2>&1)
9:02 AM
fletchto99 joined the channel
9:02 AM
lucifer

mayhem: we can do that yes but we won't be able to join that with any of our other data then.
9:03 AM
i guess it would be fine to store the mb_metadata_cache but it wouldn't lower the load on MB db because all the data is still read from there.
9:04 AM
imo we need to full proof the incremental updates so that we can totally get rid of the bulk generate from scratch.
9:07 AM
bitmap

can you use unlogged tables? that would solve the wal accumulation issues. (you won't be able to query the tables from a standby though, you'll have to use the master)
9:07 AM
zas

also Jackson5 were clearly active during the time floyd was under load, so this has to be checked too
9:10 AM
lucifer

bitmap: that should be possible i guess. will need to check why so much wal accumulated though. we don't write back much, just read a lot.
9:11 AM
mayhem

lucifer: yes, to all that. the inability to do joins limits what we can do, but I think there are many datasets that we envision hosting with DSH-on-spark could actually be handled by couchdb. mb metadata may not be the best example. but something like last_listened_at for all users could.
9:11 AM
lucifer

mayhem: yes that makes sense
9:13 AM
mayhem

and on making the cached data incremental updates work -- we'll have a chance to look at that real soon. joy.
9:14 AM
zas

bitmap: https://stats.metabrainz.org/d/000000067/postgr... look at db_inserted
9:14 AM
bitmap

yes I was just checking that
9:16 AM
mayhem

oh, yes! Everyone, please bring your summit shirt today! we need to make a group photo
9:18 AM
bitmap

lucifer: the graph zas linked shows 9k insert ops/s, does that seem wrong?
9:20 AM
(that is based on tup_inserted so should mean 9k rows)
9:21 AM
lucifer

bitmap: did some rough calculations, 9k ops/s for ~8 mins ~ 4.5M inserts is possible (actual number is probably closer to 3M) but yes.
9:21 AM
zas

I'm curious about the drop of db_inserted during half an hour
9:22 AM
https://usercontent.irccloud-cdn.com/file/95I9V...
9:30 AM
bitmap

lucifer: are the tables being inserted into all in the mapping schema?
9:31 AM
lucifer

bitmap: yes
9:33 AM
lusciouslover joined the channel
9:33 AM
zas

bitmap: are you aware of Aperture ? https://github.com/fluxninja/aperture https://blog.fluxninja.com/blog/protecting-post...
9:34 AM
bitmap

I am not
9:36 AM
will check it out
9:43 AM
reosarevok

mayhem: I cannot find mine D: did you find a random L shirt laying about in the office any of these days? 😅
9:43 AM
(I will be at the office in 30)
9:43 AM
mayhem makes he L gesture on his forehead
9:44 AM
lusciouslover has quit
10:07 AM
I mean, if that gets me a t-shirt again, I'll take the L
10:07 AM
reosarevok hides
10:48 AM
mayhem

outsidecontext: error handling has been fixed now on the DSH endpoint.
10:48 AM
outsidecontext

mayhem: will check in a minute
10:48 AM
mayhem

now to actually work on why some bits dont return data.
10:56 AM
yvanzo: ping
11:04 AM
outsidecontext

curl -X POST -k -H 'Content-Type: application/json' -i 'https://labs.api.listenbrainz.org/mbid-mapping-release/json' --data '[{"[artist_credit_name]": "Paradise Lost", "[recording_name]": "Paradise Lost", "[release_name]": "Drown in Darkness \\u2013 The Early Demos"}, {"[artist_credit_name]": "Paradise Lost", "[recording_name]": "Paradise Lost (live)", "[release_name]": "Drown in Darkness \\u2013 The Early Demos"},
11:04 AM
{"[artist_credit_name]": "Paradise Lost", "[recording_name]": "Paradise Lost", "[release_name]": "Lost Paradise"}, {"[artist_credit_name]": "Opened Paradise", "[recording_name]": "Opened Paradise", "[release_name]": "Occult"}]'
11:04 AM
atj

amazing https://www.404media.co/bing-is-generating-imag...
11:05 AM
outsidecontext

mayhem: things start work well now, but the above curl returns empty. in general this seems to happen whenever track title matches artist name
11:06 AM
mayhem

outsidecontext: can you paste that as a pastebin?
11:06 AM
or something that doesn't break the url...
11:06 AM
outsidecontext

yes, sorry. actually that was the intention :) one sec
11:07 AM
mayhem

https://www.irccloud.com/pastebin/PuBsLXzJ/
11:07 AM
monkey: ^^ I get this error when I try to start lb-server on the moar-cached-data branch
11:07 AM
outsidecontext

mayhem: https://gist.github.com/phw/92fef67e4ef2958f301...
11:07 AM
mayhem

thx
11:08 AM
monkey

OK, let me push
11:08 AM
mayhem

lucifer: you about?
11:08 AM
lucifer

yes
11:08 AM
mayhem

see the query that outsidecontext posted above. that is against the mapping with releases endpoint.
11:09 AM
can you plz take a look at that if you have a moment?
11:10 AM
monkey

mayhem: pushed, should be fixed now
11:10 AM
outsidecontext

lucifer: all the requested tracks are supposed to yield a result as there is a recording for it. but recording name equals band name. I can reliably reproduce the issue with this endpoint
11:10 AM
lucifer

mayhem, outsidecontext: sure looking into it
11:10 AM
mayhem

thx monkey
11:10 AM
thx lucifer
11:10 AM
lucifer

what branch is it running on?
11:10 AM
mayhem

thats on labs, no? should be current prod more or less.
11:12 AM
lucifer

cool got it
11:15 AM
hand-bot joined the channel
11:17 AM
mayhem

monkey: different error now
11:17 AM
https://www.irccloud.com/pastebin/5UADjJG7/
11:17 AM
did I forget to run something? I did a develop.sh build
11:28 AM
lusciouslover joined the channel
11:30 AM
lucifer

mayhem: looking at the explain-mbid-mapping output, this seems to be like a typesense thing
11:31 AM
aerozol

Reminder to everyone to please put any important discussions/to-dos/decisions into the summit meeting notes: https://docs.google.com/document/d/1bz32duq95jS...
11:32 AM
Both to keep us on track/as a handy reminder, and to keep the community and those who couldn’t make it in the loop ➰
11:33 AM
monkey

mayhem: Looks like the error is related to the manifest file we write to disk (so that flask knows which javascript file to load). I don't think we changed anything, so that's surprising
11:33 AM
lusciouslover has quit
11:36 AM
BrainzGit

[musicbrainz-server] 14akshaaatt closed pull request #2507 (03akshat/intro-new…akshat/login): Login Section https://github.com/metabrainz/musicbrainz-serve...
11:37 AM
mayhem

ok, how do we get oast that monkey?
11:37 AM
past, even
11:38 AM
monkey

I'm looking into it, but I currently have no idea. Maybe comment out this line?
11:38 AM
https://github.com/metabrainz/listenbrainz-serv...
11:40 AM
yeah, that built for me
11:42 AM
mayhem

go two warnings, but looks a lot better.
11:42 AM
css esta borked, however.
11:46 AM
monkey

It's Taco'clock ! https://tacoalto.es/carta
11:47 AM
reosarevok

monkey: which was the taco that did no longer eixst?
11:47 AM
mayhem

monkey: Quesadilla de Pibil , Taco Baja, Taco Tinga Tinga, Taco Gringa, Taco Sinaloa
11:47 AM
reosarevok

exist too
11:47 AM
monkey

mayhem: pull and try again, I had an error in my less file
11:47 AM
mayhem

reosarevok: Quesadilla de Nopales
11:47 AM
monkey

No, they had that one
11:48 AM
mayhem notes that tinga tinga means bit butt in swahili
11:48 AM
mayhem

*big
11:48 AM
monkey

They didn't have the Chihuahua taco, and the Quesadilla de Jamón con Champiñones was actually Tinga con Champiñones instead
11:54 AM
BrainzGit

[listenbrainz-android] 14akshaaatt opened pull request #265 (03main…dev): Dev to Main https://github.com/metabrainz/listenbrainz-andr...
11:55 AM
[listenbrainz-android] 14akshaaatt merged pull request #265 (03main…dev): Dev to Main https://github.com/metabrainz/listenbrainz-andr...
11:58 AM
monkey

Here's the up-to-date menu: https://usercontent.irccloud-cdn.com/file/nbDt6...
11:58 AM
mayhem: no gringa taco, qyuesadilla gringa instead?
11:58 AM
mayhem

taco pirata instread, plz
12:01 PM
reosarevok

bitmap: MBS-12969 has the reproduceable bug case
12:01 PM
BrainzBot

MBS-12969: Recording relationships are not shown as deleted in the "Related Works" column https://tickets.metabrainz.org/browse/MBS-12969
12:04 PM
bitmap

thx
12:06 PM
mayhem

monkey: pushed the change to fetch artist data and emit it into props.
12:06 PM
monkey

Weeeeeee
12:07 PM
mayhem

do you still have the paste that describes what else you need?
12:07 PM
scrollback is long.
12:07 PM
monkey

https://www.irccloud.com/pastebin/uxGwYZ6g/
12:07 PM
Hop ^
12:07 PM
mayhem

thx
12:07 PM
monkey

I guess ignore parts of the data structure at the bottom, as we discussed
12:07 PM
aerozol

lucifer: no pings from you, so I assume you have enough to work with? Keep me updated!
12:23 PM
mayhem

lucifer: the autotag that outsidecontext is working on, is working quite well already. quite exciting, really.
12:23 PM
!m outsidecontext
12:23 PM
BrainzBot

You're doing good work, outsidecontext!
12:23 PM
mayhem

!m lucifer