alastairp: I just realized you've worked w/ freesound.org as well???
That's insane. I've used that site for so many of my personal music projects lool
Also, my most popular meme was also a fart sample that I took from freesound.org and visualized with a spectrum analyzer and posted on r/FL_studio because it looked beautiful XD
Good times
odnes joined the channel
odnes has quit
mayhem
moin moin!
Pratha-Fish
mayhem: hey tell me the story of how you got amazon to pay a 3 year due invoice by sending them a cake sometime lol
yvanzo: moin! Anything special for this docker release?
yvanzo
hi reosarevok: yes, I drafted release notes but have to make some improvements.
reosarevok
Ok :) I'll start the prod release (won't be around in the evening probably)
but we can look at that bit later
CatQuest
oh hey reo ˆ__ˆ
reosarevok
Hi!
CatQuest
:D
mayhem
lucifer: on TS the listened_at_track_name_user_id_ndx_listen index was created live and we didn't decide at the time if we wanted to keep it, yes?
because if that is so then PR 2042 makes sense. :)
lucifer
mayhem: yes it was created live. we needed it to keep the on conflict clauses working.
still need to figure out how many dupes there are in the db and how to delete those.
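Counting the dupes lucifer mentions could be sketched like this. This is a minimal sketch against a toy in-memory SQLite table, not the real TimescaleDB listen table; the column names (`user_id`, `listened_at`, `track_name`) are taken from the chat, everything else is an assumption:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE listen (
        user_id INTEGER,
        listened_at INTEGER,
        track_name TEXT
    )
""")
conn.executemany(
    "INSERT INTO listen VALUES (?, ?, ?)",
    [
        (1, 100, "Song A"),
        (1, 100, "song a"),   # case-insensitive dup of the row above
        (1, 200, "Song B"),
    ],
)

# Count groups that collide when track_name is compared case-insensitively.
dupes = conn.execute("""
    SELECT user_id, listened_at, lower(track_name), COUNT(*) AS n
    FROM listen
    GROUP BY user_id, listened_at, lower(track_name)
    HAVING n > 1
""").fetchall()

print(len(dupes))  # number of duplicate groups
```

On the real table the same GROUP BY would be restricted to a time range so it doesn't scan everything at once.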
mayhem
there is dup detection and removal code in the MBID mapping stuff, you can take a look at it.
to use it for TS, I think we would have to do it on a set of chunks at the same time
well, one at a time, once the new index is in place.
lucifer
we cant create the index without deleting dupes.
mayhem
why not delete the dups?
lucifer
ah no, i mean we should delete the dupes. i misunderstood your message as saying to create the index first and delete the dupes afterwards
mayhem
that would be ideal, but not possible.
we will have the problem that new dups can be created while we are deleting the old ones.
but I wonder if we can make the script that deletes dups work on ranges or the whole listen table.
then we do a month or so at a time and then once that is done, we try to create the index.
if that fails, we delete dups across the whole table.
but I doubt that would work, so we might end up chasing our tail on this one.
lucifer
i think dup deletion should be fast enough that we can stop ts writer while the script runs.
mayhem
I really doubt that.
lucifer
i see, lets try how fast it goes on one chunk and then decide what to do accordingly.
mayhem
well, if we do it in python then maybe. but pure SQL, I think that is going to OOM
lucifer
hmm, dont think it should oom but yeah really cant say without trying
mayhem
if we just fetch all the tracks ordered by listened_at and the other dedup fields and then just slowly delete all the dups, that could work. it might be fast enough for the second pass to run with TS writer stopped.
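The single-pass delete mayhem describes (group on the dedup fields, keep one row per group, delete the rest) could look roughly like this. Again a toy SQLite sketch; the real script would run per time range against TimescaleDB, and `rowid` stands in for whatever unique identifier the real table has:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE listen (user_id INTEGER, listened_at INTEGER, track_name TEXT)"
)
conn.executemany("INSERT INTO listen VALUES (?, ?, ?)", [
    (1, 100, "Song A"),
    (1, 100, "song a"),
    (1, 100, "SONG A"),
    (1, 200, "Song B"),
])

# Keep the first row (lowest rowid) of each case-insensitive group,
# delete the rest. A production run would do this a month at a time,
# as discussed above, rather than over the whole table.
conn.execute("""
    DELETE FROM listen
    WHERE rowid NOT IN (
        SELECT MIN(rowid)
        FROM listen
        GROUP BY user_id, listened_at, lower(track_name)
    )
""")
remaining = conn.execute("SELECT COUNT(*) FROM listen").fetchone()[0]
print(remaining)  # 2
```

Doing the grouping in SQL like this keeps memory bounded by the range being processed, which is the OOM concern raised above.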
Just published the blog post, apparently my first attempt failed.
mayhem
lucifer: which is the last_played API endpoint? I can't find it in the docs...
lucifer
mayhem: you mean when the recommendation was last played? if so, there is no separate endpoint. the recs json includes the timestamp with the mbid.
mayhem
ahhh, ok, no wonder I couldn't find it.
lucifer
those times are available for all recordings but only stored in spark currently. before sending recs to LB, that data is merged with the recs to add a timestamp field.
mayhem
easy then. :)
Pratha-Fish
hey alastairp sorry for the delay. Couldn't do a lot today, but I am getting started with the updated to-do list right now.
The to-do list is hosted in the journal BTW.
Updating it with specifics of the artist conflation issue too
[acousticbrainz-server] alastair closed pull request #396 (master…AB-407): AB-407: Redirect legacy API endpoints to new endpoint with http redirect https://github.com/metabrainz/acousticbrainz-se...
alastairp: also, you mentioned the part about making a csv with the following columns: mlhd_recording_mbid, mlhd_artist_mbid, mlhd_recording_name, mlhd_artist_name, mb_recording_artist_credit, mb_artist_mbids, mb_canonical_recording_mbid
TBH I am still a bit confused about this one. Maybe breaking it down into some macro steps could help :)
BrainzGit
[critiquebrainz] alastair opened pull request #438 (master…sampledb-missing-entities): Always return dummy data in debug mode if it's not in MusicBrainz https://github.com/metabrainz/critiquebrainz/pu...
alastairp
Pratha-Fish: sure. maybe let's deal with the first 4 columns then
you already look up these fields from the mlhd dataset in the `recording` table and the `artist` table
this will just involve selecting the `name` field from these tables too, and writing them to a new csv file
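The lookup alastairp describes for the first four columns could be sketched as below. The MusicBrainz table and column names (`recording.gid`, `recording.name`, `artist.gid`, `artist.name`) and the sample MBIDs are assumptions for illustration; only the CSV column names come from the chat:

```python
import csv
import io
import sqlite3

# Toy stand-ins for the MusicBrainz recording and artist tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE recording (gid TEXT, name TEXT);
    CREATE TABLE artist (gid TEXT, name TEXT);
    INSERT INTO recording VALUES ('rec-mbid-1', 'Some Recording');
    INSERT INTO artist VALUES ('art-mbid-1', 'Some Artist');
""")

# One (recording_mbid, artist_mbid) pair as it might appear in an MLHD row.
mlhd_rows = [("rec-mbid-1", "art-mbid-1")]

out = io.StringIO()
writer = csv.writer(out)
writer.writerow(["mlhd_recording_mbid", "mlhd_artist_mbid",
                 "mlhd_recording_name", "mlhd_artist_name"])
for rec_mbid, art_mbid in mlhd_rows:
    # The existing lookup already hits these tables; this just also
    # selects the `name` field and writes it out.
    rec_name = conn.execute(
        "SELECT name FROM recording WHERE gid = ?", (rec_mbid,)).fetchone()[0]
    art_name = conn.execute(
        "SELECT name FROM artist WHERE gid = ?", (art_mbid,)).fetchone()[0]
    writer.writerow([rec_mbid, art_mbid, rec_name, art_name])

print(out.getvalue())
```

The remaining columns (`mb_recording_artist_credit`, `mb_artist_mbids`, `mb_canonical_recording_mbid`) would extend the same loop with further lookups.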
lucifer
shared a script to delete listens submitted with the same listened_at and user_id but a different case for track_name.
alastairp
lucifer: that code skeleton looks familiar ;)
lucifer
alastairp: hehe yes, i copied it from your listen_fill_userid script :D
alastairp
so to confirm, we already reject exact duplicates of (userid, submitted, track_name), but we found these cases where we had case-insensitive dups on track name?
lucifer
currently we don't reject those. the PR adds an index to fix that.
before we create the index, we need to cleanup the existing dupes.
the intent is we do 1 pass, then turn off ts writer, do another pass. try to create index. restart ts writer.
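The index the PR adds would presumably be a unique index over the dedup fields with `track_name` lowercased, so the database itself rejects the case-insensitive dups once the cleanup passes are done. A sketch in SQLite (Postgres/Timescale expression-index syntax is close, though the real definition and the interaction with the ON CONFLICT clauses may differ):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE listen (user_id INTEGER, listened_at INTEGER, track_name TEXT)"
)

# Unique expression index: collisions on (listened_at, lower(track_name),
# user_id) are rejected at insert time. Index name taken from the chat.
conn.execute("""
    CREATE UNIQUE INDEX listened_at_track_name_user_id_ndx_listen
    ON listen (listened_at, lower(track_name), user_id)
""")

conn.execute("INSERT INTO listen VALUES (1, 100, 'Song A')")
try:
    conn.execute("INSERT INTO listen VALUES (1, 100, 'song a')")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
print(rejected)  # True
```

This is why the dupes have to go first: creating a unique index fails if any existing rows already violate it, hence the pass / stop-writer / pass / create-index ordering above.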