in #metabrainz

10:57 AM
alastairp

regarding the highlevel dump, this one won't need to select an element of a json column, so we could just `\copy (select highlevel as ll_row_id, data from highlevel_model where model in (our models))`
10:58 AM
https://www.irccloud.com/pastebin/qvsaVTHX/
10:59 AM
lucifer

ruaok: oh ok. thinking about this, we should probably have a playlist endpoint in LB itself to add an item from just a recording mbid.
10:59 AM
(as a future enhancement)
11:00 AM
alastairp: i see, nice. lets do two dumps then, one for bpm stuff and one for mood.
11:00 AM
alastairp

great
11:01 AM
oh, you'll need highlevel_model.model too, to differentiate between the different rows for a given ll.id
11:01 AM
ruaok

lucifer: we have that. its one of the most basic functions.
11:01 AM
alastairp

highlevel_model.highlevel is the same as ll.id. I'd dump ll.id, ll.mbid separately so that pg doesn't have to do a join
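
A minimal sketch of the two-dump idea discussed above, assuming the chat shorthand maps directly onto the schema: `ll` is taken to be a table named `lowlevel` with `id` and `mbid` columns, and model identifiers are taken to be integers. The output file names and the helper `build_dump_queries` are hypothetical. It only builds the psql `\copy` commands as strings, it does not run them.

```python
def build_dump_queries(model_ids):
    """Build two psql \\copy commands so that postgres never has to join.

    Assumptions (not confirmed by the chat): model ids are integers, the
    low-level table is called `lowlevel`, and it has `id` and `mbid` columns.
    """
    if not model_ids:
        raise ValueError("at least one model id is required")

    # int() guards against anything that is not a plain number ending up in the SQL
    id_list = ", ".join(str(int(m)) for m in model_ids)

    # Dump 1: one row per (low-level row, model), including the model column
    # so that rows for the same ll.id can be told apart.
    highlevel = (
        "\\copy (SELECT highlevel AS ll_row_id, model, data "
        "FROM highlevel_model "
        f"WHERE model IN ({id_list})) "
        "TO 'highlevel_model.csv' WITH (FORMAT csv)"
    )

    # Dump 2: the id -> mbid mapping, exported separately instead of joined.
    lowlevel = "\\copy (SELECT id, mbid FROM lowlevel) TO 'lowlevel_ids.csv' WITH (FORMAT csv)"

    return highlevel, lowlevel


if __name__ == "__main__":
    for command in build_dump_queries([1, 2]):
        print(command)
```

The two files can then be joined on `ll_row_id` = `id` outside the database.
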
11:02 AM
lucifer

ruaok: oh, i couldn't find it here so thought it doesn't exist. https://listenbrainz.readthedocs.io/en/v-2021-1...
11:03 AM
ruaok

well, how else would we add a track to a playlist?
11:05 AM
https://listenbrainz.org/playlist/3f8f0e65-ee6a...
11:05 AM
lucifer: remind me, what exactly was that list of MBIDs?
11:06 AM
the outliers in my BPM charts?
11:06 AM
lucifer

the mbids were part of the 165 bpm peak
11:07 AM
ruaok

not a single track I've clicked on is 165 BPM.
11:09 AM
alastairp

the first one - the hi-hat/snare is about 165
11:10 AM
ruaok

yerp, could see that one.
11:10 AM
ok, so those cute BPM charts are useless too. 😭
11:10 AM
alastairp

first peak weight is "mean": 0.62 :(
11:11 AM
so it seems pretty certain about that. means that we can't always use the weight for this either
11:12 AM
BrainzGit

[bookbrainz-site] MonkeyDo merged pull request #718 (master…fix#BB-458): Fix[BB-458]: Showing error conflict page on deleting entity twice https://github.com/bookbrainz/bookbrainz-site/p...
11:13 AM
lucifer

what about the second one? Come Meh Way
11:18 AM
https://www.irccloud.com/pastebin/iaaDCRoQ/
11:18 AM
this is what spotify has for the Cantelowes track ruaok shared earlier. AB has 185 BPM for this.
11:21 AM
monkey

Is there a way we can sort items by the most confident high level indicator, for example for this one maybe 'loudness', for another track might be 'danceability' ?
11:33 AM
alastairp

yeah, that's why I wanted to consider using bpm_histogram_first_peak_weight, but it seems that there are many cases where it's pretty confident with this value as well
11:34 AM
lucifer

is this an issue with just high bpm recordings, or can it occur in any type of recording? if it's just high, then we could try querying spotify if a bpm is above a threshold?
11:35 AM
but regardless, querying the bpms spotify has and comparing them with AB might be a nice way to detect issues with AB data and possibly identify the issue with the bpm detection alg.
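
A rough sketch of the comparison suggested above, assuming both BPM sources have already been fetched and keyed by the same recording MBID (the mapping from Spotify tracks to MBIDs is not shown and is an assumption). The function name `bpm_disagreements`, the 5% tolerance and the sample values are all hypothetical.

```python
def bpm_disagreements(ab_bpm, spotify_bpm, tolerance=0.05):
    """Return recordings whose AB and Spotify BPM differ by more than `tolerance`.

    Both arguments are dicts of {mbid: bpm}. The result is a list of
    (mbid, ab_value, spotify_value, relative_difference) tuples, sorted with
    the largest disagreement first.
    """
    flagged = []
    for mbid, ab_value in ab_bpm.items():
        other = spotify_bpm.get(mbid)
        if other is None or other <= 0:
            # No usable reference value for this recording: skip it.
            continue
        relative = abs(ab_value - other) / other
        if relative > tolerance:
            flagged.append((mbid, ab_value, other, relative))
    flagged.sort(key=lambda row: row[3], reverse=True)
    return flagged


if __name__ == "__main__":
    # Made-up values for illustration only.
    ab = {"mbid-a": 185.0, "mbid-b": 120.0}
    spotify = {"mbid-a": 92.5, "mbid-b": 121.0}
    for row in bpm_disagreements(ab, spotify):
        print(row)
```

Sorting by relative difference puts the worst mismatches first, which is where problems in the bpm detection are most likely to show up.
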
11:37 AM
alastairp

no, sorry
11:38 AM
yvanzo

Rescheduled MB search indexes dump to run soon (11:40 UTC) with I/O patch.
11:38 AM
lucifer

ah ok, no worries.
11:38 AM
alastairp

https://usercontent.irccloud-cdn.com/file/lkP1C...
11:39 AM
monkey: in the playlist view, the currently playing track has its right-hand data offset to the right a little bit, is that known?
11:39 AM
monkey

Yep ruaok said the same yesterday :)
11:39 AM
The play icon with and without the circle around it aren't the same width
11:40 AM
alastairp

right, as long as you're aware!
11:41 AM
monkey

Thanks
12:04 PM
ruaok

alastairp: monkey akshaaatt: Freso: invoices please!
12:05 PM
alastairp

it's not december already
12:05 PM
I refuse to believe it
12:05 PM
akshaaatt

On it ruaok
12:05 PM
ruaok

did you do your own research?
12:05 PM
alastairp

I'm trying to, but I can't find any sources which corroborate my belief
12:06 PM
but I "feel" that I'm correct
12:07 PM
ruaok

maybe it's time to make a site that disavows the fact it's Dec already.
12:07 PM
itsnotdecemberitsallaliberellie.com ?
12:10 PM
monkey

Heading to the office, will invoice from there
12:10 PM
Thanks for the reminder
12:17 PM
BrainzGit

[musicbrainz-docker] nikosmichas opened pull request #213 (master…tcp_max_tw_buckets): Limit tcp_max_tw_buckets to avoid connection issues between services https://github.com/metabrainz/musicbrainz-docke...
12:42 PM
ruaok

alastairp: any idea why this would give a 502 ?
12:43 PM
https://www.irccloud.com/pastebin/AqXMhAOo/
12:43 PM
monkey

Too long a list perhaps?
12:43 PM
ruaok

docs don't specify how many IDs I can include in a call.
12:44 PM
alastairp

https://acousticbrainz.readthedocs.io/api.html#...
12:44 PM
maybe it's not visible enough
12:44 PM
ruaok

ah no, i clearly missed that.
12:44 PM
but 502?
12:44 PM
alastairp

but good point - we reject listens if they make it to the webserver and there are too many
12:44 PM
ruaok

should be 400, no?
12:45 PM
alastairp

but if you give _so many_ that uwsgi gives up, it returns 502 before it even gets to the AB code
12:45 PM
ruaok

should that be mentioned in the docs?
12:46 PM
alastairp

yeah, I think we can fit that in
12:50 PM
lucifer

this is same as LB-993
12:50 PM
BrainzBot

LB-993: User feedback endpoint returns 502 when querying too many recordings https://tickets.metabrainz.org/browse/LB-993
12:51 PM
alastairp

yep
12:51 PM
lucifer

we could also increase buffer size for AB like we intend to do for LB
12:52 PM
alastairp

yeah, I don't know what the effect of the buffer size is on the number of workers that we have, I assume it'll increase some memory usage per worker
12:53 PM
but it's an interesting issue that there's always a point where if it's too large, we get the error from uwsgi not our server
12:54 PM
unless you make the buffer size hundreds of mb
12:54 PM
lucifer

possibly but we are talking about 4kb here per request, so the overall memory increase shouldn't be much.
12:54 PM
alastairp

https://stackoverflow.com/a/417184
12:54 PM
lucifer

buffer-size is limited to 65536 bytes :)
12:55 PM
alastairp

cloudflare has a limit of 32k, other CDNs have a limit of 8k
12:56 PM
that query that ruaok showed was only 3.7k, so not sure how it hit the limit
12:56 PM
oh, including headers might have put it over
12:57 PM
lucifer

yup
12:57 PM
alastairp

OK, you've convinced me that 8k is fine
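
One client-side way to avoid the 502 discussed above is to split a long MBID list into several requests whose URLs stay under a byte budget. This is only a sketch: the base URL, the `;` separator and the 3000-byte default (chosen to leave headroom for headers under a 4k buffer) are assumptions, and `batch_mbids` is a hypothetical helper, not part of any AB client.

```python
def batch_mbids(mbids, base_url, max_bytes=3000, sep=";"):
    """Split `mbids` into batches so each request URL is at most `max_bytes` long.

    The URL for a batch is taken to be base_url + sep.join(batch).
    A single MBID that is too long on its own still gets its own batch.
    """
    batches = []
    current = []
    for mbid in mbids:
        candidate = current + [mbid]
        url = base_url + sep.join(candidate)
        if current and len(url.encode("utf-8")) > max_bytes:
            # Adding this MBID would exceed the budget: close the batch.
            batches.append(current)
            current = [mbid]
        else:
            current = candidate
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    # Placeholder identifiers and URL, for illustration only.
    ids = ["a" * 36, "b" * 36, "c" * 36]
    for batch in batch_mbids(ids, "https://example.org/lookup?recording_ids=", max_bytes=120):
        print(len(batch), "ids")
```

The budget covers only the URL, so it should sit well below the server's buffer size to leave room for the request headers.
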
12:59 PM
lucifer

out of curiosity, i wonder how a lookup of say 1000 mbids at a time would be done: fat get or post request?
13:00 PM
ruaok

22% match rate on AB on this one example test case. :(
13:01 PM
lucifer

recordings missing from AB?
13:02 PM
ruaok

24% for alastairp, 15% for monkey, 18% for lucifer.
13:02 PM
lucifer: yep.
13:02 PM
that's a dead end then.
13:02 PM
lucifer

yeah :/
13:02 PM
ruaok

I think AB needs a drastic re-think, alastairp.
13:05 PM
alastairp

what are you looking up?
13:06 PM
ruaok

high level data for tracks that come out of the missed tracks datasets query.
13:08 PM
https://www.irccloud.com/pastebin/zIrpwa0I/
13:08 PM
the last column is mood_aggressive
13:09 PM
it suggests that it could work, if we had sufficient coverage in AB.
13:11 PM
alastairp

a shame that we didn't get around to finishing the mbid redirect table task
13:12 PM
lucifer: maybe we should just go with the external database for this for now so that we can get a bunch of improvements made
13:12 PM
lucifer

sure, sounds good to me.
13:13 PM
alastairp

maybe I'll have a look at that this week
13:14 PM
lucifer

so something like setting up a cron job that queries all recording_redirect entries since last run and then update recording mbids in AB tables?
13:16 PM
alastairp

no, I'd do it on demand. you query an MBID, it looks up all of the possible redirects, then it returns you data for all submissions for all mbids in the "set"
13:16 PM
this way, the results of a query don't change over time
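
The on-demand approach described above could look roughly like this. It is a sketch under stated assumptions: redirects are given as `(old_mbid, new_mbid)` pairs, submissions as a dict of `{mbid: [documents]}`, and both `redirect_set` and `lookup` are hypothetical names. In practice the pairs would come from the redirect table rather than an in-memory list.

```python
from collections import defaultdict, deque


def redirect_set(mbid, redirect_pairs):
    """Return every MBID connected to `mbid` through redirects, in either direction."""
    neighbours = defaultdict(set)
    for old, new in redirect_pairs:
        neighbours[old].add(new)
        neighbours[new].add(old)

    seen = {mbid}
    queue = deque([mbid])
    while queue:
        current = queue.popleft()
        # .get() avoids creating empty entries for MBIDs without redirects
        for other in neighbours.get(current, ()):
            if other not in seen:
                seen.add(other)
                queue.append(other)
    return seen


def lookup(mbid, redirect_pairs, submissions):
    """Return submissions for all MBIDs in the redirect set of `mbid`."""
    return {
        member: submissions[member]
        for member in sorted(redirect_set(mbid, redirect_pairs))
        if member in submissions
    }


if __name__ == "__main__":
    # Placeholder identifiers, for illustration only.
    pairs = [("old-1", "new"), ("old-2", "new")]
    data = {"old-1": ["doc-1"], "new": ["doc-2", "doc-3"], "unrelated": ["doc-4"]}
    print(lookup("old-2", pairs, data))
```

Nothing is rewritten in the stored tables here: the set is resolved only when a query arrives.
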
13:18 PM
lucifer

i see. if you want i could take a quick stab at it, put it on ab beta so that ruaok can test it.
13:19 PM
ruaok

lucifer: alastairp: I think you two can stop working on any of this stuff. it's just not useful.
13:19 PM
lets go back to whatever original tasks we had.
13:20 PM
alastairp: do we have any algorithms that we could incorporate into a new-AB? Let's assume for a second we ditch our AB DB and start over. How would we do that?
13:21 PM
can we collect better data with much smaller segment times that would allow us to build better algorithms for better feature detection later?
13:36 PM
param: trivial PR for you, if you have a sec: https://github.com/paramsingh/pylistenbrainz/pu...
13:41 PM
BrainzGit

[bookbrainz-site] dependabot[bot] opened pull request #726 (master…dependabot/npm_and_yarn/normalize-url-4.5.1): chore(deps): bump normalize-url from 4.5.0 to 4.5.1 https://github.com/bookbrainz/bookbrainz-site/p...
13:41 PM
param

ruaok: done! https://github.com/paramsingh/pylistenbrainz/re...
13:42 PM
I also added you as a collaborator to the repo for the future
13:42 PM
ruaok

sweet, thanks!
13:51 PM
BrainzGit

[bookbrainz-site] dependabot[bot] closed pull request #724 (master…dependabot/npm_and_yarn/tar-4.4.19): chore(deps): bump tar from 4.4.13 to 4.4.19 https://github.com/bookbrainz/bookbrainz-site/p...
13:51 PM
[bookbrainz-site] dependabot[bot] closed pull request #726 (master…dependabot/npm_and_yarn/normalize-url-4.5.1): chore(deps): bump normalize-url from 4.5.0 to 4.5.1 https://github.com/bookbrainz/bookbrainz-site/p...
13:52 PM
[bookbrainz-site] dependabot[bot] opened pull request #727 (master…dependabot/npm_and_yarn/postcss-8.4.4): chore(deps): bump postcss from 8.2.4 to 8.4.4 https://github.com/bookbrainz/bookbrainz-site/p...
13:52 PM
[bookbrainz-site] dependabot[bot] opened pull request #728 (master…dependabot/npm_and_yarn/browserslist-4.18.1): chore(deps): bump browserslist from 4.12.0 to 4.18.1 https://github.com/bookbrainz/bookbrainz-site/p...
13:53 PM
[bookbrainz-site] dependabot[bot] opened pull request #729 (master…dependabot/npm_and_yarn/webpack-5.64.4): chore(deps-dev): bump webpack from 5.12.3 to 5.64.4 https://github.com/bookbrainz/bookbrainz-site/p...
13:53 PM
[bookbrainz-site] dependabot[bot] closed pull request #717 (master…dependabot/npm_and_yarn/webpack-5.64.2): chore(deps-dev): bump webpack from 5.12.3 to 5.64.2 https://github.com/bookbrainz/bookbrainz-site/p...
13:53 PM
monkey

Yeah, that's right !
13:53 PM
Go Dependabot, go !
13:58 PM
Solving security alerts by removing a security alert mitigation tool. How ironic.
14:00 PM
BrainzGit

[bookbrainz-site] MonkeyDo merged pull request #728 (master…dependabot/npm_and_yarn/browserslist-4.18.1): chore(deps): bump browserslist from 4.12.0 to 4.18.1 https://github.com/bookbrainz/bookbrainz-site/p...
14:03 PM
[bookbrainz-site] MonkeyDo merged pull request #727 (master…dependabot/npm_and_yarn/postcss-8.4.4): chore(deps): bump postcss from 8.2.4 to 8.4.4 https://github.com/bookbrainz/bookbrainz-site/p...
14:25 PM
[bookbrainz-site] MonkeyDo merged pull request #725 (master…dependabot/npm_and_yarn/path-parse-1.0.7): chore(deps): bump path-parse from 1.0.6 to 1.0.7 https://github.com/bookbrainz/bookbrainz-site/p...
14:50 PM
[musicbrainz-server] reosarevok opened pull request #2354 (master…MBS-12114): MBS-12114: Account for the "Disk" alternative spelling to "Disc" https://github.com/metabrainz/musicbrainz-serve...
14:51 PM
reosarevok

Threw that tiny change into the milestone ^ :)
14:58 PM
Also
14:58 PM
Apparently some reports failed to run or something (MBS-12112)
14:58 PM
BrainzBot

MBS-12112: Several reports are incorrectly empty https://tickets.metabrainz.org/browse/MBS-12112
14:59 PM
reosarevok

I tried to log into a musicbrainz-website-prod container and run ./admin/RunReports.pl but I get
14:59 PM
root@4bd87d4e8684:/home/musicbrainz/musicbrainz-server# ./admin/RunReports.pl Can't locate List/AllUtils.pm in @INC (you may need to install the List::AllUtils module) (@INC contains: /home/musicbrainz/musicbrainz-server/admin/../lib /etc/perl /usr/local/lib/x86_64-linux-gnu/perl/5.30.0 /usr/local/share/perl/5.30.0 /usr/lib/x86_64-linux-gnu/perl5/5.30 /usr/share/perl5 /usr/lib/x86_64-linux-gnu/perl/5.30 /usr/share/perl/5.30
14:59 PM
/usr/local/lib/site_perl /usr/lib/x86_64-linux-gnu/perl-base) at ./admin/RunReports.pl line 8. BEGIN failed--compilation aborted at ./admin/RunReports.pl line 8.
14:59 PM
yvanzo, bitmap: how do you run reports manually when needed? I know at least bitmap has done that in the past
15:00 PM
I mean, I assume the answer is not "run cpanm"
15:11 PM
akshaaatt

Spotify wrapped has dropped for this year
15:11 PM
Ngl it is pretty good
15:12 PM
alastairp

hi reosarevok, I saw more feedback from you on the genres, thanks
15:12 PM
reosarevok

np :) Happy to help with whatever comes next
15:13 PM
alastairp

as I said last week, I'll go through the first sheet and hard-code all of our decisions that we weren't able to do automatically. Once that looks good I'll put together a tag submitter
15:13 PM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #2355 (master…hide-deleted-subscriptions): Don't show deleted users' Subscription tabs to admins https://github.com/metabrainz/musicbrainz-serve...
15:14 PM
alastairp

ruaok: good question. there are new feature extractors in essentia that do more detailed low level features. now that deep learning algorithms seem to give good results (especially for classification), this seems to be the minimum required amount of data
15:15 PM
in fact, I was speaking with Dmitry about this a few weeks ago, and he thinks that they're close with a proposed extractor that we could include in AB in order to have better features
15:15 PM
ruaok

are any of them good enough for us to think about an AB reboot?
15:15 PM
alastairp

yes, he and I are planning on prototyping a data refresh if not by the end of this year, definitely early next year
15:16 PM
ruaok

are you in the office tomorrow?
15:16 PM
alastairp

I think it's been interesting to see how the scale of AB "breaks" many of the algorithms in essentia - we knew this early on with the machine learning stuff
15:16 PM
but the bpm stuff is interesting too. there are a lot of good results, but we just have so much stuff that there is also bad stuff too
15:17 PM
office - unsure. laptop is out for repair and I'm not 100% healthy yet
15:17 PM
ruaok

ok, next week then.
15:18 PM
but I am rather down on AB right now. I'm questioning the further existence of the project. at the very least we need to have a hard look at our short term plans.
15:18 PM
right now we're putting band-aids and performance improvements on something that seems entirely worthless to me.
15:19 PM
so, question for next week: What should the MVP for AB be so that it can provide some value to its users?
15:21 PM
alastairp

right, we're going to have to think about the value of 100% automated algorithms that we just set and forget
15:22 PM
ruaok

and we need to think about being able to iterate on the algorithms more easily.