#metabrainz

      • alastairp
        regarding the highlevel dump, this one won't need to select an element of a json column, so we could just `\copy (select highlevel as ll_row_id, data from highlevel_model where model in (our models))`
      • lucifer
        ruaok: oh ok. thinking about this, we should probably have a playlist endpoint in LB itself to add an item from just a recording mbid.
      • (as a future enhancement)
      • alastairp: i see, nice. let's do two dumps then, one for bpm stuff and one for mood.
      • alastairp
        great
      • oh, you'll need highlevel_model.model too, to differentiate between the different rows for a given ll.id
      • ruaok
        lucifer: we have that. it's one of the most basic functions.
      • alastairp
        highlevel_model.highlevel is the same as ll.id. I'd dump ll.id, ll.mbid separately so that pg doesn't have to do a join
      • lucifer
        ruaok: oh, i couldn't find it here so thought it doesn't exist. https://listenbrainz.readthedocs.io/en/v-2021-1...
      • ruaok
        well, how else would we add a track to a playlist?
      • lucifer: remind me, what exactly was that list of MBIDs?
      • the outliers in my BPM charts?
      • lucifer
        the mbids were part of the 165 bpm peak
      • ruaok
        not a single track I've clicked on is 165 BPM.
      • alastairp
        the first one - the hi-hat/snare is about 165
      • ruaok
        yerp, could see that one.
      • ok, so those cute BPM charts are useless too. 😭
      • alastairp
        first peak weight is "mean": 0.62 :(
      • so it seems pretty certain about that. means that we can't always use the weight for this either
      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #718 (master…fix#BB-458): Fix[BB-458]: Showing error conflict page on deleting entity twice https://github.com/bookbrainz/bookbrainz-site/p...
      • lucifer
        what about the second one? Come Meh Way
      • this is what spotify has for the Cantelowes track ruaok shared earlier. AB has 185 BPM for this.
      • monkey
        Is there a way we can sort items by the most confident high level indicator, for example for this one maybe 'loudness', for another track might be 'danceability' ?
      • alastairp
        yeah, that's why I wanted to consider using bpm_histogram_first_peak_weight, but it seems that there are many cases where it's pretty confident with this value as well
      • lucifer
        is this an issue with just high bpm recordings, or can it occur in any type of recording? if it's just high, then we could try querying spotify if a bpm is above a threshold?
      • but regardless, querying the bpm's spotify has and comparing with AB might be a nice way to detect issues with AB data and possibly identify the issue with the bpm detection alg.
      • alastairp
        no, sorry
      • yvanzo
        Rescheduled MB search indexes dump to run soon (11:40 UTC) with I/O patch.
      • lucifer
        ah ok, no worries.
      • alastairp
      • monkey: in the playlist view, the currently playing track has its right-hand data offset to the right a little bit, is that known?
      • monkey
        Yep ruaok said the same yesterday :)
      • The play icon with and without the circle around it aren't the same width
      • alastairp
        right, as long as you're aware!
      • monkey
        Thanks
      • ruaok
        alastairp: monkey akshaaatt: Freso: invoices please!
      • alastairp
        it's not december already
      • I refuse to believe it
      • akshaaatt
        On it ruaok
      • ruaok
        did you do your own research?
      • alastairp
        I'm trying to, but I can't find any sources which corroborate my belief
      • but I "feel" that I'm correct
      • ruaok
        maybe it's time to make a site that disavows the fact it's Dec already.
      • monkey
        Heading to the office, will invoice from there
      • Thanks for the reminder
      • BrainzGit
        [musicbrainz-docker] nikosmichas opened pull request #213 (master…tcp_max_tw_buckets): Limit tcp_max_tw_buckets to avoid connection issues between services https://github.com/metabrainz/musicbrainz-docke...
      • ruaok
        alastairp: any idea why this would give a 502 ?
      • monkey
        Too long a list perhaps?
      • ruaok
        docs don't specify how many IDs I can include in a call.
      • alastairp
      • maybe it's not visible enough
      • ruaok
        ah no, i clearly missed that.
      • but 502?
      • alastairp
        but good point - we reject listens if they make it to the webserver and there are too many
      • ruaok
        should be 400, no?
      • alastairp
        but if you give _so many_ that uwsgi gives up, it returns 502 before it even gets to the AB code
      • ruaok
        should that be mentioned in the docs?
      • alastairp
        yeah, I think we can fit that in
      • lucifer
        this is same as LB-993
      • BrainzBot
        LB-993: User feedback endpoint returns 502 when querying too many recordings https://tickets.metabrainz.org/browse/LB-993
      • alastairp
        yep
      • lucifer
        we could also increase buffer size for AB like we intend to do for LB
      • alastairp
        yeah, I don't know what the effect of the buffer size is on the number of workers that we have, I assume it'll increase some memory usage per worker
      • but it's an interesting issue that there's always a point where if it's too large, we get the error from uwsgi not our server
      • unless you make the buffer size hundreds of MB
      • lucifer
        possibly but we are talking about 4kb here per request, so the overall memory increase shouldn't be much.
      • alastairp
      • lucifer
        buffer-size is limited to 65536 bytes :)
      • alastairp
        cloudflare has a limit of 32k, other CDNs have a limit of 8k
      • that query that ruaok showed was only 3.7k, so not sure how it hit the limit
      • oh, including headers might have put it over
      • lucifer
        yup
      • alastairp
        OK, you've convinced me that 8k is fine
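A minimal sketch of one client-side way to stay under the request-size limit discussed above: split a long MBID list into batches so each GET stays within a byte budget. The function name `batch_mbids`, the `;` separator, the 1024-byte header allowance, and the 8192-byte default (taken from the "8k" figure in the chat) are all assumptions for illustration, not AB's or uwsgi's documented behaviour.

```python
def batch_mbids(mbids, base_url_len, max_bytes=8192, header_allowance=1024, sep=";"):
    """Split mbids into batches whose joined length fits a request-size budget.

    base_url_len: length of the URL up to (and including) the parameter name.
    max_bytes / header_allowance: assumed limits; tune them to the real server.
    A single MBID longer than the budget still gets its own batch.
    """
    budget = max_bytes - header_allowance - base_url_len
    if budget <= 0:
        raise ValueError("no room left for any MBIDs")

    batches, current, used = [], [], 0
    for mbid in mbids:
        # The separator only costs bytes when the batch already has an item.
        cost = len(mbid) + (len(sep) if current else 0)
        if current and used + cost > budget:
            batches.append(current)
            current, used = [], 0
            cost = len(mbid)
        current.append(mbid)
        used += cost
    if current:
        batches.append(current)
    return batches


# Example: 1000 placeholder IDs of MBID length (36 characters each).
example_ids = ["0" * 36 for _ in range(1000)]
example_batches = batch_mbids(example_ids, base_url_len=60)
```

Lengths are counted before URL encoding; if the separator gets percent-encoded, the real cost per item is a little higher, so a smaller budget would be the cautious choice.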
      • lucifer
        out of curiosity, i wonder how would a lookup of say 1000 mbids be done at a time, fat get or post request?
      • ruaok
        22% match rate on AB on this one example test case. :(
      • lucifer
        recordings missing from AB?
      • ruaok
        24% for alastairp, 15 for monkey, 18 for lucifer.
      • lucifer: yep.
      • that's a dead end then.
      • lucifer
        yeah :/
      • ruaok
        I think AB needs a drastic re-think, alastairp.
      • alastairp
        what are you looking up?
      • ruaok
        high level data for tracks that come out of the missed tracks datasets query.
      • the last column is mood_aggressive
      • it suggests that it could work, if we had sufficient coverage in AB.
      • alastairp
        a shame that we didn't get around to finishing the mbid redirect table task
      • lucifer: maybe we should just go with the external database for this for now so that we can get a bunch of improvements made
      • lucifer
        sure, sounds good to me.
      • alastairp
        maybe I'll have a look at that this week
      • lucifer
        so something like setting up a cron job that queries all recording_redirect entries since last run and then update recording mbids in AB tables?
      • alastairp
        no, I'd do it on demand. you query an MBID, it looks up all of the possible redirects, then it returns you data for all submissions for all mbids in the "set"
      • this way, the results of a query don't change over time
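A minimal sketch of the on-demand lookup alastairp describes above: resolve the queried MBID to its full set of redirect-linked MBIDs, then return submissions for every MBID in that set. The `(old_mbid, new_mbid)` pair format, the in-memory `submissions` dict, and both function names are hypothetical stand-ins for the real redirect table and AB storage.

```python
from collections import defaultdict


def build_redirect_sets(redirects):
    """Map every MBID to the set of MBIDs it is linked to via redirects.

    redirects: iterable of (old_mbid, new_mbid) pairs (assumed shape).
    Uses a small union-find so chains and merges end up in one set.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for old, new in redirects:
        parent[find(old)] = find(new)

    groups = defaultdict(set)
    for mbid in list(parent):
        groups[find(mbid)].add(mbid)
    return {mbid: members for members in groups.values() for mbid in members}


def lookup(mbid, redirect_sets, submissions):
    """Return submissions for every MBID in the queried MBID's redirect set."""
    mbid_set = redirect_sets.get(mbid, {mbid})
    return {m: submissions[m] for m in sorted(mbid_set) if m in submissions}


# Placeholder IDs instead of real MBIDs.
redirect_sets = build_redirect_sets([("a", "b"), ("c", "b")])
submissions = {"a": [1], "b": [2, 3], "d": [4]}
result = lookup("c", redirect_sets, submissions)
```

Because nothing in the stored rows is rewritten, a lookup depends only on the redirect pairs and submissions present at query time, in contrast to a cron job that updates MBIDs in place.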
      • lucifer
        i see. if you want i could take a quick stab at it, put it on ab beta so that ruaok can test it.
      • ruaok
        lucifer: alastairp : I think you two can stop working on any of this stuff. it's just not useful.
      • let's go back to whatever original tasks we had.
      • alastairp: do we have any algorithms that we could incorporate into a new-AB? Let's assume for a second we ditch our AB DB and start over. How would we do that?
      • can we collect better data with much smaller segment times that would allow us to build better algorithms for better feature detection later?
      • param: trivial PR for you, if you have a sec: https://github.com/paramsingh/pylistenbrainz/pu...
      • BrainzGit
        [bookbrainz-site] dependabot[bot] opened pull request #726 (master…dependabot/npm_and_yarn/normalize-url-4.5.1): chore(deps): bump normalize-url from 4.5.0 to 4.5.1 https://github.com/bookbrainz/bookbrainz-site/p...
      • param
      • I also added you as a collaborator to the repo for the future
      • ruaok
        sweet, thanks!
      • BrainzGit
        [bookbrainz-site] dependabot[bot] closed pull request #724 (master…dependabot/npm_and_yarn/tar-4.4.19): chore(deps): bump tar from 4.4.13 to 4.4.19 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] dependabot[bot] closed pull request #726 (master…dependabot/npm_and_yarn/normalize-url-4.5.1): chore(deps): bump normalize-url from 4.5.0 to 4.5.1 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] dependabot[bot] opened pull request #727 (master…dependabot/npm_and_yarn/postcss-8.4.4): chore(deps): bump postcss from 8.2.4 to 8.4.4 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] dependabot[bot] opened pull request #728 (master…dependabot/npm_and_yarn/browserslist-4.18.1): chore(deps): bump browserslist from 4.12.0 to 4.18.1 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] dependabot[bot] opened pull request #729 (master…dependabot/npm_and_yarn/webpack-5.64.4): chore(deps-dev): bump webpack from 5.12.3 to 5.64.4 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] dependabot[bot] closed pull request #717 (master…dependabot/npm_and_yarn/webpack-5.64.2): chore(deps-dev): bump webpack from 5.12.3 to 5.64.2 https://github.com/bookbrainz/bookbrainz-site/p...
      • monkey
        Yeah, that's right !
      • Go Dependabot, go !
      • Solving security alerts by removing a security alert mitigation tool. How ironic.
      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #728 (master…dependabot/npm_and_yarn/browserslist-4.18.1): chore(deps): bump browserslist from 4.12.0 to 4.18.1 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] MonkeyDo merged pull request #727 (master…dependabot/npm_and_yarn/postcss-8.4.4): chore(deps): bump postcss from 8.2.4 to 8.4.4 https://github.com/bookbrainz/bookbrainz-site/p...
      • [bookbrainz-site] MonkeyDo merged pull request #725 (master…dependabot/npm_and_yarn/path-parse-1.0.7): chore(deps): bump path-parse from 1.0.6 to 1.0.7 https://github.com/bookbrainz/bookbrainz-site/p...
      • [musicbrainz-server] reosarevok opened pull request #2354 (master…MBS-12114): MBS-12114: Account for the "Disk" alternative spelling to "Disc" https://github.com/metabrainz/musicbrainz-serve...
      • reosarevok
        Threw that tiny change into the milestone ^ :)
      • Also
      • Apparently some reports failed to run or something (MBS-12112)
      • BrainzBot
        MBS-12112: Several reports are incorrectly empty https://tickets.metabrainz.org/browse/MBS-12112
      • reosarevok
        I tried to log into a musicbrainz-website-prod container and run ./admin/RunReports.pl but I get
      • root@4bd87d4e8684:/home/musicbrainz/musicbrainz-server# ./admin/RunReports.pl Can't locate List/AllUtils.pm in @INC (you may need to install the List::AllUtils module) (@INC contains: /home/musicbrainz/musicbrainz-server/admin/../lib /etc/perl /usr/local/lib/x86_64-linux-gnu/perl/5.30.0 /usr/local/share/perl/5.30.0 /usr/lib/x86_64-linux-gnu/perl5/5.30 /usr/share/perl5 /usr/lib/x86_64-linux-gnu/perl/5.30 /usr/share/perl/5.30
      • /usr/local/lib/site_perl /usr/lib/x86_64-linux-gnu/perl-base) at ./admin/RunReports.pl line 8. BEGIN failed--compilation aborted at ./admin/RunReports.pl line 8.
      • yvanzo, bitmap: how do you run reports manually when needed? I know at least bitmap has done that in the past
      • I mean, I assume the answer is not "run cpanm"
      • akshaaatt
        Spotify wrapped has dropped for this year
      • Ngl it is pretty good
      • alastairp
        hi reosarevok, I saw more feedback from you on the genres, thanks
      • reosarevok
        np :) Happy to help with whatever comes next
      • alastairp
        as I said last week, I'll go through the first sheet and hard-code all of our decisions that we weren't able to do automatically. Once that looks good I'll put together a tag submitter
      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #2355 (master…hide-deleted-subscriptions): Don't show deleted users' Subscription tabs to admins https://github.com/metabrainz/musicbrainz-serve...
      • alastairp
        ruaok: good question. there are new feature extractors in essentia that do more detailed low level features. now that deep learning algorithms seem to give good results (especially for classification), this seems to be the minimum required amount of data
      • in fact, I was speaking with Dmitry about this a few weeks ago, and he thinks that they're close with a proposed extractor that we could include in AB in order to have better features
      • ruaok
        are any of them good enough to think about an AB reboot?
      • alastairp
        yes, he and I are planning on prototyping a data refresh if not by the end of this year, definitely early next year
      • ruaok
        are you in the office tomorrow?
      • alastairp
        I think it's been interesting to see how the scale of AB "breaks" many of the algorithms in essentia - we knew this early on with the machine learning stuff
      • but the bpm stuff is interesting too. there are a lot of good results, but we just have so much stuff that there is also bad stuff too
      • office - unsure. laptop is out for repair and I'm not 100% healthy yet
      • ruaok
        ok, next week then.
      • but I am rather down on AB right now. I'm questioning the further existence of the project. at the very least we need to have a hard look at our short term plans.
      • right now we're putting on band-aids and performance improvements of something that seems entirely worthless to me.
      • so, question for next week: What should the MVP for AB be so that it can provide some value to its users?
      • alastairp
        right, we're going to have to think about the value of 100% automated algorithms that we just set and forget
      • ruaok
        and we need to think about being able to iterate on the algorithms more easily.