#metabrainz

/

2:15 AM
reg[m] has quit

2022-01-24 02416, 2022

2:47 AM
texke` joined the channel

2022-01-24 02437, 2022

2:47 AM
texke has quit

2022-01-24 02402, 2022

4:42 AM
gcrkrause3 has quit

2022-01-24 02434, 2022

4:45 AM
gcrkrause3 joined the channel

2022-01-24 02428, 2022

6:43 AM
BrainzGit

[musicbrainz-server] 14mwiencek opened pull request #2394 (03master…prod-access-log-middleware): Enable Plack::Middleware::AccessLog in production https://github.com/metabrainz/musicbrainz-server/…

2022-01-24 02429, 2022

6:43 AM
reg[m] joined the channel

2022-01-24 02424, 2022

6:44 AM
akshaaatt

Hi bitmap! I was planning to have the design system hosted on something like design.metabrainz.org and required help with that setup

2022-01-24 02426, 2022

6:45 AM
akshaaatt

https://storybook.js.org/docs/react/sharing/publi…

2022-01-24 02445, 2022

6:51 AM
TOPIC: MetaBrainz Community and Development channel | MusicBrainz non-development: #musicbrainz | BookBrainz: #bookbrainz | Channel is logged; see https://musicbrainz.org/doc/IRC for details | Agenda: Reviews, Doc sprint (yvanzo), Repo review (reo)

2022-01-24 02409, 2022

7:02 AM
BrainzGit

[design-system] 14akshaaatt closed pull request #49 (03master…chromatic-ci): Publish on Chromatic https://github.com/metabrainz/design-system/pull/…

2022-01-24 02447, 2022

7:15 AM
reosarevok

lucifer: you'll love this: https://twitter.com/itsbrandond/status/1484913084…

2022-01-24 02457, 2022

7:27 AM
BrainzGit

[musicbrainz-server] 14reosarevok merged pull request #2390 (03master…remove-coc-file): Delete CODE_OF_CONDUCT.md and use org default https://github.com/metabrainz/musicbrainz-server/…

2022-01-24 02450, 2022

7:31 AM
lucifer

reosarevok: oh lol 🥲

2022-01-24 02456, 2022

9:02 AM
yvanzo

O’Moin

2022-01-24 02413, 2022

9:30 AM
reosarevok

moin!

2022-01-24 02458, 2022

9:43 AM
reosarevok

yvanzo: before I start writing it, we don't have any docs for how to add support for a new URL domain to the code, right?

2022-01-24 02412, 2022

9:44 AM
reosarevok

As in, a doc that explains what files to change for autoselect/cleanup, favicons, sidebar, etc

2022-01-24 02444, 2022

9:46 AM
yvanzo

reosarevok: No, we should even probably have a separate doc explaining how to handle those tickets to check current usage in the DB, old and current links on the web…

2022-01-24 02403, 2022

9:47 AM
yvanzo

Like HACKING-URLCLEANUP.md?

2022-01-24 02421, 2022

9:47 AM
reosarevok

Hmm, I was going to put it on HACKING but that also seems good

2022-01-24 02438, 2022

9:47 AM
reosarevok

We can move it to a dev docs repo if we eventually start one

2022-01-24 02456, 2022

9:47 AM
yvanzo

Sure :)

2022-01-24 02419, 2022

9:48 AM
reosarevok

So maybe have a heading in HACKING like

2022-01-24 02441, 2022

9:48 AM
reosarevok

https://www.irccloud.com/pastebin/fWNzY3mn/

2022-01-24 02405, 2022

9:49 AM
reosarevok

And then write the doc in that file

2022-01-24 02440, 2022

9:54 AM
yvanzo

Yes, the same as Cover Art Archive development

2022-01-24 02404, 2022

9:57 AM
reosarevok

Ok :)

2022-01-24 02423, 2022

9:58 AM
yvanzo

Btw, maybe just HACKING-URL.md if it is not limited to cleanup.

2022-01-24 02409, 2022

10:02 AM
reosarevok

Yeah, fair

2022-01-24 02424, 2022

10:03 AM
Shubh joined the channel

2022-01-24 02439, 2022

10:15 AM
mayhem

moooin!

2022-01-24 02415, 2022

10:16 AM
alastairp

those headers are great, lucifer! https://usercontent.irccloud-cdn.com/file/NeRzIvU…

2022-01-24 02436, 2022

10:16 AM
lucifer

indeed! :D

2022-01-24 02459, 2022

10:16 AM
lucifer

i saw those in RTD tutorial first and thought to add LB docs as well.

2022-01-24 02451, 2022

10:17 AM
alastairp

why do we have "Web root url" in the api docs? I can't see it used anywhere in the docs

2022-01-24 02410, 2022

10:20 AM
lucifer

yeah, i think we can remove that. dosen't make sense to list non-api endpoints there in LB API docs.

2022-01-24 02419, 2022

10:20 AM
mayhem

alastairp: got a sec for some brainstorming?

2022-01-24 02430, 2022

10:20 AM
alastairp

yes, go for it

2022-01-24 02435, 2022

10:20 AM
lucifer

although it could come in handy in dev/maintainer docs

2022-01-24 02457, 2022

10:20 AM
mayhem

let me find a reference real quick. hang on.

2022-01-24 02458, 2022

10:20 AM
alastairp

lucifer: yeah, there may be value in having it somewhere, but it feels a bit out of place right there

2022-01-24 02400, 2022

10:22 AM
lucifer

yup makes sense. i'll remove that in another PR. the current ones depend on each other so making the change here will cause cascading merges/rebases to resolve conflict.

2022-01-24 02410, 2022

10:22 AM
alastairp

no prob

2022-01-24 02412, 2022

10:23 AM
akshaaatt

lucifer, A reminder about the data safety form in playstore 😃

2022-01-24 02403, 2022

10:24 AM
mayhem

meh. can't find the paper that talks about moods derived from listens. but that paper said something along the lines of:

2022-01-24 02417, 2022

10:24 AM
mayhem

"Extrating moods from audio is hard and doesn't work that well".

2022-01-24 02421, 2022

10:24 AM
mayhem

*extracting

2022-01-24 02423, 2022

10:24 AM
alastairp

the one I linked last week?

2022-01-24 02426, 2022

10:24 AM
mayhem

yes.

2022-01-24 02450, 2022

10:24 AM
mayhem

and in general, there is a sentiment that user behaviour is a better input to recommendation that audio files.

2022-01-24 02405, 2022

10:25 AM
mayhem

A point that Dimi made in this defense.

2022-01-24 02420, 2022

10:25 AM
mayhem

so, then it seems that AB is really barking up the wrong tree.

2022-01-24 02457, 2022

10:25 AM
mayhem

not only does it not provide good results, its also very hard to do unless you have access to a huge cache of files. Easy for spotify, very hard for us.

2022-01-24 02443, 2022

10:26 AM
alastairp

yeah, the question of "can you throw machine learning at audio signals and get some results?" has always been uncertain

2022-01-24 02420, 2022

10:27 AM
mayhem

And I think that we fell in love with the concept; we were never fully clear on what data we wanted to collect, and we just got ourselves hornswaggled on this approach. I think it its time to step back even further than we already have and re-examine.

2022-01-24 02441, 2022

10:27 AM
mayhem

exactly. so, how about we don't. period.

2022-01-24 02401, 2022

10:28 AM
mayhem

instead, lets us ask ourselves: what data do we want to collect?

2022-01-24 02430, 2022

10:28 AM
mayhem

once we answer that, we should find ways to collect those pieces of data, by the easiest means possible. meaning: what approaches fit with an open source approach.

2022-01-24 02446, 2022

10:28 AM
alastairp

so, instead of "description of songs based on analysing audio", maybe we actually just want "description of songs"

2022-01-24 02447, 2022

10:28 AM
lucifer

akshaaatt: thanks for the reminder, lets talk with mayhem about that after this conversation.

2022-01-24 02459, 2022

10:28 AM
mayhem

DING! EXACTLY THAY!

2022-01-24 02400, 2022

10:29 AM
mayhem

THAT

2022-01-24 02401, 2022

10:30 AM
mayhem

because reading the moods from behaviour paper gives a clear insight. If transferlearning is needed -- well, we have a CF filtering setup already. If we're bolting another piece of on top of that, then this may not be that much work in the grand scheme of things.

2022-01-24 02418, 2022

10:30 AM
mayhem

right now AB needs: DSP and ML knoweldge.

2022-01-24 02439, 2022

10:30 AM
mayhem

the new approach only needs ML knowledge. and everyone wants to do this right now -- its the hot thing to be doing.

2022-01-24 02449, 2022

10:30 AM
alastairp

-> https://arxiv.org/pdf/2010.11512.pdf link if you had lost it

2022-01-24 02450, 2022

10:30 AM
mayhem

and we have the infrastructure for it already. so, lets use that.

2022-01-24 02402, 2022

10:31 AM
mayhem

thanks!

2022-01-24 02433, 2022

10:31 AM
mayhem

there are bits and pieces of AB that are clearly useful. such as creating datasets, managing them and then throwing them at ML.

2022-01-24 02451, 2022

10:31 AM
mayhem

newAB could then throw away all the audio stuff and keep all the ML/dataset stuff.

2022-01-24 02428, 2022

10:32 AM
alastairp

yes, right

2022-01-24 02440, 2022

10:32 AM
mayhem

we could start with collecting mood information manually from users as part of BrainzPlayer. (listen to a track, select the mood from a dropdown, submit to newAB.

2022-01-24 02420, 2022

10:33 AM
mayhem

once we have a body of those, we can start doing the ML as described in the paper. the paper allows us to "fill in the blanks" of the data we're missing.

2022-01-24 02430, 2022

10:33 AM
mayhem

with me so far?

2022-01-24 02436, 2022

10:33 AM
alastairp

oh, that's interesting

2022-01-24 02454, 2022

10:33 AM
alastairp

working out how to collect data has always been a bit of an open question

2022-01-24 02402, 2022

10:34 AM
mayhem

and those are our strengths OR are things that are tangibly close to being within our reach.

2022-01-24 02422, 2022

10:34 AM
alastairp

and I think that the AB way of saying "here are some results, do they look wrong? give us feedback" was a bit weak because the data was a bit sparse

2022-01-24 02434, 2022

10:34 AM
mayhem

so, lets focus on that. lets build an OpenML project with a focus on music.

2022-01-24 02445, 2022

10:34 AM
mayhem

yes, indeed.

2022-01-24 02446, 2022

10:34 AM
alastairp

but maybe integrating into BP might be a better way of getting that data

2022-01-24 02426, 2022

10:35 AM
alastairp

the previous way that we've done it is give it to students as an assignment. "your task is to listen to these 500 songs from jamendo and categorise genre/mood/other things, then do some analysis on the results"

2022-01-24 02428, 2022

10:35 AM
mayhem

I think BP is the outlier on this front. I think with enough effort, we can do music analysis on it. I think you're close to showing that.

2022-01-24 02447, 2022

10:35 AM
mayhem

but, just to get BPM, do we really need to build all this infra for it?

2022-01-24 02455, 2022

10:35 AM
mayhem

what if we wrote BPM stuff as a pluging for picard?

2022-01-24 02411, 2022

10:36 AM
mayhem

and picard runs 3+ algs and if there is consensus, submit the data.

2022-01-24 02417, 2022

10:36 AM
mayhem

bam, we're done.

2022-01-24 02424, 2022

10:36 AM
alastairp

how do you mean? people use picard and tap along with the song?

2022-01-24 02429, 2022

10:36 AM
alastairp

oh, for the algorithms. mmm

2022-01-24 02437, 2022

10:36 AM
alastairp

one sec, let me commit some stuff

2022-01-24 02448, 2022

10:36 AM
mayhem

yes. since it looks like there are promising algs, lets stuff them into picard and be done on that front.

2022-01-24 02406, 2022

10:37 AM
mayhem

then focus ML in the newAB and I think we have a project might might get more people interested in helping.

2022-01-24 02412, 2022

10:37 AM
mayhem wonders if lucifer has any ideas

2022-01-24 02458, 2022

10:37 AM
monkey

Are "moods" a specific subset of tags?

2022-01-24 02409, 2022

10:38 AM
lucifer

nothing to add but following along the discussion. +1 on all BP, picard and new AB ideas

2022-01-24 02438, 2022

10:38 AM
monkey

(I've been wanting to add a tags input and display into the ListenCard on LB, perhaps another input can prompt for adding/selecting moods)

2022-01-24 02450, 2022

10:38 AM
zas

legoktm[m]: ping

2022-01-24 02445, 2022

10:39 AM
mayhem

monkey: moods could be tags -- yes, but I think I would build a more general infrastructure to support this in newAB.

2022-01-24 02413, 2022

10:40 AM
mayhem

The only thing that bugs me about this is that newAB no longer deals with acoustics. thus the name is now borked.

2022-01-24 02459, 2022

10:40 AM
alastairp

yeah, that's more like music-dataset-brainz

2022-01-24 02433, 2022

10:41 AM
alastairp

https://github.com/metabrainz/mir-analysis/blob/m… I did this at the end of last week

2022-01-24 02403, 2022

10:42 AM
alastairp

some odd results still in 04 - Pink Floyd - Wish You Were Here.flac and 03 - Miles Davis - Blue in Green.flac

2022-01-24 02408, 2022

10:45 AM
mayhem

wish you were here is also a tough candidate for this shit.

2022-01-24 02416, 2022

10:45 AM
monkey

Seems like a consensus approach would give pretty reliable results

2022-01-24 02440, 2022

10:45 AM
alastairp

yeah, I realised that pink floyd and miles davis probably aren't the easiest candidate set

2022-01-24 02451, 2022

10:45 AM
mayhem

there is consensus, but is shine on 140bpm??

2022-01-24 02404, 2022

10:46 AM
mayhem

that seems too fast to me.

2022-01-24 02424, 2022

10:46 AM
mayhem

I guess spotify agrees with it.

2022-01-24 02409, 2022

10:47 AM
mayhem

well tempocnn and madmom seem like easy candidates to toss out.

2022-01-24 02433, 2022

10:47 AM
alastairp

shine on is interesting because the parts are really different

2022-01-24 02444, 2022

10:47 AM
mayhem

yerp

2022-01-24 02449, 2022

10:47 AM
alastairp

there are some slower parts (60ish), and some really fast parts too

2022-01-24 02412, 2022

10:48 AM
mayhem

lol @ blue and green. Not 170BPM. collective fail.

2022-01-24 02433, 2022

10:48 AM
alastairp

and so you have to start thinking if you want to do a single bpm or measure how it changes over the whole song

2022-01-24 02458, 2022

10:48 AM
mayhem

I guess we're tying to find "representative BPM", not actual BPM.

2022-01-24 02410, 2022

10:49 AM
mayhem

because actual BPM is... variable over the song.

2022-01-24 02425, 2022

10:49 AM
mayhem

but for a lot of use cases that we're chasing after, that doesn't matter.

2022-01-24 02457, 2022

10:49 AM
alastairp

but averaging all of the BPM values over the duration of the song isn't great either, because you end up with none of them

2022-01-24 02444, 2022

10:50 AM
alastairp

for the 1 queen song, I plotted bpm for most of the algorithms: https://github.com/metabrainz/mir-analysis/blob/m…

2022-01-24 02457, 2022

10:50 AM
mayhem

yeah, that wasn't necessarily my suggestion though.

2022-01-24 02405, 2022

10:51 AM
alastairp

that track seems kind of stable

2022-01-24 02413, 2022

10:51 AM
alastairp

oh sure - that's not the only way of doing it

2022-01-24 02450, 2022

10:51 AM
mayhem

I think if we define BPM as "the BPM of the most 'recognizable portion of the song" then we narrow the scope of the problem.

2022-01-24 02402, 2022

10:52 AM
mayhem

we make the problem more human.

2022-01-24 02407, 2022

10:52 AM
mayhem

and we have humans.

2022-01-24 02440, 2022

10:52 AM
mayhem

what if we had this workfow:

2022-01-24 02402, 2022

10:53 AM
mayhem

1. User runs automatic BPM detection that is tuned to only accept the clearest results.

2022-01-24 02411, 2022

10:53 AM
mayhem

2. If there is clear consensus, we're done.

2022-01-24 02427, 2022

10:53 AM
BrainzGit

[bookbrainz-site] 14MonkeyDo merged pull request #754 (03master…collectionPageNameSection): fix(CollectionPage): Added Horizontal rule in CollectionPage https://github.com/metabrainz/bookbrainz-site/pul…

2022-01-24 02454, 2022

10:53 AM
mayhem

3. If there is not, have the user select the most "salient" parts of the song. and then run analysis again. do we get a stable result now? if so, we're done.

2022-01-24 02422, 2022

10:54 AM
mayhem

4. If not... not sure. "tap on every downbeat" and we'll calculate it?

2022-01-24 02457, 2022

10:55 AM
mayhem

https://usercontent.irccloud-cdn.com/file/3pQHOjq…

2022-01-24 02402, 2022

10:56 AM
alastairp

within something like picard? BP?

2022-01-24 02402, 2022

10:56 AM
mayhem

https://www.reddit.com/r/PleX/comments/r7g8v3/tip…

2022-01-24 02419, 2022

10:56 AM
mayhem

alastairp: both, ideally.

2022-01-24 02416, 2022

10:57 AM
alastairp

I like the idea of "indicate to us what you think the most representative part of this song is"

2022-01-24 02428, 2022

10:57 AM
mayhem

MLBrainz would also work great for building candidate sets for music recommendations.

2022-01-24 02459, 2022

10:57 AM
mayhem

yes, coupled with loads of users doing this, we can gather the best data out there.

2022-01-24 02422, 2022

10:58 AM
alastairp

the trick is to convince users that it's worth their time to help with this

2022-01-24 02438, 2022

10:58 AM
mayhem

that hasn't been a problem for us in the past.

2022-01-24 02416, 2022

10:59 AM
mayhem

I think the key is to make it easy enough for them to do it. Adding "calls to action" to BP has been on the wishlist for a while.

2022-01-24 02424, 2022

10:59 AM
mayhem

I would like to see all sorts of things there:

2022-01-24 02444, 2022

10:59 AM
mayhem

1. This artist has no patronage link. Do you know of one?

2022-01-24 02406, 2022

11:00 AM
mayhem

2. This track has no BPM, would you like to help us submit it?

2022-01-24 02446, 2022

11:00 AM
mayhem

3. This track has no genre tags: We think it might be these tags. click a tag or enter new tags.

2022-01-24 02401, 2022

11:03 AM
BrainzGit

[musicbrainz-server] 14reosarevok opened pull request #2395 (03master…document-url-in-hacking): Document URL support in MBS https://github.com/metabrainz/musicbrainz-server/…

2022-01-24 02417, 2022

11:03 AM
alastairp

BPM on BP is interesting. tap along is something that we could get users to do

2022-01-24 02434, 2022

11:03 AM
alastairp

but I'm also wondering about js-based algorithms + using the stream as the user is listening to the track

2022-01-24 02420, 2022

11:04 AM
mayhem

having access to the audio is the trick. I doubt that is even possible in most cases, unless we are doing it for a local file.