in #metabrainz

2:15 AM
reg[m] has quit
2:47 AM
texke` joined the channel
2:47 AM
texke has quit
4:42 AM
gcrkrause3 has quit
4:45 AM
gcrkrause3 joined the channel
6:43 AM
BrainzGit

[musicbrainz-server] 14mwiencek opened pull request #2394 (03master…prod-access-log-middleware): Enable Plack::Middleware::AccessLog in production https://github.com/metabrainz/musicbrainz-serve...
6:43 AM
reg[m] joined the channel
6:44 AM
akshaaatt

Hi bitmap! I was planning to have the design system hosted on something like design.metabrainz.org and required help with that setup
6:45 AM
https://storybook.js.org/docs/react/sharing/pub...
6:51 AM
TOPIC: MetaBrainz Community and Development channel | MusicBrainz non-development: #musicbrainz | BookBrainz: #bookbrainz | Channel is logged; see https://musicbrainz.org/doc/IRC for details | Agenda: Reviews, Doc sprint (yvanzo), Repo review (reo)
7:02 AM
BrainzGit

[design-system] 14akshaaatt closed pull request #49 (03master…chromatic-ci): Publish on Chromatic https://github.com/metabrainz/design-system/pul...
7:15 AM
reosarevok

lucifer: you'll love this: https://twitter.com/itsbrandond/status/14849130...
7:27 AM
BrainzGit

[musicbrainz-server] 14reosarevok merged pull request #2390 (03master…remove-coc-file): Delete CODE_OF_CONDUCT.md and use org default https://github.com/metabrainz/musicbrainz-serve...
7:31 AM
lucifer

reosarevok: oh lol 🥲
9:02 AM
yvanzo

O’Moin
9:30 AM
reosarevok

moin!
9:43 AM
yvanzo: before I start writing it, we don't have any docs for how to add support for a new URL domain to the code, right?
9:44 AM
As in, a doc that explains what files to change for autoselect/cleanup, favicons, sidebar, etc
9:46 AM
yvanzo

reosarevok: No, we should even probably have a separate doc explaining how to handle those tickets to check current usage in the DB, old and current links on the web…
9:47 AM
Like HACKING-URLCLEANUP.md?
9:47 AM
reosarevok

Hmm, I was going to put it on HACKING but that also seems good
9:47 AM
We can move it to a dev docs repo if we eventually start one
9:47 AM
yvanzo

Sure :)
9:48 AM
reosarevok

So maybe have a heading in HACKING like
9:48 AM
https://www.irccloud.com/pastebin/fWNzY3mn/
9:49 AM
And then write the doc in that file
9:54 AM
yvanzo

Yes, the same as Cover Art Archive development
9:57 AM
reosarevok

Ok :)
9:58 AM
yvanzo

Btw, maybe just HACKING-URL.md if it is not limited to cleanup.
10:02 AM
reosarevok

Yeah, fair
10:03 AM
Shubh joined the channel
10:15 AM
mayhem

moooin!
10:16 AM
alastairp

those headers are great, lucifer! https://usercontent.irccloud-cdn.com/file/NeRzI...
10:16 AM
lucifer

indeed! :D
10:16 AM
i saw those in RTD tutorial first and thought to add LB docs as well.
10:17 AM
alastairp

why do we have "Web root url" in the api docs? I can't see it used anywhere in the docs
10:20 AM
lucifer

yeah, i think we can remove that. dosen't make sense to list non-api endpoints there in LB API docs.
10:20 AM
mayhem

alastairp: got a sec for some brainstorming?
10:20 AM
alastairp

yes, go for it
10:20 AM
lucifer

although it could come in handy in dev/maintainer docs
10:20 AM
mayhem

let me find a reference real quick. hang on.
10:20 AM
alastairp

lucifer: yeah, there may be value in having it somewhere, but it feels a bit out of place right there
10:22 AM
lucifer

yup makes sense. i'll remove that in another PR. the current ones depend on each other so making the change here will cause cascading merges/rebases to resolve conflict.
10:22 AM
alastairp

no prob
10:23 AM
akshaaatt

lucifer, A reminder about the data safety form in playstore 😃
10:24 AM
mayhem

meh. can't find the paper that talks about moods derived from listens. but that paper said something along the lines of:
10:24 AM
"Extrating moods from audio is hard and doesn't work that well".
10:24 AM
*extracting
10:24 AM
alastairp

the one I linked last week?
10:24 AM
mayhem

yes.
10:24 AM
and in general, there is a sentiment that user behaviour is a better input to recommendation that audio files.
10:25 AM
A point that Dimi made in this defense.
10:25 AM
so, then it seems that AB is really barking up the wrong tree.
10:25 AM
not only does it not provide good results, its also very hard to do unless you have access to a huge cache of files. Easy for spotify, very hard for us.
10:26 AM
alastairp

yeah, the question of "can you throw machine learning at audio signals and get some results?" has always been uncertain
10:27 AM
mayhem

And I think that we fell in love with the concept; we were never fully clear on what data we wanted to collect, and we just got ourselves hornswaggled on this approach. I think it its time to step back even further than we already have and re-examine.
10:27 AM
exactly. so, how about we don't. period.
10:28 AM
instead, lets us ask ourselves: what data do we want to collect?
10:28 AM
once we answer that, we should find ways to collect those pieces of data, by the easiest means possible. meaning: what approaches fit with an open source approach.
10:28 AM
alastairp

so, instead of "description of songs based on analysing audio", maybe we actually just want "description of songs"
10:28 AM
lucifer

akshaaatt: thanks for the reminder, lets talk with mayhem about that after this conversation.
10:28 AM
mayhem

DING! EXACTLY THAY!
10:29 AM
THAT
10:30 AM
because reading the moods from behaviour paper gives a clear insight. If transferlearning is needed -- well, we have a CF filtering setup already. If we're bolting another piece of on top of that, then this may not be that much work in the grand scheme of things.
10:30 AM
right now AB needs: DSP and ML knoweldge.
10:30 AM
the new approach only needs ML knowledge. and everyone wants to do this right now -- its the hot thing to be doing.
10:30 AM
alastairp

-> https://arxiv.org/pdf/2010.11512.pdf link if you had lost it
10:30 AM
mayhem

and we have the infrastructure for it already. so, lets use that.
10:31 AM
thanks!
10:31 AM
there are bits and pieces of AB that are clearly useful. such as creating datasets, managing them and then throwing them at ML.
10:31 AM
newAB could then throw away all the audio stuff and keep all the ML/dataset stuff.
10:32 AM
alastairp

yes, right
10:32 AM
mayhem

we could start with collecting mood information manually from users as part of BrainzPlayer. (listen to a track, select the mood from a dropdown, submit to newAB.
10:33 AM
once we have a body of those, we can start doing the ML as described in the paper. the paper allows us to "fill in the blanks" of the data we're missing.
10:33 AM
with me so far?
10:33 AM
alastairp

oh, that's interesting
10:33 AM
working out how to collect data has always been a bit of an open question
10:34 AM
mayhem

and those are our strengths OR are things that are tangibly close to being within our reach.
10:34 AM
alastairp

and I think that the AB way of saying "here are some results, do they look wrong? give us feedback" was a bit weak because the data was a bit sparse
10:34 AM
mayhem

so, lets focus on that. lets build an OpenML project with a focus on music.
10:34 AM
yes, indeed.
10:34 AM
alastairp

but maybe integrating into BP might be a better way of getting that data
10:35 AM
the previous way that we've done it is give it to students as an assignment. "your task is to listen to these 500 songs from jamendo and categorise genre/mood/other things, then do some analysis on the results"
10:35 AM
mayhem

I think BP is the outlier on this front. I think with enough effort, we can do music analysis on it. I think you're close to showing that.
10:35 AM
but, just to get BPM, do we really need to build all this infra for it?
10:35 AM
what if we wrote BPM stuff as a pluging for picard?
10:36 AM
and picard runs 3+ algs and if there is consensus, submit the data.
10:36 AM
bam, we're done.
10:36 AM
alastairp

how do you mean? people use picard and tap along with the song?
10:36 AM
oh, for the algorithms. mmm
10:36 AM
one sec, let me commit some stuff
10:36 AM
mayhem

yes. since it looks like there are promising algs, lets stuff them into picard and be done on that front.
10:37 AM
then focus ML in the newAB and I think we have a project might might get more people interested in helping.
10:37 AM
mayhem wonders if lucifer has any ideas
10:37 AM
monkey

Are "moods" a specific subset of tags?
10:38 AM
lucifer

nothing to add but following along the discussion. +1 on all BP, picard and new AB ideas
10:38 AM
monkey

(I've been wanting to add a tags input and display into the ListenCard on LB, perhaps another input can prompt for adding/selecting moods)
10:38 AM
zas

legoktm[m]: ping
10:39 AM
mayhem

monkey: moods could be tags -- yes, but I think I would build a more general infrastructure to support this in newAB.
10:40 AM
The only thing that bugs me about this is that newAB no longer deals with acoustics. thus the name is now borked.
10:40 AM
alastairp

yeah, that's more like music-dataset-brainz
10:41 AM
https://github.com/metabrainz/mir-analysis/blob... I did this at the end of last week
10:42 AM
some odd results still in 04 - Pink Floyd - Wish You Were Here.flac and 03 - Miles Davis - Blue in Green.flac
10:45 AM
mayhem

wish you were here is also a tough candidate for this shit.
10:45 AM
monkey

Seems like a consensus approach would give pretty reliable results
10:45 AM
alastairp

yeah, I realised that pink floyd and miles davis probably aren't the easiest candidate set
10:45 AM
mayhem

there is consensus, but is shine on 140bpm??
10:46 AM
that seems too fast to me.
10:46 AM
I guess spotify agrees with it.
10:47 AM
well tempocnn and madmom seem like easy candidates to toss out.
10:47 AM
alastairp

shine on is interesting because the parts are really different
10:47 AM
mayhem

yerp
10:47 AM
alastairp

there are some slower parts (60ish), and some really fast parts too
10:48 AM
mayhem

lol @ blue and green. Not 170BPM. collective fail.
10:48 AM
alastairp

and so you have to start thinking if you want to do a single bpm or measure how it changes over the whole song
10:48 AM
mayhem

I guess we're tying to find "representative BPM", not actual BPM.
10:49 AM
because actual BPM is... variable over the song.
10:49 AM
but for a lot of use cases that we're chasing after, that doesn't matter.
10:49 AM
alastairp

but averaging all of the BPM values over the duration of the song isn't great either, because you end up with none of them
10:50 AM
for the 1 queen song, I plotted bpm for most of the algorithms: https://github.com/metabrainz/mir-analysis/blob...
10:50 AM
mayhem

yeah, that wasn't necessarily my suggestion though.
10:51 AM
alastairp

that track seems kind of stable
10:51 AM
oh sure - that's not the only way of doing it
10:51 AM
mayhem

I think if we define BPM as "the BPM of the most 'recognizable portion of the song" then we narrow the scope of the problem.
10:52 AM
we make the problem more human.
10:52 AM
and we have humans.
10:52 AM
what if we had this workfow:
10:53 AM
1. User runs automatic BPM detection that is tuned to only accept the clearest results.
10:53 AM
2. If there is clear consensus, we're done.
10:53 AM
BrainzGit

[bookbrainz-site] 14MonkeyDo merged pull request #754 (03master…collectionPageNameSection): fix(CollectionPage): Added Horizontal rule in CollectionPage https://github.com/metabrainz/bookbrainz-site/p...
10:53 AM
mayhem

3. If there is not, have the user select the most "salient" parts of the song. and then run analysis again. do we get a stable result now? if so, we're done.
10:54 AM
4. If not... not sure. "tap on every downbeat" and we'll calculate it?
10:55 AM
https://usercontent.irccloud-cdn.com/file/3pQHO...
10:56 AM
alastairp

within something like picard? BP?
10:56 AM
mayhem

https://www.reddit.com/r/PleX/comments/r7g8v3/t...
10:56 AM
alastairp: both, ideally.
10:57 AM
alastairp

I like the idea of "indicate to us what you think the most representative part of this song is"
10:57 AM
mayhem

MLBrainz would also work great for building candidate sets for music recommendations.
10:57 AM
yes, coupled with loads of users doing this, we can gather the best data out there.
10:58 AM
alastairp

the trick is to convince users that it's worth their time to help with this
10:58 AM
mayhem

that hasn't been a problem for us in the past.
10:59 AM
I think the key is to make it easy enough for them to do it. Adding "calls to action" to BP has been on the wishlist for a while.
10:59 AM
I would like to see all sorts of things there:
10:59 AM
1. This artist has no patronage link. Do you know of one?
11:00 AM
2. This track has no BPM, would you like to help us submit it?
11:00 AM
3. This track has no genre tags: We think it might be these tags. click a tag or enter new tags.
11:03 AM
BrainzGit

[musicbrainz-server] 14reosarevok opened pull request #2395 (03master…document-url-in-hacking): Document URL support in MBS https://github.com/metabrainz/musicbrainz-serve...
11:03 AM
alastairp

BPM on BP is interesting. tap along is something that we could get users to do
11:03 AM
but I'm also wondering about js-based algorithms + using the stream as the user is listening to the track
11:04 AM
mayhem

having access to the audio is the trick. I doubt that is even possible in most cases, unless we are doing it for a local file.