#metabrainz

14:40 PM
rbsam176

thank you both by the way, I'm sure you're super busy so I appreciate the help

2022-02-14 04558, 2022

14:55 PM
tandy1000

<riksucks> "alastairp: I have been hearing a..." <- i think the issue with the current acousticbrainz model is that to sidestep the fact that we cant store copyrighted music the algorithm tries to get necessary data from the music, so we have to have suitable data that can be trained on

2022-02-14 04514, 2022

14:56 PM
tandy1000

but ofc the algo wasnt useful sadly

2022-02-14 04538, 2022

14:56 PM
riksucks

yes the algo did feature extraction and then used SVM iirc

2022-02-14 04555, 2022

14:56 PM
riksucks

truly believe that some sort of deep learning model is needed ngl

2022-02-14 04557, 2022

14:56 PM
tandy1000

agree

2022-02-14 04505, 2022

14:57 PM
tandy1000

what if we used the dataset that the dude on the forums created?

2022-02-14 04514, 2022

14:57 PM
tandy1000

all the free bandcamp downloads..

2022-02-14 04520, 2022

14:57 PM
tandy1000

that might be legal?

2022-02-14 04532, 2022

14:57 PM
riksucks

https://arxiv.org/pdf/1809.07276.pdf

2022-02-14 04532, 2022

14:57 PM
riksucks

refer to section 3.1 here, it seems like an interesting algo

2022-02-14 04549, 2022

14:57 PM
riksucks

tandy: interesting point

2022-02-14 04559, 2022

14:57 PM
riksucks

do we have access to lyrics btw?

2022-02-14 04522, 2022

14:58 PM
mayhem

rbsam176: lets look at this in detail tomorrow.

2022-02-14 04522, 2022

15:30 PM
reosarevok

bitmap, yvanzo: I'm missing the meeting tonight, but don't forget we're meeting tomorrow at the same time for schema change discussion :)

2022-02-14 04543, 2022

15:31 PM
MRiddickW joined the channel

2022-02-14 04555, 2022

15:33 PM
BrainzGit

[listenbrainz-server] 14akshaaatt opened pull request #1861 (03master…akshat/code-cleanup4): Refactor Codebase Part Four https://github.com/metabrainz/listenbrainz-server…

2022-02-14 04514, 2022

15:41 PM
CatQuest

[13:19] <monkey> And it might not sound like it, but I'm in favor of dropping the "my" prefix and find a better way to clarify what's personalized for the user

2022-02-14 04515, 2022

15:41 PM
CatQuest

<-- yes pls, no "my-ing" all the time

2022-02-14 04504, 2022

15:43 PM
rbsam176

mayhem: Sounds good! I'll shoot over a message tomorrow 👍

2022-02-14 04513, 2022

15:43 PM
monkey

riksucks: Not really. Lyrics are copyrighted content so we can't store them. There are some relationships to websites where lyrics can be found though.

2022-02-14 04523, 2022

15:47 PM
riksucks

I see, makes sense

2022-02-14 04537, 2022

16:00 PM
agatzk has quit

2022-02-14 04552, 2022

16:01 PM
agatzk joined the channel

2022-02-14 04524, 2022

16:32 PM
rbsam176 has quit

2022-02-14 04518, 2022

16:33 PM
bitmap

reosarevok: ack

2022-02-14 04527, 2022

17:23 PM
alastairp

tandy1000: yeah, you are correct - the current SVM algorithms that we have in the current version of AB work well in some cases, but research has moved on to CNNs and other types of neural networks, and results have shown that the more audio data that you have available, the better they work

2022-02-14 04523, 2022

17:31 PM
alastairp

however storing enough data to do that is problematic - you're right. the current data that we have cannot be reconstructed back into audio, however the much more detailed data used in CNNs can be (even though the quality isn't perfect)

2022-02-14 04526, 2022

17:32 PM
alastairp

we built some datasets with CC licensed audio to get around the copyright issue and used it in some challenges for mood recognition, https://github.com/MTG/mtg-jamendo-dataset and https://multimediaeval.github.io/2021-Emotion-and…

2022-02-14 04512, 2022

17:38 PM
MRiddickW has quit

2022-02-14 04551, 2022

17:38 PM
alastairp

riksucks: you might be interested in looking at some of the results for this challenge - https://multimediaeval.github.io/2021-Emotion-and…, I'm just checking to get the URLs to the rest of the entries, we have source code available

2022-02-14 04547, 2022

17:39 PM
alastairp

the other big issue is getting enough training data to build the model, what we're planning to do in listenbrainz now could be a great tool for this, asking users to contribute values for mood

2022-02-14 04505, 2022

17:41 PM
alastairp

having the data itself is great, but having a bunch of data validated by people is also good for training the models

2022-02-14 04547, 2022

17:41 PM
alastairp

as usual, with this kind of thing, building the model is easy - it's gathering the data and working out if it's useful which is the hard part

2022-02-14 04507, 2022

17:44 PM
alastairp

riksucks: I have a few deep learning models for mood from MTG, I have a notebook that runs all combinations, let me upload it for you

2022-02-14 04550, 2022

17:44 PM
alastairp

I'm not as familiar with the process of training models, but I'll have a look for the code that my coworkers use (we're training on 4 GPUs, this is another problem with deep learning, the hardware resources are much higher than other ML models)

2022-02-14 04544, 2022

17:49 PM
TOPIC: MetaBrainz Community and Development channel | MusicBrainz non-development: #musicbrainz | BookBrainz: #bookbrainz | Channel is logged; see https://musicbrainz.org/doc/IRC for details | Agenda: Reviews

2022-02-14 04536, 2022

17:50 PM
CatQuest

sorry i'll be afk too

2022-02-14 04540, 2022

17:50 PM
CatQuest

Freso:

2022-02-14 04506, 2022

17:55 PM
alastairp

I have a repairman at home, might arrive a bit late, I'll let you know when I'm back

2022-02-14 04543, 2022

17:57 PM
tandy1000

<alastairp> "however storing enough data to..." <- is the reconstructable data allowed to be stored?

2022-02-14 04550, 2022

17:57 PM
tandy1000

thanks for the clarifications btw alastair!

2022-02-14 04508, 2022

18:00 PM
Freso

<BANG>

2022-02-14 04508, 2022

18:00 PM
Freso

It’s V‐Monday <https://www.vday.org/>!

2022-02-14 04508, 2022

18:00 PM
Freso

https://www.youtube.com/watch?v=ZmWBrN7QV6Y

2022-02-14 04508, 2022

18:00 PM
Freso

We have a couple of people not here today, but other than those, the regulars are up: yvanzo, zas, monkey, akshaaatt, mayhem, bitmap, lucifer, alastairp, Freso – anyone else who wish to give review, let me know ASAP. :)

2022-02-14 04508, 2022

18:00 PM
Freso

reosarevok: Go!

2022-02-14 04509, 2022

18:00 PM
Freso

reosarevok says…

2022-02-14 04509, 2022

18:00 PM
Freso

"""

2022-02-14 04510, 2022

18:00 PM
Freso

Hi! I'm trying to have a chill night today so just mailing this in instead :)

2022-02-14 04510, 2022

18:00 PM
Freso

Last week I mostly worked on tests (both adding a few missing ones and adding documentation for a bunch of existing ones).

2022-02-14 04511, 2022

18:00 PM
Freso

I also updated a bunch of PRs for the next MBS milestone, and released a new MBS version.

2022-02-14 04511, 2022

18:00 PM
Freso

Also, I'm finally done helping CatQuest document instrument adding, and I think it looks quite good now:

2022-02-14 04512, 2022

18:00 PM
Freso

https://wiki.musicbrainz.org/User:CatCat/INSTwave

2022-02-14 04528, 2022

18:00 PM
KassOtsimine

:O

2022-02-14 04536, 2022

18:00 PM
Freso

It would be great if someone else who wasn't used to instrument work could check and see if they see something missing, because both of us know how it works and we might have overlooked something that is not obvious for others :)

2022-02-14 04536, 2022

18:00 PM
Freso

This week I expect to mostly write more tests and update / improve more PRs.

2022-02-14 04536, 2022

18:00 PM
Freso

Fin! Go CatQuest maybe :)

2022-02-14 04536, 2022

18:00 PM
Freso

"""

2022-02-14 04536, 2022

18:00 PM
Freso

CatQuest won’t be around though, and atj is not either! atj: Go!

2022-02-14 04537, 2022

18:00 PM
Freso

atj says…

2022-02-14 04537, 2022

18:00 PM
Freso

"""

2022-02-14 04538, 2022

18:00 PM
Freso

Last week I had a meeting with zas who explained the MB infrastructure and its history as well as the challenges that need to be resolved in the short to medium term. I created a syswiki page based on this information for future reference. I also spent some time reading various documentation on the syswiki to try and familiarise myself more with the infrastructure and how things work etc.

2022-02-14 04538, 2022

18:00 PM
Freso

zas and I then had another meeting to discuss Ansible and how it could be used to simplify deployment, configuration and maintenance of MB infrastructure going forward. I created a simple playbook to run against a test VM to demonstrate some of the concepts and functionality. zas’ feedback was positive and we plan to use Ansible to deploy a replacement server in the near future as a real world test.

2022-02-14 04539, 2022

18:00 PM
Freso

"""

2022-02-14 04539, 2022

18:00 PM
Freso

yvanzo: Go!

2022-02-14 04539, 2022

18:01 PM
yvanzo

Hi!

2022-02-14 04500, 2022

18:02 PM
yvanzo

Last week I installed a Jira add-on to help Freso with pruning spam in tickets.

2022-02-14 04529, 2022

18:02 PM
yvanzo

Also merged a utility script to collect container logs, can be useful to: akshaaatt, alastair, bitmap, lucifer, and mayhem mostly, but also to: atj and zas.

2022-02-14 04544, 2022

18:02 PM
yvanzo

(alastairp ^)

2022-02-14 04500, 2022

18:03 PM
akshaaatt

Thanks yvanzo !

2022-02-14 04505, 2022

18:03 PM
yvanzo

Completed the deployment of an MBS mirror instance for VolumIO. It helped with debugging a few issues in musicbrainz-docker and sir.

2022-02-14 04540, 2022

18:03 PM
yvanzo

Reviewed many PRs, and updated my PR on improving log timestamps in MBS.

2022-02-14 04503, 2022

18:04 PM
yvanzo

Plus documented sysadmin (container logs, SolrCloud alert, MB incident log), and tuned GitHub settings for some repositories.

2022-02-14 04516, 2022

18:04 PM
yvanzo

Fin. Go zas!

2022-02-14 04537, 2022

18:04 PM
zas

Hey

2022-02-14 04546, 2022

18:04 PM
Freso

(Still up: monkey, akshaaatt, mayhem, bitmap, lucifer, alastairp, Freso – anyone else who wish to give review, let me know ASAP. :))

2022-02-14 04558, 2022

18:04 PM
zas

I did upgrades on grafana & discourse (security fixes)

2022-02-14 04513, 2022

18:05 PM
zas

also worked on Picard, PR reviews mainly

2022-02-14 04543, 2022

18:05 PM
zas

Plus usual infrastructure supervision, minor issues handling

2022-02-14 04521, 2022

18:06 PM
zas

As said by atj, we had very constructive meetings, we are thinking about deploying Ansible to help us deploy & maintain servers

2022-02-14 04559, 2022

18:06 PM
zas

I reviewed possible candidates for real-life tests, I'm thinking about cage first

2022-02-14 04531, 2022

18:07 PM
zas

We will order a new server (if ruaok agrees), and set up Ansible to fully deploy initial stuff, up to docker

2022-02-14 04547, 2022

18:07 PM
zas

I think Ansible could help a lot to maintain containers

2022-02-14 04512, 2022

18:08 PM
zas

that's it for me. monkey ?

2022-02-14 04519, 2022

18:08 PM
monkey

Hello !

2022-02-14 04503, 2022

18:09 PM
monkey

last week I helped O'Yvanz debug the new remote log collector script

2022-02-14 04525, 2022

18:09 PM
monkey

I reviewed a bunch of PRs for ListenBrainz and BookBrainz

2022-02-14 04517, 2022

18:10 PM
monkey

Continued helping Shubh work on the server routes and userscript to import entities from other websites

2022-02-14 04559, 2022

18:10 PM
monkey

I continued my homework reading up on and thinking about mood classification, and had a good discussion with alastair on that

2022-02-14 04522, 2022

18:11 PM
monkey

I spent some time fiddling with Webpack, because that's what JS developers do.

2022-02-14 04537, 2022

18:11 PM
monkey

On Friday I helped mayhem hack on a new endpoint to quickly make a playable page in ListenBrainz from recording or album MBIDs.

2022-02-14 04548, 2022

18:11 PM
monkey

I made a simple userscript that makes MusicBrainz playable that way. It adds a “Play on ListenBrainz” button in the sidebar on MB pages (release, release group, recording and collection)

2022-02-14 04508, 2022

18:12 PM
monkey

I've been waiting for this since I learned about MusicBrainz, so it's really nice to play with it

2022-02-14 04532, 2022

18:12 PM
monkey

To try it out visit this link: https://gist.github.com/MonkeyDo/6b919302e97dc979…

2022-02-14 04540, 2022

18:12 PM
monkey

(currently in testing phase)

2022-02-14 04504, 2022

18:13 PM
monkey

And that's it for me! lucifer go!

2022-02-14 04511, 2022

18:13 PM
lucifer

hi all!

2022-02-14 04550, 2022

18:13 PM
lucifer

I made some progress on moving the delete listens stuff to a periodic cron job according to the discussion I had with mayhem.

2022-02-14 04507, 2022

18:14 PM
lucifer

that led to some scary realisations about postgres transactions.

2022-02-14 04528, 2022

18:15 PM
lucifer

i also worked on artist/recording similarity and spent much of the week helping debug that. for some reason, using user_id instead of user_names for this is creating worse results. still haven't gotten to the bottom of that issue.

2022-02-14 04555, 2022

18:15 PM
alastairp

i'm here

2022-02-14 04509, 2022

18:16 PM
lucifer

Reviewed some CB PRs for Ansh and worked with alastairp on a CB release.

2022-02-14 04552, 2022

18:16 PM
lucifer

also, worked on some BU improvements. and finally worked with mayhem to restart the Yotube Quota verification process.

2022-02-14 04505, 2022

18:17 PM
lucifer

thats it for me, alastairp next?

2022-02-14 04516, 2022

18:17 PM
alastairp

yo! tube, the hot new app

2022-02-14 04527, 2022

18:17 PM
alastairp

I reviewed some of lucifer's PRs on LB, which simplify some of our underlying connection handling and we merged them

2022-02-14 04534, 2022

18:17 PM
alastairp

I fixed a bug in the checking of out-of-date dumps in LB and added some new functionality.

2022-02-14 04539, 2022

18:17 PM
alastairp

lucifer finished some work on some old CB PRs and we tested them, merged, and deployed. I spent some time with monkey talking about ideas for capturing genre annotations from users in LB

2022-02-14 04544, 2022

18:17 PM
alastairp

I also started upgrading some dependencies in LB - python 3.9, and flask 2 to fix a bug with caching of static items in LB during development

2022-02-14 04557, 2022

18:17 PM
alastairp

akshaaatt: you're up?

2022-02-14 04502, 2022

18:18 PM
akshaaatt

Yes!

2022-02-14 04510, 2022

18:18 PM
akshaaatt

Hi Everyone!

2022-02-14 04520, 2022

18:18 PM
akshaaatt

I continued my work by revamping MB further.

2022-02-14 04530, 2022

18:18 PM
akshaaatt

Also, focused on LB as we are refactoring the frontend codebase. We have successfully updated the LB homepage and some other sections!

2022-02-14 04551, 2022

18:18 PM
akshaaatt

I have also been designing some cool real components for us to use in our websites! Let's see how that goes.

2022-02-14 04507, 2022

18:19 PM
akshaaatt

Other than that our colleges have reopened and I was busy adjusting back to hostel life and visiting the campus daily!

2022-02-14 04508, 2022

18:19 PM
Freso

(Only mayhem, bitmap, and myself are still up. Last call for anyone else who wish to give review!)

2022-02-14 04516, 2022

18:19 PM
akshaaatt

That's about it for me. Go mayhem!

2022-02-14 04540, 2022

18:19 PM
akshaaatt

react^^

2022-02-14 04506, 2022

18:20 PM
Freso

mayhem: Ping?

2022-02-14 04513, 2022

18:20 PM
Freso

bitmap: Go in the meantime.

2022-02-14 04518, 2022

18:20 PM
bitmap

hello

2022-02-14 04526, 2022

18:20 PM
Freso

Did I miss a ping from Rob saying he wouldn’t be here?

2022-02-14 04543, 2022

18:20 PM
bitmap

last week I mostly did code review and worked more on schema change branches

2022-02-14 04557, 2022

18:20 PM
bitmap

we plan to decide on tickets for that tomorrow

2022-02-14 04516, 2022

18:21 PM
bitmap

I also documented deployment stuff related to StaticBrainz in syswiki

2022-02-14 04537, 2022

18:21 PM
bitmap

fin! go mayhem if here now

2022-02-14 04524, 2022

18:22 PM
Freso

I guess I’ll go and if mayhem’s here when I’m done, he can go, otherwise we’ll just skip.

2022-02-14 04527, 2022

18:22 PM
Freso

So.

2022-02-14 04530, 2022

18:22 PM
Freso

Freso: Go!

2022-02-14 04532, 2022

18:22 PM
Freso

🙋

2022-02-14 04526, 2022

18:24 PM
Freso

Dealt with some reported editors, looked into client-side image optimisation for the forum after the upgrade, got rid of some spam on Jira, dealt with some flagged forum content, and broke out some discussions into separate topics, etc.

2022-02-14 04548, 2022

18:24 PM
Freso

Got kellnerd a MetaBrainz IRC cloak.

2022-02-14 04506, 2022

18:25 PM
Freso

I think that’s about it, so, fin!

2022-02-14 04524, 2022

18:25 PM
Freso

And no sign-of-life from Rob, so this wraps up the reviews: thank you everyone who contributed!

2022-02-14 04518, 2022

18:26 PM
Freso

And no more items on the agenda, so this also wraps up the meeting.

2022-02-14 04529, 2022

18:26 PM
Freso

Thank you all for your time! Stay safe out there! :)

2022-02-14 04531, 2022

18:26 PM
monkey

Thanks Freso !

2022-02-14 04533, 2022

18:26 PM
Freso

</BANG>

2022-02-14 04537, 2022

18:26 PM
akshaaatt

Thank you!

2022-02-14 04531, 2022

18:27 PM
yvanzo

Thanks!

2022-02-14 04511, 2022

18:30 PM
alastairp

tandy1000: you can take a look at these examples: https://andrebola.github.io/EUSIPCO2020/ - (the "examples" link in the first paragraph) and play back some of the examples

2022-02-14 04556, 2022

18:30 PM
alastairp

they show how as we remove more and more data, it gets more difficult to reconstruct the audio. however, more data is always useful for training, so it's a tradeoff between what can be reconstructed and the results from your model

2022-02-14 04546, 2022

18:31 PM
alastairp

making things more difficult, techniques for _reconstruction_ are always improving too, someone I know did some studies where he took some work that was thought to be not reconstructable and applied some new deep learning techniques to it, and got pretty "good" results

2022-02-14 04552, 2022

18:33 PM
alastairp

in specific response to your question - "is reconstructable data able to be stored?", that's a great question, with no absolute clear answer. however in most cases it's probably going to be "no"

2022-02-14 04505, 2022

18:35 PM
alastairp

5-10 years ago, the data that we have in AB was used for much of the state-of-the-art research, and while it didn't give as good answers as the more recent deep learning models, it worked well in many cases. This data is impossible to reconstruct, so this wasn't as much of a problem

2022-02-14 04503, 2022
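
A minimal sketch of why heavily summarised features resist reconstruction, assuming (this is an assumption, not something stated in the log) that the stored data are per-track summary statistics of frame-level features. The `summarize` function and the toy frame values are hypothetical and only illustrate the idea: two different frame sequences can collapse to the same summary, so the summary alone cannot tell you which one you started from.

```python
from statistics import fmean, pvariance


def summarize(frames):
    """Collapse per-frame feature vectors into one (mean, variance) pair per dimension.

    Hypothetical helper for illustration; not the AcousticBrainz extractor.
    """
    dimensions = zip(*frames)  # regroup values by feature dimension
    return [(fmean(values), pvariance(values)) for values in dimensions]


# Toy "tracks": four frames with two features each (made-up numbers).
frames_a = [[0.0, 1.0], [1.0, 0.0], [2.0, 1.0], [3.0, 0.0]]
# The same frames played in reverse order: a different signal over time.
frames_b = list(reversed(frames_a))

summary_a = summarize(frames_a)
summary_b = summarize(frames_b)

# The frame sequences differ, but their summaries are identical,
# so the original ordering cannot be recovered from the summary.
print(frames_a != frames_b)
print(summary_a == summary_b)
```

Keeping more detail per frame would make the two tracks distinguishable, and more useful for training, but it would also move the stored data closer to something that can be turned back into audio, which is the tradeoff described above.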

18:37 PM
alastairp

yvanzo: thanks for the ping on the log collection tool - I didn't follow the PR last week, but I'll take a look and see if I can use it

2022-02-14 04521, 2022

18:37 PM
tandy1000

very interesting

2022-02-14 04554, 2022

18:44 PM
mayhem

Huh, i'm guessing my status update is still stuck in my outbound mail queue. Sorry for that, everyone!