#metabrainz

11:12 AM
alastairp

oh, the other thing that popped up around this time was https://github.com/spotify/annoy

2019-03-05 06428, 2019

11:12 AM
ruaok

is the theory behind it solid enough?

2019-03-05 06432, 2019

11:12 AM
alastairp

which we didn't try but looks great

2019-03-05 06442, 2019

11:12 AM
alastairp

what do you mean solid?

2019-03-05 06450, 2019

11:12 AM
alastairp

does the similarity work? yes

2019-03-05 06456, 2019

11:12 AM
ruaok

that.

2019-03-05 06409, 2019

11:13 AM
alastairp

yeah, we had to filter out duplicate submissions

2019-03-05 06419, 2019

11:13 AM
alastairp

because it kept on saying that they had really high similarity
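
A minimal sketch of the duplicate filtering mentioned above, assuming each submission is a (recording ID, feature vector) pair and that "duplicate" means a repeated submission for the same recording. The names `Submission` and `dedupe_submissions` are hypothetical and are not taken from the AcousticBrainz code.

```python
from typing import Iterable, List, Tuple

# Hypothetical record shape: (recording_id, feature_vector).
Submission = Tuple[str, Tuple[float, ...]]


def dedupe_submissions(submissions: Iterable[Submission]) -> List[Submission]:
    """Keep only the first submission seen for each recording ID."""
    seen = set()
    unique = []
    for recording_id, features in submissions:
        if recording_id in seen:
            continue  # a later submission of the same recording
        seen.add(recording_id)
        unique.append((recording_id, features))
    return unique


submissions = [
    ("rec-a", (0.10, 0.90)),
    ("rec-a", (0.11, 0.89)),  # duplicate submission of rec-a
    ("rec-b", (0.80, 0.20)),
]
unique = dedupe_submissions(submissions)
```

Without a step like this, the nearest neighbours of a track would mostly be other submissions of the same recording rather than different, similar tracks.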

2019-03-05 06422, 2019

11:13 AM
ruaok

can we frame this in terms of "this was done, we solved the theoretical aspects, but now we need to scale it"?

2019-03-05 06430, 2019

11:13 AM
alastairp

absolutely

2019-03-05 06448, 2019

11:13 AM
ruaok

that sounds like a perfect gsoc project, no?

2019-03-05 06417, 2019

11:14 AM
alastairp

scalability is a potential issue

2019-03-05 06432, 2019

11:14 AM
ruaok

it is *the* issue, no?

2019-03-05 06439, 2019

11:14 AM
alastairp

we had no machine here which was fast enough to really get an idea about how difficult the issue was

2019-03-05 06403, 2019

11:15 AM
alastairp

we were running with about 25% of the database, and it was taking ages to do stuff (but our infrastructure isn't as powerful as the hetzner dedicated servers)

2019-03-05 06403, 2019

11:15 AM
ruaok

I suspect that we'll need to solve this using spark.

2019-03-05 06408, 2019

11:15 AM
rsh7

iliekcomputers: got time to rebase the integration branch?

2019-03-05 06425, 2019

11:15 AM
alastairp

right, so it depends on what our goals with this are, and how we want to query it

2019-03-05 06443, 2019

11:15 AM
alastairp

if we can wait a few seconds or tens of seconds, BQ or spark are ideal candidates

2019-03-05 06454, 2019

11:15 AM
ruaok

or we rent a stupidly big cloud instance and run it on that periodically.

2019-03-05 06416, 2019

11:16 AM
ruaok

my goal is to feed more data into training recommendation engines.

2019-03-05 06418, 2019

11:16 AM
ruaok

so batches are fine.

2019-03-05 06420, 2019

11:16 AM
alastairp

and it's possible that spark may even be better, because we're not constrained by cube's requirement that the distance metrics are linear

2019-03-05 06428, 2019

11:16 AM
alastairp

right, so, it also depends on what the usecase is

2019-03-05 06440, 2019

11:16 AM
alastairp

clustering? or selection of similarity from a single example instance?

2019-03-05 06441, 2019

11:16 AM
ruaok

this is the usecase I want to solve.

2019-03-05 06408, 2019

11:17 AM
ruaok

I'm unsure of how to get there, but I am sure that I want a track similarity mapping in the end

2019-03-05 06420, 2019

11:17 AM
alastairp

I'm not sure if it's responsible to select every item in the database and independently calculate its similarity with every other track

2019-03-05 06434, 2019

11:17 AM
ruaok

not feasible.
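
A quick way to see why the all-pairs approach does not scale: the number of unordered pairs among n tracks is n(n-1)/2. The track counts below are illustrative assumptions, not the real size of the database, and `pair_count` is a hypothetical helper.

```python
def pair_count(n: int) -> int:
    """Number of unordered pairs among n items: n * (n - 1) / 2."""
    return n * (n - 1) // 2


# Illustrative collection sizes only.
for n in (1_000, 1_000_000, 10_000_000):
    print(f"{n:>12,} tracks -> {pair_count(n):,} comparisons")
```

The comparison count grows quadratically, so a tenfold increase in tracks means roughly a hundredfold increase in work.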

2019-03-05 06436, 2019

11:17 AM
alastairp

right

2019-03-05 06453, 2019

11:17 AM
alastairp

but if it's just clustering, then stuff like annoy or t-sne (https://lvdmaaten.github.io/tsne/) might be really nice

2019-03-05 06407, 2019

11:18 AM
ruaok

hence me suggesting some estimation function that we can use to reduce the number of comparisons we need to make
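
One possible shape for such an estimation function is random-hyperplane hashing: tracks whose feature vectors fall on the same side of a set of random hyperplanes land in the same bucket, and the exact similarity is computed only within a bucket. This is a sketch under assumptions, not something proposed in the channel or the API of any library mentioned here; `make_hyperplanes`, `signature` and `candidate_pairs` are hypothetical names, and the vectors are made-up data.

```python
import random
from collections import defaultdict
from itertools import combinations


def make_hyperplanes(dim, n_planes, seed=0):
    """Draw n_planes random normal vectors of length dim (seeded for repeatability)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_planes)]


def signature(vector, hyperplanes):
    """One bit per hyperplane: which side of the plane the vector falls on."""
    return tuple(
        sum(v * h for v, h in zip(vector, plane)) >= 0.0
        for plane in hyperplanes
    )


def candidate_pairs(vectors, hyperplanes):
    """Return only the pairs of track IDs whose signatures match exactly."""
    buckets = defaultdict(list)
    for track_id, vector in vectors.items():
        buckets[signature(vector, hyperplanes)].append(track_id)
    pairs = set()
    for ids in buckets.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs


# Made-up feature vectors for illustration.
vectors = {
    "track-1": (1.0, 2.0, 3.0),
    "track-2": (1.0, 2.0, 3.0),     # same features as track-1
    "track-3": (-1.0, -2.0, -3.0),  # points the opposite way from track-1
}
planes = make_hyperplanes(dim=3, n_planes=8)
pairs = candidate_pairs(vectors, planes)
```

More hyperplanes give smaller buckets and fewer comparisons, at the cost of occasionally separating tracks that are actually similar, so the result is an approximation rather than an exact similarity mapping.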

2019-03-05 06419, 2019

11:19 AM
iliekcomputers

rsh7: hey, hi! Yes, today. I completely forgot :(

2019-03-05 06442, 2019

11:20 AM
ruaok

let me read the thesis this afternoon and digest that. I'll look at t-SNE too.

2019-03-05 06450, 2019

11:20 AM
ruaok

pristine--: can you please do the same?

2019-03-05 06408, 2019

11:21 AM
rsh7

iliekcomputers: wokay, no problem

2019-03-05 06457, 2019

11:22 AM
reosarevok

ruaok probably knows everything about annoy already, right?

2019-03-05 06458, 2019

11:23 AM
ruaok

look who is talking!

2019-03-05 06416, 2019

11:25 AM
reosarevok

❤️

2019-03-05 06402, 2019

11:26 AM
reosarevok

Heh

2019-03-05 06410, 2019

11:26 AM
reosarevok

Everyone is angry with these SensCritique people

2019-03-05 06417, 2019

11:26 AM
reosarevok

Because they're no longer getting MB updates

2019-03-05 06432, 2019

11:26 AM
reosarevok

ruaok btw, did you answer the Finns? I forgot

2019-03-05 06450, 2019

11:26 AM
ruaok

what's with SC?

2019-03-05 06404, 2019

11:27 AM
pristine--

ruaok: sure :)

2019-03-05 06406, 2019

11:27 AM
ruaok

yes, they are willing to give us data, but we need to be ready for it.

2019-03-05 06404, 2019

11:28 AM
pristine--

ruaok: thesis and T-sne, right?

2019-03-05 06413, 2019

11:28 AM
alastairp

pristine--: and annoy

2019-03-05 06418, 2019

11:28 AM
reosarevok

Twitter complaints about how new stuff added to MB isn't showing up on their page, ruaok

2019-03-05 06430, 2019

11:28 AM
reosarevok

(and they seem to basically be saying "soon TM" and ignoring people)

2019-03-05 06443, 2019

11:28 AM
alastairp

pristine--: the thesis should give you a good overview about how we consider acoustic similarity

2019-03-05 06412, 2019

11:29 AM
alastairp

iliekcomputers: tell me when we should deploy. it's up to you

2019-03-05 06417, 2019

11:29 AM
pristine--

alastairp: okay:)

2019-03-05 06435, 2019

11:32 AM
iliekcomputers

alastairp: 9PM my time today?

2019-03-05 06452, 2019

11:32 AM
alastairp

no problem

2019-03-05 06429, 2019

11:33 AM
iliekcomputers

Cool, thanks!

2019-03-05 06457, 2019

11:34 AM
BrainzGit

[bookbrainz-site] MonkeyDo merged pull request #254 (master…test-subdomain): "fixes BB-309" docs: Add a section about the "test" subdomain in README https://github.com/bookbrainz/bookbrainz-site/pul…

2019-03-05 06457, 2019

11:34 AM
BrainzBot

BB-309: Reference to test.bookbrainz.org in README https://tickets.metabrainz.org/browse/BB-309

2019-03-05 06423, 2019

11:39 AM
travis-ci joined the channel

2019-03-05 06424, 2019

11:39 AM
travis-ci

Project bookbrainz-site build #2034: passed in 3 min 43 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…

2019-03-05 06424, 2019

11:39 AM
travis-ci has left the channel

2019-03-05 06449, 2019

11:54 AM
ruaok

alastairp: another favor please... can you recommend some papers that combine user behavioral data with acoustic data in order to build recommendation engines?

2019-03-05 06407, 2019

11:55 AM
ruaok

didn't dimi do his PhD on that?

2019-03-05 06408, 2019

11:55 AM
alastairp

mmm, good question

2019-03-05 06413, 2019

11:55 AM
alastairp

yeah, I think you're right

2019-03-05 06428, 2019

11:55 AM
alastairp

Gabriel did stuff on behaviour data I think, I'm not sure if he included acoustic data

2019-03-05 06429, 2019

11:55 AM
ruaok

that is the next thing we need to understand.

2019-03-05 06452, 2019

11:55 AM
ruaok

yea, we have the CF in spark to do that. but dimi taught me that we need AB as well.

2019-03-05 06402, 2019

11:56 AM
ruaok

but, what are the algs that are going to scale?

2019-03-05 06437, 2019

11:56 AM
alastairp

http://mtg.upf.edu/node/2817

2019-03-05 06439, 2019

11:56 AM
alastairp

that's his thesis

2019-03-05 06440, 2019

11:56 AM
ahmedkrmn_ joined the channel

2019-03-05 06403, 2019

11:57 AM
ruaok

pristine--: iliekcomputers ^^

2019-03-05 06404, 2019

11:57 AM
alastairp

but I'll try and grab him this week and get him to write a handful of notes with a more distilled focus

2019-03-05 06412, 2019

11:57 AM
ruaok

that would be excellent, thank you!

2019-03-05 06420, 2019

11:57 AM
ruaok

perhaps next week maybe the three of us go to lunch?

2019-03-05 06437, 2019

11:59 AM
ahmedkrmn has quit

2019-03-05 06446, 2019

11:59 AM
ahmedkrmn_ is now known as ahmedkrmn

2019-03-05 06413, 2019

12:00 PM
pristine--

ruaok: should I also come along. Lol

2019-03-05 06445, 2019

12:00 PM
pristine--

And got the paper :)

2019-03-05 06408, 2019

12:02 PM
ruaok

that would be nice, but the commute is a killer.

2019-03-05 06449, 2019

12:05 PM
iliekcomputers

Where's the visa

2019-03-05 06453, 2019

12:05 PM
iliekcomputers

:) :)

2019-03-05 06405, 2019

12:06 PM
ruaok

gaaaaaaaaaah!

2019-03-05 06414, 2019

12:06 PM
ruaok starts twitching madly

2019-03-05 06423, 2019

12:07 PM
iliekcomputers

The commute involves landslides :D

2019-03-05 06455, 2019

12:07 PM
iliekcomputers

And civilians axing trees to free themselves

2019-03-05 06425, 2019

12:08 PM
heisthepirate joined the channel

2019-03-05 06434, 2019

12:08 PM
amCap1712 joined the channel

2019-03-05 06405, 2019

12:09 PM
ruaok

I've told that story to several friends already. to make a contrast between europeans and indians.

2019-03-05 06407, 2019

12:09 PM
Mr_Monkey

iliekcomputers, ruaok : I'm seeing duplicates in my listens after I relinked my Spotify account yesterday. Is that expected?

2019-03-05 06429, 2019

12:09 PM
ruaok

I've seen it before and we should consider that a bug.

2019-03-05 06433, 2019

12:09 PM
ruaok

file a ticket, Mr_Monkey ?

2019-03-05 06437, 2019

12:09 PM
Mr_Monkey

Will file

2019-03-05 06438, 2019

12:09 PM
Mr_Monkey

:)

2019-03-05 06458, 2019

12:10 PM
CatQuest

pristine--: morn morn is the norwegian equivalent of "moin"

2019-03-05 06416, 2019

12:11 PM
CatQuest

it's commonly said as "morn morn" twice not just "morn" once

2019-03-05 06403, 2019

12:13 PM
code_master5 joined the channel

2019-03-05 06400, 2019

12:14 PM
pristine--

CatQuest: oh. I see.

2019-03-05 06409, 2019

12:14 PM
pristine--

ruaok: yeah. Visa. Lol

2019-03-05 06443, 2019

12:15 PM
CatQuest

oh no not the visas again :C

2019-03-05 06449, 2019

12:15 PM
CatQuest

:(

2019-03-05 06431, 2019

12:17 PM
heisthepirate has quit

2019-03-05 06439, 2019

12:21 PM
CatQuest

that could be clearer in the edit

2019-03-05 06435, 2019

12:22 PM
travis-ci joined the channel

2019-03-05 06436, 2019

12:22 PM
travis-ci

metabrainz/picard#4404 (master - 9a7b323 : Laurent Monin): The build passed.

2019-03-05 06436, 2019

12:22 PM
travis-ci

Change view : https://github.com/metabrainz/picard/compare/6d78…

2019-03-05 06436, 2019

12:22 PM
travis-ci

Build details : https://travis-ci.org/metabrainz/picard/builds/50…

2019-03-05 06436, 2019

12:22 PM
travis-ci has left the channel

2019-03-05 06447, 2019

12:26 PM
djinni`_ has quit

2019-03-05 06413, 2019

12:30 PM
reosarevok registered for the "do you know Estonian laws" exam

2019-03-05 06424, 2019

12:30 PM
reosarevok

One step closer to being a citizen, yay

2019-03-05 06430, 2019

12:30 PM
CatQuest

reosarevok: even more school though.

2019-03-05 06433, 2019

12:30 PM
CatQuest

perpetual student you

2019-03-05 06435, 2019

12:30 PM
reosarevok

Nah

2019-03-05 06439, 2019

12:30 PM
djinni` joined the channel

2019-03-05 06450, 2019

12:30 PM
CatQuest

but yay!

2019-03-05 06455, 2019

12:30 PM
reosarevok

You basically just go there and are given the Constitution and stuff and just need to show you can figure it out :p

2019-03-05 06410, 2019

12:31 PM
reosarevok

So I don't think it actually requires any studying

2019-03-05 06420, 2019

12:31 PM
reosarevok

Might like read it once ahead of time just in case, but

2019-03-05 06431, 2019

12:31 PM
CatQuest

yea beter be safe thna srry :P

2019-03-05 06436, 2019

12:31 PM
CatQuest

better sorry*

2019-03-05 06405, 2019

12:38 PM
gr0uch0mars joined the channel

2019-03-05 06444, 2019

12:39 PM
amCap1712

hi gr0uch0mars

2019-03-05 06415, 2019

12:40 PM
amCap1712

Can you explain the way of organizing code that you referred to in the PR comment?

2019-03-05 06410, 2019

12:41 PM
gr0uch0mars

hi. yes I was going to look for a good post to link here, but meanwhile I referred to organizing certain files into features

2019-03-05 06449, 2019

12:41 PM
gr0uch0mars

like all files of the presentation-layer of Artist together: viewModel, activity, adapters, fragments…

2019-03-05 06441, 2019

12:42 PM
gr0uch0mars

that way, it's easier to have a quick preview of what the app offers (something related to Artist), and there's “only” one place if you have to touch the code

2019-03-05 06404, 2019

12:45 PM
gr0uch0mars

Here is a link about Clean Architecture (way beyond simply “grouping features”) that, although difficult to implement in its totality, is worth reading about: https://fernandocejas.com/2018/05/07/architecting…

2019-03-05 06433, 2019

12:47 PM
ruaok

the supporters page is getting pretty long! https://metabrainz.org/supporters

2019-03-05 06454, 2019

12:47 PM
gr0uch0mars

amCap1712: take a look at the post and share your thoughts. Working with a good architecture is as important as making code work (although not urgent)

2019-03-05 06454, 2019

12:47 PM
amCap1712

thanks gr0uch0mars

2019-03-05 06418, 2019

12:51 PM
ahmedkrmn has quit

2019-03-05 06430, 2019

12:51 PM
gr0uch0mars

amCap1712: another question I was thinking about yesterday. Is there a design for the app UI? Or can we work on an improved design?

2019-03-05 06458, 2019

12:51 PM
amCap1712

gr0uch0mars: we can work on improved design

2019-03-05 06416, 2019

12:53 PM
gr0uch0mars

great. Let me think of some ideas and I'll share them. Meanwhile we can work on presenting the data retrieved from the API in an “ordered” manner, like you are doing for Artists

2019-03-05 06427, 2019

12:53 PM
Freso

Hm. Does it make sense to continue to list AcousticBrainz on /supporters with it also being listed on /projects?

2019-03-05 06436, 2019

12:53 PM
amCap1712

ok great gr0uch0mars

2019-03-05 06451, 2019

12:53 PM
Freso

It feels a bit like "oh, hey, we support ourselves!", no?

2019-03-05 06406, 2019

12:54 PM
Freso

UPF is already listed on their own.

2019-03-05 06439, 2019

12:55 PM
alastairp

Freso: I think that "supporters" is directly linked to "has an API key to download the database"

2019-03-05 06445, 2019

12:55 PM
alastairp

which is why AB is on supporters

2019-03-05 06424, 2019

12:57 PM
Freso

Could be. Just looks a bit odd and self‐congratulatory to me is all. :)

2019-03-05 06433, 2019

12:57 PM
alastairp

maybe we can hide the account from the page if you really want, but I'm not sure if it's needed

2019-03-05 06449, 2019

12:57 PM
Freso

Nah, not if it's something that takes effort.

2019-03-05 06455, 2019

12:57 PM
alastairp

I think it's possible

2019-03-05 06427, 2019

1:04 PM
D4RK-PH0ENiX has quit

2019-03-05 06430, 2019

1:11 PM
gr0uch0m_ joined the channel

2019-03-05 06430, 2019

1:11 PM
gr0uch0mars has quit

2019-03-05 06419, 2019

1:12 PM
ruaok

https://twitter.com/MetaBrainz/status/11029193092…