because it kept on saying that they had really high similarity
2019-03-05 06422, 2019
ruaok
can we frame this in terms of "this was done, we solved the theoretical aspects, but now we need to scale it"?
2019-03-05 06430, 2019
alastairp
absolutely
2019-03-05 06448, 2019
ruaok
that sounds like a perfect gsoc project, no?
2019-03-05 06417, 2019
alastairp
scalability is a potential issue
2019-03-05 06432, 2019
ruaok
it is *the* issue, no?
2019-03-05 06439, 2019
alastairp
we had no machine here which was fast enough to really get an idea about how difficult the issue was
2019-03-05 06403, 2019
alastairp
we were running with about 25% of the database, and it was taking ages to do stuff (but our infrastructure isn't as powerful as the hetzner dedicated servers)
2019-03-05 06403, 2019
ruaok
I suspect that we'll need to solve this using spark.
2019-03-05 06408, 2019
rsh7
iliekcomputers: got time to rebase the integration branch?
2019-03-05 06425, 2019
alastairp
right, so it depends on what our goals with this are, and how we want to query it
2019-03-05 06443, 2019
alastairp
if we can wait a few seconds or tens of seconds, BQ or spark are ideal candidates
2019-03-05 06454, 2019
ruaok
or we rent a stupidly big cloud instance and run it on that periodically.
2019-03-05 06416, 2019
ruaok
my goals is to feed more data into training recommendation engines.
2019-03-05 06418, 2019
ruaok
so batches are fine.
2019-03-05 06420, 2019
alastairp
and it's possible that spark may even be better, because we're not constrained by cube's requirement that the distance metrics are linear
2019-03-05 06428, 2019
alastairp
right, so, it also depends on what the usecase is
2019-03-05 06440, 2019
alastairp
clustering? or selection of similarity from a single example instance?
2019-03-05 06441, 2019
ruaok
this is the usecase I want to solve.
2019-03-05 06408, 2019
ruaok
I'm unsure of how to get there, but I am sure that I want a track similarity mapping in the end
2019-03-05 06420, 2019
alastairp
I'm not sure if it's responsible to select every item in the database and independently caculate its similarity with every other track
alastairp: another favor please... can you recommend some papers that combine user behavioral data with acoustic data in order to build recommendation engines?
2019-03-05 06407, 2019
ruaok
didn't dimi do his PhD on that?
2019-03-05 06408, 2019
alastairp
mmm, good question
2019-03-05 06413, 2019
alastairp
yeah, I think you're right
2019-03-05 06428, 2019
alastairp
Gabriel did stuff on behaviour data I think, I'm not sure if he included acoustic data
2019-03-05 06429, 2019
ruaok
that is the next thing we need to understand.
2019-03-05 06452, 2019
ruaok
yea, we have the CF in spark to do that. but dimi taught me that we need AB as well.
reosarevok registered for the "do you know Estonian laws" exam
2019-03-05 06424, 2019
reosarevok
One step closer to being a citizen, yay
2019-03-05 06430, 2019
CatQuest
reosarevok: even more school though.
2019-03-05 06433, 2019
CatQuest
perpetual student you
2019-03-05 06435, 2019
reosarevok
Nah
2019-03-05 06439, 2019
djinni` joined the channel
2019-03-05 06450, 2019
CatQuest
but yay!
2019-03-05 06455, 2019
reosarevok
You basically just go there and are given the Constitution and stuff and just need to show you can figure it out :p
2019-03-05 06410, 2019
reosarevok
So I don't think it actually requires any studying
2019-03-05 06420, 2019
reosarevok
Might like read it once ahead of time just in case, but
2019-03-05 06431, 2019
CatQuest
yea beter be safe thna srry :P
2019-03-05 06436, 2019
CatQuest
better sorry*
2019-03-05 06405, 2019
gr0uch0mars joined the channel
2019-03-05 06444, 2019
amCap1712
hi gr0uch0mars
2019-03-05 06415, 2019
amCap1712
Can you explain the way you referred in the PR comment to organize code
2019-03-05 06410, 2019
gr0uch0mars
hi. yes I was going to look for a good post to link here, but meanwhile I referred to organizing certain files into features
2019-03-05 06449, 2019
gr0uch0mars
like all files of the presentation-layer of Artist together: viewModel, activity, adapters, fragments…
2019-03-05 06441, 2019
gr0uch0mars
that way, it's easier to have a quick preview of what does the app offers (something related to Artist), and there's “only” one place if you have to touch the code
2019-03-05 06404, 2019
gr0uch0mars
Here is a link about Clean Architecture (way beyond simply “grouping features”) that, although difficult to implement in its totality, it worth reading about: https://fernandocejas.com/2018/05/07/architecting…
amCap1712: take a look at the post and share your thoughts. Working with a good architecture is as important as making code work (although not urgent)
2019-03-05 06454, 2019
amCap1712
thanks gr0uch0mars
2019-03-05 06418, 2019
ahmedkrmn has quit
2019-03-05 06430, 2019
gr0uch0mars
amCap1712: other question I was thinking yesterday. Is there a design for the app UI? Or can we work on an improved design?
2019-03-05 06458, 2019
amCap1712
gr0uch0mars: we can work on improved design
2019-03-05 06416, 2019
gr0uch0mars
great. Let me think of some ideas and I'll share them. Meanwhile we can work on presenting the data retrieved from the API in an “ordered” manner, like you are doing for Artists
2019-03-05 06427, 2019
Freso
Hm. Does it make sense to continue to list AcousticBrainz on /supporters with it also being listed on /projects?
2019-03-05 06436, 2019
amCap1712
ok great gr0uch0mars
2019-03-05 06451, 2019
Freso
It feels a bit like "oh, hey, we support ourselves!", no?
2019-03-05 06406, 2019
Freso
UPF is already listed on their own.
2019-03-05 06439, 2019
alastairp
Freso: I think that "supporters" is directly linked to "has an API key to download the database"
2019-03-05 06445, 2019
alastairp
which is why AB is on supporters
2019-03-05 06424, 2019
Freso
Could be. Just looks a bit odd and self‐congratulatory to me is all. :)
2019-03-05 06433, 2019
alastairp
maybe we can hide the account from the page if you really want, but I'm not sure if it's needed