this is what I did over the weekend. I have not included recommed.py stuff in here because candidate_sets inferences is enough to throw light on how recommend.py will change
2019-11-11 31554, 2019
pristine__
let me know whenever you get the chance to read
2019-11-11 31530, 2019
nav2002 joined the channel
2019-11-11 31515, 2019
Darkloke joined the channel
2019-11-11 31515, 2019
Nyanko-sensei joined the channel
2019-11-11 31528, 2019
D4RK joined the channel
2019-11-11 31529, 2019
Nyanko-sensei has quit
2019-11-11 31538, 2019
D4RK-PH0ENiX has quit
2019-11-11 31514, 2019
Wizzup has quit
2019-11-11 31534, 2019
iliekcomputers
A Google doc / Dropbox paper would probably be much more readable than a gist.
2019-11-11 31515, 2019
pristine__
iliekcomputers: thanks for the suggestion :)
2019-11-11 31548, 2019
Wizzup joined the channel
2019-11-11 31521, 2019
D4RK has quit
2019-11-11 31516, 2019
D4RK-PH0ENiX joined the channel
2019-11-11 31536, 2019
Wizzup has quit
2019-11-11 31518, 2019
Wizzup joined the channel
2019-11-11 31518, 2019
iliekcomputers
No peob
2019-11-11 31520, 2019
iliekcomputers
Prob
2019-11-11 31544, 2019
Darkloke has quit
2019-11-11 31501, 2019
pristine__
Though gists are easier for me :p
2019-11-11 31517, 2019
Wizzup has quit
2019-11-11 31529, 2019
Wizzup joined the channel
2019-11-11 31511, 2019
Omnipoint joined the channel
2019-11-11 31559, 2019
Omnipoint has quit
2019-11-11 31502, 2019
Gazooo has quit
2019-11-11 31536, 2019
Gazooo joined the channel
2019-11-11 31515, 2019
DjSlash
pristine__: if you'd rename it to a .md file, then github should render it
2019-11-11 31512, 2019
pristine__
DjSlash: lol I know. It was just the first draft. I will anyway. Thanks
2019-11-11 31509, 2019
ruaok
DjSlash: that's a pretty easy, but good suggestion.
2019-11-11 31526, 2019
iliekcomputers
DjSlash: nice nick
2019-11-11 31530, 2019
iliekcomputers
:D
2019-11-11 31539, 2019
ruaok
also,moooin!
2019-11-11 31545, 2019
iliekcomputers
Moin!
2019-11-11 31548, 2019
nav2002 has quit
2019-11-11 31553, 2019
DjSlash
iliekcomputers: ha, thanks :)
2019-11-11 31554, 2019
iliekcomputers
The pipeline werks
2019-11-11 31500, 2019
ruaok
niiiice!
2019-11-11 31521, 2019
iliekcomputers
Although sending gigs of data in a single rmq message probably doesn't make any sense
2019-11-11 31537, 2019
ruaok
is that the output of all stats?
2019-11-11 31537, 2019
iliekcomputers
There's lots of easy wins in optimization left
2019-11-11 31508, 2019
iliekcomputers
ruaok: the query oomed when we calculated all three stats for all users
2019-11-11 31521, 2019
iliekcomputers
All three being artists, release recording
2019-11-11 31529, 2019
ruaok
oy.
2019-11-11 31533, 2019
iliekcomputers
So I did just artist for making it work
2019-11-11 31543, 2019
iliekcomputers
And it takes a long time to publish
2019-11-11 31551, 2019
iliekcomputers
Needs more investigation
2019-11-11 31500, 2019
zas
Moiinn
2019-11-11 31501, 2019
iliekcomputers
But hey, it works!
2019-11-11 31533, 2019
pristine__
ruaok: if you are uncomfortable reading that lemme know, I will format it to md Or something
2019-11-11 31505, 2019
ruaok
iliekcomputers: all the big questions have been settled, which is nice.
2019-11-11 31519, 2019
ruaok
pristine__: I really like the adding .md extension and then its all done. please do tjat/
2019-11-11 31519, 2019
iliekcomputers
yes!
2019-11-11 31535, 2019
iliekcomputers
i'd like to create a listenbrainz_spark user or something for the configs
2019-11-11 31544, 2019
iliekcomputers
right now it's running from my account which isn't ideal.
2019-11-11 31545, 2019
ruaok
chhavi says hi and will join us for the meeting tonight.
2019-11-11 31509, 2019
iliekcomputers
hi chhavi
2019-11-11 31542, 2019
ruaok
make me a task for creating a new users on the paper and I'll do it in a bit.
2019-11-11 31541, 2019
zas
Hey chhavi
2019-11-11 31507, 2019
ruaok
you off to a'dam today, zas?
2019-11-11 31500, 2019
zas
Yup, Thalys just left Paris
2019-11-11 31537, 2019
ruaok
looks like it will be cold in the north of europe this week.
2019-11-11 31507, 2019
reosarevok
eh, we're having over-0 temps all week, not that bad :p
2019-11-11 31514, 2019
zas
I'll go to pre-register for haproxyconf on arrival, then to hotel
2019-11-11 31530, 2019
zas
It was very cold in Paris, but Amsterdam should be better, around 6°c, expect rain though
It's cold on the other side of the Atlantic this week too…
2019-11-11 31550, 2019
iliekcomputers
ruaok: task added
2019-11-11 31534, 2019
ruaok
k
2019-11-11 31510, 2019
TOPIC: MetaBrainz Community and Development channel | MusicBrainz non-development: #musicbrainz | New GSoC students start here: https://goo.gl/7jsjG2 | Channel is logged; see https://musicbrainz.org/doc/IRC for details | Meeting (18:00 UTC) agenda: Reviews, Google Code-in (Freso), instrument illustrations (Reo/CatQuest), mbsandbox.org (ruaok)
2019-11-11 31555, 2019
chaban joined the channel
2019-11-11 31557, 2019
BrainzGit
[listenbrainz-server] dependabot-preview[bot] opened pull request #666 (master…dependabot/pip/python-dateutil-2.8.1): Bump python-dateutil from 2.8.0 to 2.8.1 https://github.com/metabrainz/listenbrainz-server…
2019-11-11 31532, 2019
ruaok
pristine__: reading the gist now. so, everything is nice and clear leading up to creating playcounts_df. is that right?
2019-11-11 31528, 2019
ruaok
I wonder if the similar artists table should map artists credits instead of artists.
2019-11-11 31546, 2019
ruaok
then you would not have to explode the recordings_df .
2019-11-11 31542, 2019
pristine__
that means an array of mbids right?
2019-11-11 31545, 2019
pristine__
sounds good
2019-11-11 31556, 2019
pristine__
then ono explode
2019-11-11 31500, 2019
pristine__
no*
2019-11-11 31533, 2019
pristine__
> reading the gist now. so, everything is nice and clear leading up to creating playcounts_df. is that right?
2019-11-11 31535, 2019
pristine__
yes
2019-11-11 31513, 2019
pristine__
I mean I have points and stuff to improve quality but for now it's fine. We can just jot down so that it can help us in next GSOC labs project.
2019-11-11 31548, 2019
pristine__
ruaok: ^
2019-11-11 31545, 2019
ruaok
Let me see if using artist credits makes sense for the artist-artist stuff. I remember it being a question and that it made more sense to an artist-artist level than artisrcredit-artistcredit level.
2019-11-11 31542, 2019
pristine__
sure.
2019-11-11 31508, 2019
pristine__
ruaok: how do you feel about the explode and duplicate recording stuff?
2019-11-11 31549, 2019
ruaok
Not good
2019-11-11 31549, 2019
pristine__
yeah :)
2019-11-11 31511, 2019
pristine__
And do you have any other way other than the two I mentioned?
2019-11-11 31521, 2019
pristine__
ruaok: I mean if any, you feel can be better
2019-11-11 31550, 2019
ruaok
Well, having and ac-ac relation instead of a-a should fix it, no?
2019-11-11 31528, 2019
pristine__
yeah, got that. So i mean if you ever in the middle of night or anytime come across any lil point that can in a way fit into a recommndation engine in future no may be at this hour, do share, we can discuss and build docs as we walk the road map and use it sometime somewhere.
2019-11-11 31529, 2019
pristine__
:)
2019-11-11 31504, 2019
iliekcomputers
man i <3 dependabot
2019-11-11 31517, 2019
chaban has quit
2019-11-11 31506, 2019
ruaok
pristine__: ok, will do. now let me examine the a-a/ac-ac case
2019-11-11 31505, 2019
pristine__
sure :)
2019-11-11 31539, 2019
pristine__
and share your findings please :)
2019-11-11 31528, 2019
ruaok
reosarevok: you about?
2019-11-11 31521, 2019
ruaok
pristine__: ok, from where I stand I think it doesn't matter very much from my artist relations perspective.
2019-11-11 31543, 2019
pristine__
what perspective?
2019-11-11 31500, 2019
ruaok
the script that calculates the a-a relations.
2019-11-11 31511, 2019
pristine__
okay
2019-11-11 31525, 2019
ruaok
it automatically explodes the results, but the semantic meaning remains the same.
2019-11-11 31540, 2019
ruaok
so I will create two outputs: one for a-a and one for ac-ac
2019-11-11 31557, 2019
pristine__
do I need to use the former?
2019-11-11 31509, 2019
ruaok
no, you should use the latter going forward
2019-11-11 31542, 2019
pristine__
yeah.
2019-11-11 31547, 2019
ruaok
and really it will be [artist-mbids] - [artist-mbids] as the actual mapping.
2019-11-11 31555, 2019
ruaok
an array to an array.
2019-11-11 31500, 2019
ruaok
since AC's do not have MBIDs.
2019-11-11 31508, 2019
pristine__
that is awsome.
2019-11-11 31509, 2019
pristine__
but
2019-11-11 31516, 2019
reosarevok
ruaok: now I am
2019-11-11 31516, 2019
ruaok
shit. a but.
2019-11-11 31532, 2019
pristine__
the array in ac-ac will always be singular, no?
2019-11-11 31535, 2019
ruaok
reosarevok: perfect timing. I just answered all the questions I had. lol.
2019-11-11 31540, 2019
reosarevok
haha
2019-11-11 31541, 2019
reosarevok
Neat!
2019-11-11 31508, 2019
pristine__
like [a] similar to [b]
2019-11-11 31517, 2019
ruaok
pristine__: only one array mapping to antoher array, yes. but each array could have one or more entries.
2019-11-11 31546, 2019
ruaok
alternatively I can output an ID for artist credit: AC_0,AC_1, relation
2019-11-11 31501, 2019
pristine__
how? I can't clearly understand that I think. Can you give an example. oh, so till now it was like a similar to b, a similar to c
2019-11-11 31511, 2019
pristine__
now it will be together
2019-11-11 31518, 2019
pristine__
a similar to [b,c]
2019-11-11 31522, 2019
pristine__
is it ?
2019-11-11 31524, 2019
ruaok
you fully understand artist credits, yes?
2019-11-11 31554, 2019
pristine__
i guess so. an artist appear with another artist in how many collabs
2019-11-11 31525, 2019
ruaok
yes, but more importantly know that any recording is attributed to an artist_credit. NOT an artist.
2019-11-11 31529, 2019
pristine__
umm....cool. I like this line.
2019-11-11 31533, 2019
pristine__
clear
2019-11-11 31535, 2019
ruaok
so, if we want to avoid exploding the recordings_df, we need to rework your candidate artist work to work on artist_credits, not artists.