#metabrainz

/

12:33 PM
BrainzBot

MBS-11212: Incorrect quality attribute in ws/2/release https://tickets.metabrainz.org/browse/MBS-11212

2020-11-10 31534, 2020

12:33 PM
alastairp

right, that's an interesting idea. it certainly depends on how we want to present the information to users

2020-11-10 31513, 2020

12:34 PM
alastairp

"here is a list of artists that are similar to what you listen to", "here are artists similar to what you have been listening to in the last month", "here is a playlist of tracks to listen to that you haven't heard before"

2020-11-10 31528, 2020

12:34 PM
pristine___

So what I mean is, for one week, you will see the same recs, if you refresh your page because there is only one row for you in the db. The next week when you refresh your recs will change and the last updated too

2020-11-10 31546, 2020

12:34 PM
pristine___

> "here is a list of artists that are similar to what you listen to", "here are artists similar to what you have been listening to in the last month", "here is a playlist of tracks to listen to that you haven't heard before"

2020-11-10 31502, 2020

12:35 PM
pristine___

Yeah, though at this point I am not really sure about what we need

2020-11-10 31512, 2020

12:35 PM
pristine___

So I have just tried to keep it simple

2020-11-10 31519, 2020

12:35 PM
alastairp

in fact, perhaps that's a good starting point. We haven't actually talked about this yet

2020-11-10 31527, 2020

12:35 PM
alastairp

and it should be the first thing in the document

2020-11-10 31539, 2020

12:35 PM
alastairp

what is our end goal? Why do we need this? What are we creating?

2020-11-10 31555, 2020

12:35 PM
alastairp

could you add some ideas to the beginning of the document? It doesn't have to be complete

2020-11-10 31511, 2020

12:36 PM
pristine___

Sure

2020-11-10 31514, 2020

12:36 PM
alastairp

you've mentioned a few interesting ideas already

2020-11-10 31521, 2020

12:36 PM
alastairp

others can join in if they want

2020-11-10 31524, 2020

12:36 PM
pristine___

:)

2020-11-10 31516, 2020

12:38 PM
alastairp

oh, I just saw your specification of the `artist` column. that's OK. you should be clearer if you want to just have an artist mbid, or just a credit id, or both

2020-11-10 31524, 2020

12:38 PM
alastairp

and I guess you might have a few too many levels of objects

2020-11-10 31508, 2020

12:39 PM
alastairp

you have {'artist': [...stuff...]}, but I think you can get away with [...stuff...]. the column name needs to be more descriptive too, this should say that they are _recommendations

2020-11-10 31542, 2020

12:39 PM
alastairp

oh, here are 2 more ideas for this table: a list of the artists that were passed into the model, and an indication if the user has listened to the resulting artists before or not

2020-11-10 31557, 2020

12:39 PM
alastairp

API endpoint is fine.

2020-11-10 31500, 2020

12:40 PM
pristine___

I think we just need a unique identifier for an artist which we use later to do a look up for data, so artist mbid is good?

2020-11-10 31516, 2020

12:40 PM
alastairp

if you are able to get an mbid at this point, that's fine

2020-11-10 31534, 2020

12:40 PM
alastairp

what if it's a credit? do you break it down into many artist mbids?

2020-11-10 31549, 2020

12:41 PM
pristine___

No. Just the artist mbid, the list. I am okay with credit id too. I think ruaok can help us here, on how he intend to use these recs and therefore in what format he needs tha data

2020-11-10 31519, 2020

12:42 PM
alastairp

this is where it will be useful to have a clearer understanding of the input data

2020-11-10 31520, 2020

12:42 PM
pristine___

> oh, here are 2 more ideas for this table: a list of the artists that were passed into the model, and an indication if the user has listened to the resulting artists before or not

2020-11-10 31548, 2020

12:42 PM
alastairp

because we know that we can only get out the same type of data that we put in

2020-11-10 31554, 2020

12:42 PM
ruaok

pristine___: the goal of the aritst-artist CF is to replace the ac-ac-relations.

2020-11-10 31516, 2020

12:43 PM
ruaok

so everything needs to be artist_credit, not just artist.

2020-11-10 31521, 2020

12:43 PM
alastairp

your artists_df dataframe has an artist credit id, and mbids for each artist in that credit, and a textual name?

2020-11-10 31530, 2020

12:43 PM
pristine___

What if we filter the artist listened to by the user in the last week in spark and send the resultant (recommended but not listened to in the lastbweek)

2020-11-10 31542, 2020

12:43 PM
pristine___

Rn the plan it to generate recs weekly

2020-11-10 31557, 2020

12:43 PM
pristine___

alastairp: yes

2020-11-10 31505, 2020

12:44 PM
alastairp

so it seems like during the input stage you're going to take a listen, convert it to an acid, and then that'll be the core input in the model. that means the output of the model will also be an acid.

2020-11-10 31534, 2020

12:44 PM
alastairp

you can decide if it makes sense to convert that back to a string and artist mbids as part of the model lookup, or later in listenbrainz

2020-11-10 31507, 2020

12:45 PM
pristine___

Cool, I think it makes sense to send over (artost_credit_id, score) as of now

2020-11-10 31518, 2020

12:45 PM
alastairp

pristine___: right, once you have output from the model in spark, we should do some post-processing to see if the user has listened to that artist before

2020-11-10 31523, 2020

12:45 PM
alastairp

and add that to the response

2020-11-10 31547, 2020

12:45 PM
alastairp

this is where having a list of use-cases will be useful. so we can see what data we need to return

2020-11-10 31556, 2020

12:45 PM
pristine___

Agreed!

2020-11-10 31509, 2020

12:46 PM
alastairp

cool

2020-11-10 31515, 2020

12:46 PM
pristine___

> And add that to the response

2020-11-10 31521, 2020

12:46 PM
pristine___

So there are two options here

2020-11-10 31544, 2020

12:47 PM
pristine___

1. Just remove the artist listened to by the user in the last week and send the remaining artist over the queue

2020-11-10 31544, 2020

12:47 PM
pristine___

2. Flag that these artists which are a part of the recs were listened to by the user in the last week and send over the queue

2020-11-10 31532, 2020

12:48 PM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #1782 (master…MBS-11188): MBS-11188: Block odesli.co smart links https://github.com/metabrainz/musicbrainz-server/…

2020-11-10 31533, 2020

12:48 PM
BrainzBot

MBS-11188: Block smart links: album.link https://tickets.metabrainz.org/browse/MBS-11188

2020-11-10 31505, 2020

12:49 PM
alastairp

I strongly recommend that we flag them, rather than remove them

2020-11-10 31514, 2020

12:49 PM
alastairp

it allows us to do more things with the data later

2020-11-10 31522, 2020

12:49 PM
pristine___

I agree.

2020-11-10 31529, 2020

12:49 PM
pristine___

We can maybe open a ticket

2020-11-10 31553, 2020

12:49 PM
pristine___

And do the same with recording recs. Rn we are removing them. More the data, better it is

2020-11-10 31553, 2020

12:50 PM
alastairp

sure

2020-11-10 31506, 2020

12:52 PM
alastairp

your timeline needs a lot more work. this task can be very easily broken down into multiple steps so it would be good to estimate each of these individually.

2020-11-10 31527, 2020

12:52 PM
pristine___

Yea, I wasn't sure about the timeline then.

2020-11-10 31538, 2020

12:52 PM
pristine___

Breaking into multiple steps sounds good

2020-11-10 31544, 2020

12:52 PM
pristine___

But what is your estimate?

2020-11-10 31556, 2020

12:52 PM
pristine___

How long should it take?

2020-11-10 31516, 2020

12:53 PM
alastairp

you should also add if you expect to be blocked by something - e.g. "this item has to be reviewed and merged and released before I can move on to the next thing"

2020-11-10 31546, 2020

12:53 PM
pristine___

Right

2020-11-10 31558, 2020

12:53 PM
alastairp

maybe you will be able to work on some things in parallel - e.g. you could work on the listenbrainz code while you're waiting for someone to merge the spark code

2020-11-10 31531, 2020

12:54 PM
alastairp

yes, I mean time estimate

2020-11-10 31522, 2020

12:57 PM
pristine___

I was asking, according to you how much time this work should take?

2020-11-10 31549, 2020

12:58 PM
alastairp

honestly, I have no idea. I don't know how much time you have to dedicate to this, I don't know how the spark system works, or how long it takes to do a review and deploy to test something

2020-11-10 31505, 2020

12:59 PM
alastairp

I think with your experience in the recording recommendation, you should be able to make a good estimate for each part

2020-11-10 31537, 2020

13:00 PM
pristine___

Cool

2020-11-10 31521, 2020

13:01 PM
pristine___

So I will be moving to Berlin in a few days, that can delay the work, other than that I think 3-4 weeks.

2020-11-10 31523, 2020

13:01 PM
alastairp

you've also added some APIs and tables to listebrainz, so you should have a pretty good idea of how long that has taken you in the past

2020-11-10 31533, 2020

13:01 PM
pristine___

Right

2020-11-10 31542, 2020

13:01 PM
alastairp

cool! of course, you don't need to work on the plane :)

2020-11-10 31544, 2020

13:02 PM
shivam-kapila

Oh coding on plane is funnnn

2020-11-10 31536, 2020

13:03 PM
pristine___

Haha

2020-11-10 31504, 2020

13:05 PM
pristine___

So we done on artist recs, alastairp ?

2020-11-10 31553, 2020

13:09 PM
alastairp

I guess so

2020-11-10 31535, 2020

13:10 PM
pristine___

Nice.

2020-11-10 31558, 2020

13:10 PM
pristine___

Let's discuss the feedback stuff?

2020-11-10 31503, 2020

13:11 PM
alastairp

I'm going to lunch now, and I have other plans for the afternoon

2020-11-10 31512, 2020

13:11 PM
alastairp

do you have time another day for the feedback?

2020-11-10 31542, 2020

13:12 PM
pristine___

Sure. Maybe tomorrow? Or day after?

2020-11-10 31553, 2020

13:12 PM
alastairp

tomorrow is OK. same time

2020-11-10 31557, 2020

13:12 PM
v6lur_ joined the channel

2020-11-10 31508, 2020

13:13 PM
alastairp

Mr_Monkey: will you be around 12:00 tomorrow?

2020-11-10 31516, 2020

13:13 PM
pristine___

And it will be good if you could have a look at #1149. For reference before meeting

2020-11-10 31525, 2020

13:13 PM
pristine___

Cool. Tomorrow sounds good

2020-11-10 31535, 2020

13:13 PM
alastairp

yes, I've looked at it

2020-11-10 31502, 2020

13:14 PM
pristine___

Nice

2020-11-10 31545, 2020

13:15 PM
v6lur has quit

2020-11-10 31554, 2020

13:29 PM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #1783 (master…MBS-11126): MBS-11126: display track lengths of 0 ms or -1 ms as unknown https://github.com/metabrainz/musicbrainz-server/…

2020-11-10 31556, 2020

13:29 PM
BrainzBot

MBS-11126: Historic edits: display track lengths of 0 ms or -1 ms as unknown https://tickets.metabrainz.org/browse/MBS-11126

2020-11-10 31504, 2020

13:32 PM
sumedh has quit

2020-11-10 31517, 2020

13:36 PM
sumedh joined the channel

2020-11-10 31534, 2020

13:58 PM
Mr_Monkey

alastairp, pristine___ : I can do 12:00 Barcelona time feedback meeting

2020-11-10 31520, 2020

14:00 PM
alastairp

thanks

2020-11-10 31537, 2020

14:35 PM
v6lur_ has quit

2020-11-10 31540, 2020

14:54 PM
BrainzGit

[listenbrainz-server] MonkeyDo merged pull request #1173 (master…LB-717): LB-717: Return array instead of object when no feedback https://github.com/metabrainz/listenbrainz-server…

2020-11-10 31541, 2020

14:54 PM
BrainzBot

LB-717: Error thrown in loadFeedback function (React) https://tickets.metabrainz.org/browse/LB-717

2020-11-10 31559, 2020

14:59 PM
alastairp

yvanzo: hi, I'm looking at python image PR, perhaps easier to talk here, because I have a handful of questions

2020-11-10 31551, 2020

15:00 PM
alastairp

it looks like you are looking for an image labeled `metabrainz/python:x.y`, and if it exists, look for the created date, extract that out, and re-tag the version as `x.y-date.seq`

2020-11-10 31545, 2020

15:01 PM
alastairp

my initial feeling is that this is too complex for such a simple task. Why can we not just use `date` to get the current date, and use that as the date tag in the version?

2020-11-10 31511, 2020

15:02 PM
alastairp

we update these images very rarely. therefore I think that 95% of the time we're going to update all of them at once

2020-11-10 31536, 2020

15:02 PM
alastairp

I think that we should have no `:x.y` versions. Only `:x.y-date.seq`

2020-11-10 31535, 2020

15:12 PM
ruaok

http://loser.com/

2020-11-10 31536, 2020

15:12 PM
ruaok

lol

2020-11-10 31506, 2020

15:15 PM
ruaok

alastairp: 14400000 records inserted into a typesense index. time to see how well it works on real time data.

2020-11-10 31536, 2020

15:16 PM
alastairp

nice

2020-11-10 31547, 2020

15:16 PM
shivam-kapila

Wow the website

2020-11-10 31525, 2020

15:43 PM
niceplace has quit

2020-11-10 31502, 2020

15:45 PM
yvanzo

alastairp: we have :x.y already, we should probably not remove them as they are in use.

2020-11-10 31538, 2020

15:45 PM
yvanzo

Do you want to stop updating them?

2020-11-10 31506, 2020

15:46 PM
alastairp

I found your commit message where you explained what tags will be pushed. now I understand why you make each of the tags

2020-11-10 31534, 2020

15:46 PM
alastairp

yes, my proposal would be to keep :x.y until we move all projects to a specific tag with a date, and then delete them

2020-11-10 31507, 2020

15:47 PM
yvanzo

This is common practice (at Docker Hub) to update the version x to the latest x.y and so on.

2020-11-10 31535, 2020

15:48 PM
yvanzo

(This is why I made these tags as explained.)

2020-11-10 31535, 2020

15:49 PM
alastairp

yes, I see

2020-11-10 31535, 2020

15:50 PM
alastairp

OK, it's not too much of a problem. I think we should explain this in more detail in a readme (instead of just in the commit message), and also add to the readme a recommendation for what tag to use

2020-11-10 31532, 2020

15:51 PM
yvanzo

Right, I will update the README.md too then.

2020-11-10 31553, 2020

15:51 PM
alastairp

my main comment was that it seems like a lot of code for something that we only run one time every 2 years :)

2020-11-10 31505, 2020

15:52 PM
alastairp

but as ruaok mentioned to me yesterday, it's great to have solid tools

2020-11-10 31527, 2020

15:52 PM
yvanzo

alastairp: also "situations where we build such an image more than once in a day" are not frequent but should be made easy to deal with because it's often an emergency.

2020-11-10 31551, 2020

15:52 PM
yvanzo

It already happened for 3.6 btw.

2020-11-10 31503, 2020

15:53 PM
alastairp

yes, exactly

2020-11-10 31511, 2020

15:53 PM
alastairp

in fact, that's my next task

2020-11-10 31529, 2020

15:53 PM
alastairp

we have 2 base images with 2 different versions of consul, that work differently

2020-11-10 31527, 2020

15:54 PM
alastairp

I believe that's why there are the 2 versions, the earlier one has a newer version of consul, and we had to rebuild later with an older consul

2020-11-10 31539, 2020

15:54 PM
yvanzo

You mean 2 different flavors then?

2020-11-10 31552, 2020

15:54 PM
alastairp

not intentionally

2020-11-10 31506, 2020

15:55 PM
alastairp

eventually after my work we will only have the new version

2020-11-10 31530, 2020

15:55 PM
yvanzo

I did not take this possibility into account when writing the script.

2020-11-10 31548, 2020

15:55 PM
alastairp

don't worry about it. we should not support both flavours

2020-11-10 31514, 2020

15:56 PM
alastairp

my plan is to (for example) have 20201110 with -oldconsul, and then 20201115 with -newconsul, after I upgrade downstream projects

2020-11-10 31520, 2020

15:56 PM
alastairp

and then this will stop being a problem

2020-11-10 31506, 2020

15:58 PM
yvanzo

I can possibly add an optional argument to support appending such slug to the tag?

2020-11-10 31529, 2020

15:58 PM
alastairp

I would prefer not to. I think it's additional complexity that we are going to remove soon anyway

2020-11-10 31542, 2020

15:58 PM
yvanzo

Ok

2020-11-10 31530, 2020

16:13 PM
alastairp

https://usercontent.irccloud-cdn.com/file/boqpVuf…

2020-11-10 31534, 2020

16:13 PM
alastairp

https://usercontent.irccloud-cdn.com/file/UNw0fB4…

2020-11-10 31537, 2020

16:13 PM
alastairp

https://usercontent.irccloud-cdn.com/file/Q8ZJvB5…

2020-11-10 31512, 2020

16:14 PM
yvanzo

:)

2020-11-10 31523, 2020

16:14 PM
alastairp

MTG work using acousticbrainz mood data + discogs genre links

2020-11-10 31546, 2020

16:14 PM
alastairp

"find things in genre x that evoke mood y, and are kind of close to each other"

2020-11-10 31520, 2020

16:16 PM
yvanzo

alastairp: Your simplified tagging scheme iiuc: just push 'x.y-date' (common case); If it already exists, push 'x.y-date.increment' instead. Never override existing tags. Remove them by hand once unused.

2020-11-10 31534, 2020

16:17 PM
ruaok

alastairp: neat!

2020-11-10 31538, 2020

16:17 PM
yvanzo

well, maybe I should just stop rethinking this since it works already.

2020-11-10 31536, 2020

16:18 PM
alastairp

yeah, I think that clear documentation is the most important thing at the moment

2020-11-10 31530, 2020

16:19 PM
alastairp

but yes, in my view I think that what you just described would be clearer, but I don't feel strongly that it must be changed

2020-11-10 31516, 2020

16:27 PM
alastairp

ruaok: it's a perfect usecase for another element. I'm sure I told them to use solr, but they decided to use ES :) I remember that you and I talked about having direct search for AB data in addition to the annoy stuff - "give me everything that matches x and y"

2020-11-10 31541, 2020

16:27 PM
alastairp

I'll see if we can improve it and release it on bono

2020-11-10 31504, 2020

17:01 PM
reosarevok

" [#metabrainz] Welcome to #MetaBrainz! This channel has taken over from #musicbrainz-devel, #bookbrainz, #bookbrainz-devel, and #musicbottle - so don't despair if you don't know how you ended up here. Just sit down, have a cup of tea, put your feet up, and feel right at home. :)"

2020-11-10 31514, 2020

17:01 PM
reosarevok

Given we have #bookbrainz again, maybe it's time to change that?

2020-11-10 31526, 2020

17:01 PM
reosarevok

Freso: ^