in #metabrainz

13:35 PM
mayhem

ok, I'll check the logs just in case.
13:36 PM
lucifer

cool
13:37 PM
mayhem

indeed, seems quiet.
13:44 PM
ok, cron restarted.
13:46 PM
nibu joined the channel
13:47 PM
Khagan joined the channel
14:04 PM
nibu has quit
14:07 PM
monkey

mayhem: I forgot to save a file when I made my last commit to the redesign branch. I'll rebuild and redeploy on beta.
14:07 PM
mayhem

ok
14:20 PM
zas

atj: consul is deployed as containers atm but I was looking at https://github.com/ansible-community/ansible-co...
14:21 PM
deploying consul via ansible would be an improvement, it does have to run on docker, an instance (client and/or server) needs to run on nodes anyway. What I suggest is to try an Ansible deployment of a 3-nodes cluster, using 10.10.10.x network, and few clients deployments (consul agent)
14:23 PM
atj

zas: ok we can look at doing a test deployment. i had a brief scan of the readme and there's a few things that concern me, e.g. "it does not currently concern itself (all that much) with performing ongoing maintenance of a cluster" and the OS support matrix only listing Ubuntu 16.04
14:23 PM
zas

when you have time, please go through this module documentation and https://github.com/metabrainz/docker-server-con... so we can discuss pros & cons of this approach
14:23 PM
yup, dunno (yet) how much those points are impacting us
14:24 PM
atj

i'm going to try and merge the borgmatic PR today and migrate the settings from the borg-backup repo
14:24 PM
zas

that said, the idea of managing this part of the infrastructure (consul-stuff) with Ansible makes sense to me
14:24 PM
ok great, keep me updated
14:26 PM
in practice, it means we'll run 2 consul clusters in parallel for a while, new install shouldn't interfere with old docker-based one
14:28 PM
new containers will be encouraged to use the new cluster, while we deprecate the old one, slowly moving from 10.2.2.x to 10.10.10.x which will offer more freedom (not being limited to be physically in same place)
14:30 PM
currently, AFAIK, consul network gets changes only from 2 sources: gitzconsul & serviceregistrator. The first one basically converts a part of docker-servers-config files to consul storage, the second registers running containers (mainly for openresty to autoconfigure http forwarding on gateways)
14:31 PM
both can be executed along old instances but pointing at new consul cluster instead
14:31 PM
then app containers can choose to use one or the other, until we totally remove the old cluster
14:32 PM
it should be noted they might be changes at docker-template level too (so likely rebuilding containers)
14:33 PM
iconoclasthero joined the channel
14:33 PM
but the target is to make lucifer happy at this point.
14:45 PM
BrainzGit

[listenbrainz-server] 14mayhem closed pull request #2431 (03master…tooltip): Tooltip https://github.com/metabrainz/listenbrainz-serv...
14:47 PM
mayhem

arsh: you about?
14:48 PM
jivte joined the channel
14:52 PM
jivte

mayhem: In my proposal should I add code snippets or directly link the closed PR
14:52 PM
mayhem

link is fine
14:52 PM
jivte

and a image could of the feature could be added too :)
14:53 PM
or :|
14:54 PM
mayhem

as you wish, really.
14:54 PM
jivte

Okk thanks for the help :)
14:55 PM
mayhem

np
15:09 PM
lucifer

mayhem: looking into the IA stuff, there's an audio collection that seems to encompass all audio files of the archive.
15:10 PM
however, 1) not all of it is music. 2) the search is slow and does not seem to be recursive.
15:10 PM
mayhem

yea, slow is pretty characteristic of the IA services. sadly.
15:11 PM
but I suppose we could just crawl the collection and build a content resolver from it.
15:11 PM
lucifer

i think a content resolver like local cache will have to be the way
15:11 PM
lol. yup
15:11 PM
mayhem nods
15:13 PM
jivte has quit
15:15 PM
another thing, in favor of a local content resovler is that. most of the music on LB won't be from IA so not very sensible to make slow queries to IA for each track.
15:16 PM
mayhem

yep.
15:24 PM
jivte joined the channel
15:27 PM
jivte

monkey: About that add album feature could you help in improving some design mockups :)
15:28 PM
monkey

Hi jivte! I don't have time available until the 11th I'm afraid
15:30 PM
jivte

That's sad I will ask aerozol for help :)
15:31 PM
jivte asking help sometimes turn into disturbance
15:31 PM
:|
15:31 PM
monkey

He should be around in a few hours
15:37 PM
jivte has quit
15:42 PM
jivte joined the channel
15:42 PM
rdswift

There appears to be a problem with voting down an edit in MusicBrainz. See the ModBot note on https://musicbrainz.org/edit/98126658
15:57 PM
mayhem

lucifer: ping
15:57 PM
lucifer

hi
15:58 PM
mayhem

monkey and I realized that we have another project that we'd need your help on.
15:58 PM
lucifer

sure, what is it?
15:58 PM
mayhem

you know how the CF results have a last listened timestamp? I need those for all tracks for all users.
15:58 PM
lucifer

we have that already stored in spark.
15:58 PM
mayhem

not just CF tracks -- so in artist radio I need to know if the user has listened to a track recently.
15:58 PM
Khagan has quit
15:58 PM
lucifer

where and how often updated do you need it?
15:59 PM
mayhem

for *all* tracks and *all* users ?
15:59 PM
lucifer

yup
15:59 PM
CF joins with that data to add it to recs.
15:59 PM
mayhem

ha great!
15:59 PM
well, so troi needs it.
15:59 PM
which means that when we move this feature over to spark, that will be easy.
16:00 PM
lucifer

yup indeed
16:00 PM
mayhem

monkey also needs it and he would like to show play count of a given track and when the user last listened to it.
16:00 PM
I suppose adding playcounts isn't too hard in this case, right?
16:00 PM
lucifer

yup should be easy.
16:01 PM
so need to store in couchdb i think.
16:01 PM
mayhem

it seems like it is another case of needing to take data from spark and move it to PG then API it.
16:01 PM
lucifer

updated weekly?
16:01 PM
mayhem

not good enough.
16:01 PM
lucifer

yeah i guess PG would be better in this case.
16:01 PM
mayhem

really, i am realizing, we need data set hoster for HDFS
16:01 PM
lucifer

so that we can query both ways by user and by artist.
16:02 PM
mayhem

I really dislike taking data from spark moving it to PG just to serve it.
16:02 PM
lucifer

that would mean exposing spark cluster to web though.
16:02 PM
mayhem

is there any way that we can make APIs on top of spark? seems no, because of a mistmatch in approaches.
16:02 PM
-t
16:02 PM
lucifer

i think we can but i don't think it would be fast enough.
16:02 PM
mayhem

lets ignore that problem for a second.
16:03 PM
lucifer

there are other techs built on top of spark for such querying i think
16:03 PM
mayhem

yeah. seem unlikely to work well.
16:03 PM
oh? that might be worth exploring.
16:03 PM
lucifer

yeah many even support realtime querying and stuff.
16:04 PM
we could get listens in realtime through rmq to spark
16:04 PM
mayhem

I think this is really worth exploring.
16:05 PM
lucifer

sounds good
16:06 PM
mayhem

I mean, if we could leave the stats data in HDFS and not have to move it around, that would be great.
16:06 PM
well, true of all of the datasets we shovel back and forth.
16:07 PM
lucifer

indeed indeed
16:11 PM
arsh

Hi mayhem: I hope you had a chance to look at the document I sent. Do you any thoughts or ideas that you like that I could build on. Thanks
16:11 PM
mayhem

yes, I do.
16:11 PM
let me finish an email and get back to you.
16:11 PM
arsh

Sure
16:17 PM
Shelly joined the channel
16:17 PM
mayhem

hey arsh!
16:18 PM
arsh

hello
16:18 PM
mayhem

so, the first bit of feedback is that we dont have access to artist images.
16:18 PM
we used to but then some asshole sued us and ruined the party. very long story.
16:18 PM
Shelly

I am currently on master branch and running the listenbrainz-server on local but there's an sql error "psycopg2.errors.UndefinedColumn: column "external_user_id" does not exist
16:18 PM
listenbrainz-web-1 | LINE 6: , external_user_id
16:18 PM
"
16:19 PM
arsh

oh i see
16:19 PM
mayhem

so, your designs need to use no images.
16:19 PM
and I looked at your mock-ups and idea 3 jumps out at me.
16:20 PM
Shelly

Can someone please look into it beacuse when i git pull from master my current branch broke.=L
16:20 PM
lucifer

Shelly: `./develop.sh psql` then execute `ALTER TABLE external_service_oauth ADD COLUMN external_user_id TEXT;`
16:20 PM
arsh

Yeah I feel thats the best because it can work well without images
16:20 PM
mayhem

most of the data needed for the data on the right hand side of idea 3, we have.
16:20 PM
and I find this is a useful layout to go with for such a discovery tool.
16:21 PM
thus, I would recommend that you pick idea 3 and make a full proposal from that.
16:21 PM
arsh

Sure, will do
16:22 PM
mayhem

good luck!
16:22 PM
arsh

And should I make a figma mock for the same
16:22 PM
mayhem

figma would be ideal.
16:22 PM
if you get chosen for this project, you'd be working with aerozol to prove out your design to make sure it meets our new guidelines and all that.
16:23 PM
but the initial designs are important to see how you work.
16:23 PM
Shelly

lucifer: Thanks its working now :)
16:23 PM
mayhem

but, it would also be really good if you could submit a PR to LB, so we can see your work.
16:23 PM
which is a lot to do in the short time there is left.
16:24 PM
arsh

Ok I will try to get as much done to given you the best idea.
16:24 PM
for the PR what should I submit only the mockups or something additional
16:27 PM
Shelly

mayhem: I have implemented the thumb scroll as per figma designs. Should I create a PR for that or just put that on my GSOC application.
16:27 PM
mayhem

the PR should be for something that fixes an issue in LB. something that shows us how you code and how we would end up interacting.
16:28 PM
Shelly: PR please. we won't merge that PR, but we then learn how you work.
16:28 PM
arsh

Oh I see, I had also fixed some documentation for LB and submitted a PR for the same couple days ago. It was for the ticket LB-861.
16:28 PM
BrainzBot

LB-861: Add authorization section to API docs https://tickets.metabrainz.org/browse/LB-861
16:28 PM
mayhem

oh yes. I was very suspicious of that one.
16:28 PM
looked too clean and well informed. :)
16:29 PM
arsh

Yeah I wanted to add that section so beginners like me can navigate the docs better.
16:30 PM
mayhem

ok, good start. focus on the proposal. get that solid, then see if you can do a code PR, ok?
16:30 PM
arsh

Yeah sounds good.
16:32 PM
BrainzGit

[listenbrainz-server] 14Shelly011s opened pull request #2435 (03master…tooltip): Tooltip https://github.com/metabrainz/listenbrainz-serv...
16:35 PM
Shelly has quit
16:39 PM
lucifer

mayhem: ready to discuss ds hoster format if you are
16:39 PM
mayhem

gsoc has left my brain fried.
16:39 PM
can we do it tomorrow?
16:39 PM
lucifer

sure
16:39 PM
this is also gsoc-adjacent anyway ;)
16:40 PM
mayhem

I'm gonna look at one more thing and go home and stay offline for a bit.
16:40 PM
lucifer

sounds good
16:41 PM
mayhem

thx
16:48 PM
BrainzGit

[listenbrainz-server] 14mayhem closed pull request #2435 (03master…tooltip): Tooltip https://github.com/metabrainz/listenbrainz-serv...