in #metabrainz

1:53 AM
niceplace has quit
1:53 AM
niceplace joined the channel
3:29 AM
niceplace has quit
3:30 AM
niceplace joined the channel
8:14 AM
CatQuest has left the channel
8:17 AM
CatQuest joined the channel
8:42 AM
CatQuest has quit
8:42 AM
cats has quit
8:42 AM
aidanlw17 has quit
8:42 AM
kuno has quit
8:42 AM
cats joined the channel
8:43 AM
aidanlw17 joined the channel
8:43 AM
kuno joined the channel
8:47 AM
CatQuest joined the channel
9:01 AM
akhilesh

Moin!
9:05 AM
Gazooo has quit
9:06 AM
Gazooo joined the channel
9:58 AM
pristine__

ruaok: hey. moin
9:58 AM
ruaok

moin moin!
9:59 AM
I haz coffee. can chat.
10:02 AM
pristine__

Nice
10:03 AM
So we need some directories in HDFS for storing models, dataframes and other things
10:03 AM
Shouldn't we have a manage.py for them like we have it for dbs in LB server?
10:05 AM
So that before we start with anything we have required directories in there
10:05 AM
But this is for local setup, I am not sure how it works when running on server
10:06 AM
ruaok

> But this is for local setup, I am not sure how it works when running on server
10:07 AM
can you elaborate what you mean by this?
10:11 AM
pristine__

I mean we won't be running manage.py on server right? It would drop the existing directories and create new one, and we will lost our data.
10:12 AM
How do we ensure that all required directories are already created and we dont have to explicitly do mkdir from the scripts like train_model, create_dataframe etc
10:15 AM
Lose*
10:17 AM
ruaok

which server?
10:17 AM
I would create a manage.py script, as you suggested, that we run on leader.
10:17 AM
and if the directories already exist, stop doing anything.
10:20 AM
pristine__

So we need to add it's path in start_master.sh?
10:20 AM
So whenever we restart it executes?
10:25 AM
ruaok

no, it should be a separate scrip that we invoke when we setup a new cluster...
10:26 AM
speaking on which, I think I am going to order 8 dedicated machines from hetzner for a dedicated cluster.
10:26 AM
that is always on.
10:26 AM
akhilesh

Mr_Monkey: hi!
10:27 AM
ruaok

and I think that in order to bootstrap that, all we need to do is add the new nodes to the swarm, let them replicate and then get rid of the old nodes.
10:31 AM
pristine__

Swarm?
10:34 AM
> Order 8 dedicated machines from hetzner
10:34 AM
?
10:40 AM
ruaok

our current cluster of 4 cloud instances are not particularly good. renting dedicated machines will make our data processing stuff a lot faster.
10:41 AM
and since our machines are in a docker swarm, we can extend the cluster, let everything sync, then told off the old cloud instances.
10:44 AM
pristine__

Okay. I am getting it. I will read cluster.py and I am sure I will have some doubts, I will ping you
10:48 AM
reosarevok

iliekcomputers, ruaok: please, can we up the priority of decoupling LB usernames and profiles? I have another case of someone whose username changed in MB but I guess we can't change in LB. Unless you have a mechanism even if it is by hand-editing the DB? :)
10:49 AM
ruaok

we now have a mechanism. but it isn't automated.
10:49 AM
open a ticket for the specific user. we have a ticket for decoupling the usernames? if so, make it a higher priority and assign to iliekcomputers
10:53 AM
reosarevok

https://tickets.metabrainz.org/browse/LB-467
10:53 AM
BrainzBot

LB-467: Rename user budkin to treeshateorcs
10:54 AM
reosarevok

I see it doesn't automatically assign it to anyone, should I?
10:54 AM
ruaok

yes, plz
10:54 AM
reosarevok

yes meaning "yes, to iliekcomputers" as well? :)
10:54 AM
https://tickets.metabrainz.org/browse/LB-383 is the decoupling one, will assign
10:55 AM
BrainzBot

LB-383: Allow updating usernames when they're changed in MusicBrainz
11:08 AM
CatQuest

why did I read tree shat orcs first ¬_¬
11:11 AM
reosarevok

Maybe it did. Maybe it did.
11:28 AM
Mr_Monkey

Hi akhilesh ! Had some questions?
11:33 AM
akhilesh

Mr_Monkey: Resolved now, but your help will require you create entities for browse endpoint tests.
11:36 AM
CatQuest

oh shi waa supposed to ytest that app. right
11:39 AM
Mr_Monkey

Sure akhilesh, let me know what you need
11:39 AM
CatQuest

amCap1712: i tested it!
11:39 AM
as expected the interface is... lacking :D
11:39 AM
however i came across o crashing type bugs
11:40 AM
no*
11:40 AM
D4RK-PH0ENiX has quit
11:40 AM
the "fingerprint" option seems to work as exepcted! it shows some generally logical selection
11:42 AM
akhilesh

Mr_Monkey: I will try to complete tests today for browse requests, then I will inform you.
11:42 AM
Mr_Monkey

OK
11:43 AM
CatQuest

a problem is "metadata" button. it shows only random selections? (in any case i have yet to find anythingu se full for it)
11:43 AM
it's not clear that saving works, (and I see no copied file in a folder underneath)
11:49 AM
(feature requests: be able to manually edit tags, be able to select tags to "keep old tag", selecting more than one file at a time(!), in reference to that; be able to select which release in a release group to tag with)
11:49 AM
especially this for coverart
11:49 AM
ok, keep up the good work for now!
11:55 AM
D4RK-PH0ENiX joined the channel
11:57 AM
amCap1712

CatQuest: hi
11:58 AM
i know the problem with metadata but am unable to find the root cause
11:59 AM
CatQuest: also the file is saved in the device public storage directory
11:59 AM
D4RK-PH0ENiX has quit
12:03 PM
D4RK-PH0ENiX joined the channel
12:13 PM
iliekcomputers has quit
12:13 PM
iliekcomputers joined the channel
12:29 PM
BrainzGit

[listenbrainz-labs] vansika closed pull request #44 (master…develop-sh): Modify develop.sh to use a single command for building services and starting containers https://github.com/metabrainz/listenbrainz-labs...
12:37 PM
ruaok

pristine__: are you using the cluster right now or can I test some stuff?
12:37 PM
(that will require me starting/stopping the cluster.)
12:38 PM
pristine__

I am not using the cluster.
12:38 PM
Go ahead :)
12:42 PM
ruaok

ok
12:46 PM
travis-ci joined the channel
12:46 PM
travis-ci

metabrainz/picard#4780 (master - 41c481e : Philipp Wolfer): The build passed.
12:46 PM
Change view : https://github.com/metabrainz/picard/compare/06...
12:46 PM
Build details : https://travis-ci.org/metabrainz/picard/builds/...
12:46 PM
travis-ci has left the channel
13:35 PM
spuniun has quit
13:53 PM
pristine__

ruaok: people who want to set up lb labs on their local machine won't be running a script for setting up a cluster?
13:53 PM
They just do develop.sh build, right?
13:54 PM
And install depedencies as stated in readme
13:54 PM
ruaok

well, local machine != cluster
13:54 PM
so, really it is up to us to decide what to do and how to do it. but it might be a while before someone follows our steps and setups up their own cluster
13:56 PM
pristine__

Yes. So ideally manage.py for them should be invoked with develop.sh
13:56 PM
And on leader with setup_cluster.py
13:56 PM
no?
13:57 PM
My next PR involves making a new directory so I want to get it right.
13:59 PM
ruaok

setup_cluster will not be supported anymore.
13:59 PM
once we setup our cluster of dedicated machines, we'll leave the cloud instances behind.
14:00 PM
and create_cluser will not be useful.
14:00 PM
I think manage.py should just be invoked separately from develop.sh
14:01 PM
pristine__

Okay. We can include thay in readme to invoke it separately?
14:01 PM
What do you mean by cloud instances here if I may ask?
14:02 PM
pristine__ apologizes for asking so many questions
14:13 PM
ruaok

right now we rent 4 servers from hetzner cloud. those isntances are not very good for our needs.
14:14 PM
the classic hetzner service allows us to rent a dedicated server where we have full control.
14:14 PM
no one can take CPU power away from us.
14:17 PM
pristine__

Okay.
14:18 PM
So you remember about the error I posted saying I am not able to resolve because it says check driver logs and RPC disassociated all the time?
14:19 PM
ruaok

sort of yes?
14:19 PM
the cluster is up again. can you run a quick test program that runs one simple spark query to see if things are working?
14:20 PM
pristine__

A sec
14:27 PM
https://gist.github.com/vansika/cb54ca9ccec7367...
14:27 PM
ruaok: ^
14:27 PM
ruaok

erp, not good.
14:28 PM
pristine__

hmm
14:31 PM
ruaok

try again?
14:33 PM
pristine__

A sec
14:34 PM
same error
14:45 PM
ruaok: I was going out to buy something, do you need me around? I can go after some time
14:45 PM
ruaok

go, I'll keep playing.
14:45 PM
its there a command line you can give me to run what you run?
14:46 PM
pristine__

I just ran create_dataframes.
14:47 PM
./spark-submit manage.py create_dataframes
14:47 PM
ruaok

thx
14:47 PM
pristine__

:)
14:48 PM
I will tell you about the error and sol later at night. See ya.
14:56 PM
yvanzo

ruaok: sir test vm is unresponsive again, can you please restart it?
15:00 PM
ruaok

on it
15:04 PM
spuniun joined the channel
15:07 PM
yvanzo: 104.197.183.152
15:22 PM
yvanzo

ruaok: thank you!
15:25 PM
ruaok

pristine__: looks better now, but the script ran into a different error.
15:25 PM
have a look when you return?
15:26 PM
pristine__

Okay
15:29 PM
ruaok

I hope it works. I think the configuration is much more sane now.
15:45 PM
yvanzo

ruaok: is it possible to upgrade sir test vm?
15:52 PM
ruaok

how much work would it be for you to start with a new VM?
15:52 PM
I can give you one from azure..
15:53 PM
but if that is a pain, then I will resize the current one.
15:53 PM
either way, let me know desired specs
16:00 PM
yvanzo

ruaok: it will be easy to start with a new one, possibly with 16 threads?
16:00 PM
(or 8 threads at least)