#metabrainz

/

0:21 AM
chaban has quit

2019-07-25 20616, 2019

0:23 AM
chaban joined the channel

2019-07-25 20600, 2019

1:14 AM
ayerhart joined the channel

2019-07-25 20627, 2019

1:30 AM
D4RK-PH0ENiX has quit

2019-07-25 20627, 2019

1:30 AM
Nyanko-sensei joined the channel

2019-07-25 20612, 2019

4:27 AM
Lotheric_ joined the channel

2019-07-25 20632, 2019

4:27 AM
Lotheric has quit

2019-07-25 20653, 2019

4:38 AM
Lotheric_ is now known as Lotheric

2019-07-25 20650, 2019

4:59 AM
travis-ci joined the channel

2019-07-25 20650, 2019

4:59 AM
travis-ci

Project bookbrainz-data-js build #1181: passed in 1 min 49 sec: https://travis-ci.org/bookbrainz/bookbrainz-data-…

2019-07-25 20650, 2019

4:59 AM
travis-ci has left the channel

2019-07-25 20602, 2019

5:30 AM
Darkloke joined the channel

2019-07-25 20643, 2019

5:33 AM
Darkloke

Hi2All. My question is a little bit offtopic, but i am just asking here, cause here are developers and may be u could give me some advice for the noob like me. I am trying to make a java script, which could parse GEMA PRO database and extract track names with details. PM me if u are interested please.

2019-07-25 20602, 2019

6:26 AM
pristine__

ruaok: moin. Can we create a tunnel through worker nodes. Stderr and stdout for workers are visible on 8081

2019-07-25 20658, 2019

7:38 AM
ruaok

moin!

2019-07-25 20610, 2019

7:39 AM
ruaok

visible on port 8081 inside the container?

2019-07-25 20653, 2019

7:44 AM
pristine__

We have created a tunnel for port 4040 running in listenbrainz jobs on leader. This is the Spark UI. Now in this UI there are tabs (stderr, stdout) for every executor to view executor logs. When I click on this, a window opens, and something like *10.x.x.5x:8081.xxxxx* is written in address bar but nothing is displayed.

2019-07-25 20612, 2019

7:45 AM
pristine__

So I guess we need a tunnel for 8081.

2019-07-25 20625, 2019

7:48 AM
ruaok

ok, yes that means a tunnel into a different machine and then into the container. hmm. I'll have to ponder this.

2019-07-25 20654, 2019

7:49 AM
pristine__

Yeah. We just want executor logs to debug. Now, by default spark logs in spark_home/logs. On leader, spark/logs in empty. Idk why. I have spent days on this but idk.

2019-07-25 20619, 2019

7:50 AM
pristine__

By default workers store stderr and stdout in spark_home/work

2019-07-25 20645, 2019

7:50 AM
pristine__

Can you log in to one of the worker and check /usr/local/spark/work

2019-07-25 20647, 2019

7:50 AM
pristine__

?

2019-07-25 20645, 2019

7:51 AM
ruaok

that might be easier.

2019-07-25 20648, 2019

7:51 AM
ruaok

any worker?

2019-07-25 20608, 2019

7:52 AM
pristine__

I have created a requested to join spark users list, I can ask about empty logs there. UI serves the same purpose, but it will stop once the job stops.

2019-07-25 20612, 2019

7:52 AM
pristine__

Yeah. Any worker.

2019-07-25 20655, 2019

7:56 AM
pristine__

But I still say that we should try for workers UI. It is better to apprehend. The stored logs (which must be cleaned regularly) can help us too see error history even if the job has ended. Whislt the job is running, UI can be great.

2019-07-25 20626, 2019

7:59 AM
ruaok

lets see if the files are useful first.

2019-07-25 20642, 2019

7:59 AM
ruaok

on leader in /home/vansika is worker-logs.zip

2019-07-25 20619, 2019

8:03 AM
pristine__

A sec

2019-07-25 20618, 2019

8:12 AM
pristine__

ruaok: can you go to /usr/local/spark and send me a screenshot

2019-07-25 20642, 2019

8:12 AM
pristine__

And then /usr/local/spark/work and a screenshot

2019-07-25 20610, 2019

8:13 AM
ruaok

you have all of the contents of the work directory in the zip file. did the zip file not work?

2019-07-25 20642, 2019

8:13 AM
pristine__

No it worked. I just want to check somethin

2019-07-25 20656, 2019

8:13 AM
ruaok

https://usercontent.irccloud-cdn.com/file/JYHQ4h3…

2019-07-25 20601, 2019

8:17 AM
pristine__

Cool. What about app I ran yesterday? Are they stored on other workers? We don't have logs of previous week/month. Are they cleaned up automatically?

2019-07-25 20609, 2019

8:17 AM
pristine__

So many questions. Lol

2019-07-25 20656, 2019

8:17 AM
ruaok

I presume the other workers have similar files. and I would expect them to get cleaned up after x days, which is likely configurable in some config file.

2019-07-25 20602, 2019

8:18 AM
ruaok

is the output useful?

2019-07-25 20652, 2019

8:19 AM
pristine__

Loads of it. I will go through it and get back to you

2019-07-25 20605, 2019

8:20 AM
pristine__

Can we discuss a lil about cleaning up models?

2019-07-25 20615, 2019

8:20 AM
pristine__

(thanks for the zip)

2019-07-25 20630, 2019

8:20 AM
ruaok

sure.

2019-07-25 20653, 2019

8:21 AM
pristine__

So i was thinking, whenever we save a model (which will be after months most probably) should we clean up the previous one(s)?

2019-07-25 20628, 2019

8:22 AM
ruaok

I wonder what our thinking here should be.

2019-07-25 20641, 2019

8:22 AM
ruaok

clean up by default and mark others as "saved, in use"?

2019-07-25 20656, 2019

8:22 AM
ruaok

or manually clean up and only carefully delete items.

2019-07-25 20634, 2019

8:24 AM
ruaok

or maybe, just keep the latest one? I guess the question is how we specify which module to use for recommendations.

2019-07-25 20647, 2019

8:24 AM
pristine__

The latest one

2019-07-25 20653, 2019

8:24 AM
pristine__

I guess

2019-07-25 20619, 2019

8:25 AM
ruaok

perhaps, this question is premature.

2019-07-25 20625, 2019

8:25 AM
pristine__

If we dont delete, we may run out of space in time.

2019-07-25 20637, 2019

8:25 AM
ruaok

it is a good question, but we're not fully certain how we're going to use data yet.

2019-07-25 20641, 2019

8:25 AM
ruaok

we will.

2019-07-25 20604, 2019

8:26 AM
ruaok

how about we do something simple to start with and simply keep the X latest models, but delete everything else?

2019-07-25 20619, 2019

8:26 AM
pristine__

Cool. So maybe till we are sure about it, i will manual delete all the models which are created while testing.

2019-07-25 20631, 2019

8:26 AM
pristine__

Yes. Sound good.

2019-07-25 20633, 2019

8:26 AM
ruaok

ok

2019-07-25 20607, 2019

8:27 AM
pristine__

What should X be?

2019-07-25 20644, 2019

8:27 AM
ruaok

7?

2019-07-25 20606, 2019

8:29 AM
pristine__

Ummm. It actually depends on how much data we are using for training. Like for around 6 months, consider one gb. 1*7 = 7gb

2019-07-25 20613, 2019

8:29 AM
pristine__

7*3 = 21gb

2019-07-25 20618, 2019

8:29 AM
pristine__

After replication.

2019-07-25 20614, 2019

8:30 AM
pristine__

Also, we should have a json or parquet for storing matadata about models (on which data it eas trained, when trained size etc etc.)

2019-07-25 20632, 2019

8:30 AM
pristine__

Maybe 4 to start with.

2019-07-25 20641, 2019

8:30 AM
ruaok

:)

2019-07-25 20609, 2019

8:31 AM
pristine__

I was just thinking, why would we require folder models? Could not get to an answer?

2019-07-25 20617, 2019

8:31 AM
pristine__

Older*

2019-07-25 20648, 2019

8:31 AM
ruaok

we may find a model that works well and put it into production.

2019-07-25 20602, 2019

8:32 AM
ruaok

but at the same time we will want to continue evolving models.

2019-07-25 20619, 2019

8:32 AM
pristine__

Oh. Right.

2019-07-25 20626, 2019

8:32 AM
ruaok

we need to keep at least one around for production. possible keep more around for various production scenarious.

2019-07-25 20624, 2019

8:33 AM
pristine__

This project is wow. Everything has to be done from scratch, so much brainstorming. Yay! Thank you <3

2019-07-25 20602, 2019

8:34 AM
ruaok

I know the feeling. part of it is exciting, part of it is tiring. but it has been tiring for 20 years, so I am used to it.

2019-07-25 20618, 2019

8:35 AM
pristine__

I have never done something like this before. I like it.

2019-07-25 20639, 2019

8:35 AM
pristine__

Better than going to college

2019-07-25 20642, 2019

8:35 AM
pristine__

Lol.

2019-07-25 20620, 2019

8:36 AM
pristine__

I am on bunk today😎

2019-07-25 20627, 2019

8:38 AM
CatQuest

ruaok: https://xkcd.com/2180

2019-07-25 20620, 2019

8:41 AM
CatQuest

(esp the alt text)

2019-07-25 20602, 2019

8:42 AM
ruaok

yes, indeed.

2019-07-25 20616, 2019

8:42 AM
ruaok

if people make stupid requests, they get stupid macros. lol

2019-07-25 20649, 2019

8:43 AM
pristine__

ruaok: one consumer was running on the worker from where you got the zip?

2019-07-25 20653, 2019

8:43 AM
pristine__

Container*

2019-07-25 20600, 2019

8:44 AM
pristine__

(stupid autocorrect)

2019-07-25 20627, 2019

8:44 AM
CatQuest

"statistics in the last 30 days until today" <-- thats the one I'd want

2019-07-25 20659, 2019

8:45 AM
CatQuest

I mean like asking forstatistics for "ah month" is useful for looking at last years what did i listen to in january

2019-07-25 20628, 2019

8:46 AM
CatQuest

but for "the last month" nothng speial has changed between 1st of july and 31st of june

2019-07-25 20606, 2019

9:04 AM
BestSteve has quit

2019-07-25 20658, 2019

9:06 AM
Gazooo joined the channel

2019-07-25 20612, 2019

9:10 AM
BestSteve joined the channel

2019-07-25 20616, 2019

9:23 AM
ruaok

pristine__: I didn't carefully check to see, sorry.

2019-07-25 20627, 2019

9:23 AM
ruaok

are the logs useful. do we need to find a way to get them to you?

2019-07-25 20648, 2019

9:28 AM
pristine__

Should i get back to you around at night? I have a test at 3:30 so i did not look at them. Each stderr is too big and there are around 8 of them.

2019-07-25 20603, 2019

9:29 AM
pristine__

Around 9 IST*

2019-07-25 20651, 2019

9:29 AM
pristine__

Check the containers whenever you can :)

2019-07-25 20656, 2019

9:43 AM
BestSteve has quit

2019-07-25 20604, 2019

9:45 AM
ruaok

ok, good luck on the test.

2019-07-25 20612, 2019

9:45 AM
ruaok

what should I check in the containers?

2019-07-25 20626, 2019

9:45 AM
BestSteve joined the channel

2019-07-25 20637, 2019

10:06 AM
yvanzo

Mr_Monkey: there currently are 10 open SEC tickets related to BB, see https://tickets.metabrainz.org/issues/?filter=115…

2019-07-25 20647, 2019

10:14 AM
ruaok

ohhh, a spanking by the security czar. bad news.

2019-07-25 20649, 2019

10:15 AM
yvanzo

Would it be alright to archive (making read-only) abandoned repositories such as https://github.com/metabrainz/xmpp-messaging-serv… ?

2019-07-25 20600, 2019

10:16 AM
ruaok

yes, please.

2019-07-25 20636, 2019

10:16 AM
yvanzo

Alright, it can be unarchived at any time.

2019-07-25 20603, 2019

10:17 AM
ruaok

spellew: ping

2019-07-25 20619, 2019

10:21 AM
Mr_Monkey

Thanks yvanzo ! I was away for a bit and they piled up. I'll look at them.

2019-07-25 20646, 2019

10:24 AM
Nyanko-sensei has quit

2019-07-25 20641, 2019

10:31 AM
yvanzo

There might false positive, e.g. 80% of alerts on MBS were not applicable.

2019-07-25 20607, 2019

10:37 AM
yvanzo

ruaok: the proper way to address SEC-40 is to go to https://github.com/metabrainz/listenbrainz-server… and to dismiss as “A fix has already been started”

2019-07-25 20608, 2019

10:37 AM
BrainzBot

SEC-40: [listenbrainz-server] CVE-2019-10744: lodash < 4.17.13 https://tickets.metabrainz.org/browse/SEC-40

2019-07-25 20628, 2019

10:37 AM
ruaok

ah, thanks.

2019-07-25 20643, 2019

10:37 AM
ruaok

done

2019-07-25 20635, 2019

10:38 AM
yvanzo

yup, now closed :)

2019-07-25 20654, 2019

10:38 AM
D4RK-PH0ENiX joined the channel

2019-07-25 20650, 2019

10:41 AM
yvanzo

bitmap: I guess design-system alerts can be all dismissed as “Vulnerable code is not actually used”

2019-07-25 20648, 2019

10:42 AM
yvanzo

See https://github.com/metabrainz/design-system/netwo… and https://tickets.metabrainz.org/issues/?filter=115…

2019-07-25 20616, 2019

10:43 AM
D4RK-PH0ENiX has quit

2019-07-25 20643, 2019

10:45 AM
D4RK-PH0ENiX joined the channel

2019-07-25 20610, 2019

10:57 AM
spellew

ruaok: o/

2019-07-25 20643, 2019

10:57 AM
spellew

If this is about my passport expiring soon, I was planning on getting it renewed

2019-07-25 20648, 2019

11:00 AM
ruaok

no, the prices for the flight keep changing. and not for the better. :(

2019-07-25 20644, 2019

11:02 AM
ruaok

but, do get the passport renewed, don't wait.

2019-07-25 20636, 2019

11:05 AM
ruaok

this one has a one night stop in DUB, but with a 91€ hotel, it still is 110€ cheaper than the shorter flight without a stop

2019-07-25 20636, 2019

11:05 AM
ruaok

https://flights.app.goo.gl/QHV8G

2019-07-25 20601, 2019

11:06 AM
ruaok

could also be done leaving sunday evening.

2019-07-25 20646, 2019

11:06 AM
zas

ruaok: can you have a look at Lemmy's disk space ?

2019-07-25 20613, 2019

11:07 AM
ruaok

bad again?

2019-07-25 20641, 2019

11:07 AM
ruaok

stable where its at, no?

2019-07-25 20648, 2019

11:07 AM
ruaok

meaning not good, but stable.

2019-07-25 20655, 2019

11:07 AM
ruaok

iliekcomputers: you around?

2019-07-25 20643, 2019

11:19 AM
aidanlw17

hi alastairp, the first PR is all ready for you to review!

2019-07-25 20601, 2019

11:20 AM
alastairp

great, I'm doing errands this morning but will look in a few hours

2019-07-25 20604, 2019

11:21 AM
ruaok

zas: do you know what is normally stored in /var/lib/docker/aufs ?

2019-07-25 20629, 2019

11:21 AM
aidanlw17

No problem!

2019-07-25 20649, 2019

11:23 AM
yvanzo

hi zas: can you confirm SEC-32 is harmless?

2019-07-25 20649, 2019

11:23 AM
BrainzBot

SEC-32: [picard-website] CVE-2019-10744: lodash.merge < 4.6.2 https://tickets.metabrainz.org/browse/SEC-32

2019-07-25 20654, 2019

11:24 AM
yvanzo

(can be done by reviewing and dismissing https://github.com/metabrainz/picard-website/netw… )

2019-07-25 20637, 2019

11:29 AM
spellew

ruaok: It's fine

2019-07-25 20656, 2019

11:29 AM
ruaok

leave sunday or tuesday?

2019-07-25 20619, 2019

11:33 AM
spellew

Tuesday?

2019-07-25 20647, 2019

11:33 AM
spellew

I'd prefer that, if it works for you

2019-07-25 20604, 2019

11:34 AM
ruaok

ok, stick around lets buy this.

2019-07-25 20642, 2019

11:37 AM
spellew

Ok

2019-07-25 20642, 2019

11:43 AM
ruaok

apparently booking this ticket is hard. :(

2019-07-25 20656, 2019

11:45 AM
ruaok

ok, I'm finding a much nicer flight combo, but on two separate tickets. which can be dicey, but you'll have one fewer stops to make.

2019-07-25 20654, 2019

11:52 AM
ruaok

I think you need to move, spellew.

2019-07-25 20602, 2019

11:53 AM
ruaok

might be easier. 🤣

2019-07-25 20618, 2019

11:56 AM
ruaok

what other options are there for you to get to NYC?