Hi all. My question is a little bit off-topic, but I am asking here because there are developers here and maybe you could give some advice to a noob like me. I am trying to make a java script which could parse the GEMA PRO database and extract track names with details. PM me if you are interested, please.
pristine__
ruaok: moin. Can we create a tunnel through to the worker nodes? Stderr and stdout for the workers are visible on port 8081.
ruaok
moin!
visible on port 8081 inside the container?
pristine__
We have created a tunnel for port 4040 running in listenbrainz jobs on the leader. This is the Spark UI. In this UI there are tabs (stderr, stdout) for every executor to view executor logs. When I click on one of these, a window opens and something like *10.x.x.5x:8081.xxxxx* appears in the address bar, but nothing is displayed.
So I guess we need a tunnel for 8081.
ruaok
ok, yes that means a tunnel into a different machine and then into the container. hmm. I'll have to ponder this.
pristine__
Yeah. We just want executor logs to debug. By default, Spark logs to spark_home/logs. On the leader, spark/logs is empty. Idk why. I have spent days on this but idk.
By default workers store stderr and stdout in spark_home/work
Can you log in to one of the workers and check /usr/local/spark/work
?
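A minimal sketch of how the worker logs mentioned here could be gathered, assuming the usual standalone layout in which each worker keeps `work/<app-id>/<executor-id>/stderr` and `stdout`. The helper names `collect_executor_logs` and `tail` are illustrative and not part of Spark.

```python
from pathlib import Path


def collect_executor_logs(work_dir="/usr/local/spark/work"):
    """Return {(app_id, executor_id, stream): path} for every log found.

    Assumes the layout <work_dir>/<app-id>/<executor-id>/{stderr,stdout}.
    """
    logs = {}
    root = Path(work_dir)
    if not root.is_dir():
        return logs
    for app_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for exec_dir in sorted(p for p in app_dir.iterdir() if p.is_dir()):
            for stream in ("stderr", "stdout"):
                log_file = exec_dir / stream
                if log_file.is_file():
                    logs[(app_dir.name, exec_dir.name, stream)] = log_file
    return logs


def tail(path, lines=20):
    """Return the last `lines` lines of a log file as a list of strings."""
    with open(path, errors="replace") as handle:
        return handle.read().splitlines()[-lines:]
```

Running `collect_executor_logs()` on a worker and passing each `stderr` path to `tail` gives a quick view of recent executor errors without needing the UI.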
ruaok
that might be easier.
any worker?
pristine__
I have sent a request to join the Spark users list; I can ask about the empty logs there. The UI serves the same purpose, but it will stop once the job stops.
Yeah. Any worker.
But I still say that we should try for the workers UI. It is easier to understand. The stored logs (which must be cleaned regularly) can help us to see error history even if the job has ended. While the job is running, the UI can be great.
ruaok
lets see if the files are useful first.
on leader in /home/vansika is worker-logs.zip
pristine__
A sec
ruaok: can you go to /usr/local/spark and send me a screenshot
And then /usr/local/spark/work and a screenshot
ruaok
you have all of the contents of the work directory in the zip file. did the zip file not work?
pristine__
Cool. What about the app I ran yesterday? Are its logs stored on other workers? We don't have logs from the previous week/month. Are they cleaned up automatically?
So many questions. Lol
ruaok
I presume the other workers have similar files. and I would expect them to get cleaned up after x days, which is likely configurable in some config file.
is the output useful?
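For standalone workers, this kind of automatic cleanup is controlled by `spark.worker.cleanup.enabled`, `spark.worker.cleanup.interval` and `spark.worker.cleanup.appDataTtl` (the latter two in seconds). They are typically passed through `SPARK_WORKER_OPTS` in `conf/spark-env.sh`, and they only remove directories of stopped applications. Whether this cluster sets them is not known from the conversation. The sketch below only shows how such a line could be generated; the helper name and the values are illustrative.

```python
def worker_cleanup_opts(ttl_days=7, interval_minutes=30):
    """Build a SPARK_WORKER_OPTS line enabling periodic work-dir cleanup.

    The values are examples, not this cluster's actual settings.
    """
    options = {
        "spark.worker.cleanup.enabled": "true",
        # How often the worker checks for old application directories.
        "spark.worker.cleanup.interval": str(interval_minutes * 60),
        # How long a stopped application's directory is retained.
        "spark.worker.cleanup.appDataTtl": str(ttl_days * 24 * 3600),
    }
    flags = " ".join(f"-D{key}={value}" for key, value in options.items())
    return f'export SPARK_WORKER_OPTS="{flags}"'


print(worker_cleanup_opts())
```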
pristine__
Loads of it. I will go through it and get back to you
Can we discuss a lil about cleaning up models?
(thanks for the zip)
ruaok
sure.
pristine__
So I was thinking, whenever we save a model (which will most probably be after months), should we clean up the previous one(s)?
ruaok
I wonder what our thinking here should be.
clean up by default and mark others as "saved, in use"?
or manually clean up and only carefully delete items.
or maybe, just keep the latest one? I guess the question is how we specify which model to use for recommendations.
pristine__
The latest one
I guess
ruaok
perhaps, this question is premature.
pristine__
If we don't delete, we may run out of space over time.
ruaok
it is a good question, but we're not fully certain how we're going to use data yet.
we will.
how about we do something simple to start with and simply keep the X latest models, but delete everything else?
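The "keep the X latest models" idea could be sketched as follows. This is a hedged illustration that assumes each model is saved as its own directory on a local filesystem and that modification time reflects age. The function name `prune_models` is hypothetical, and a cluster filesystem would need its own client instead of `shutil`.

```python
import shutil
from pathlib import Path


def prune_models(models_dir, keep, dry_run=True):
    """Delete all but the `keep` most recently modified model directories.

    Returns the names of the directories that were (or would be) removed.
    """
    if keep < 1:
        raise ValueError("keep must be at least 1")
    model_dirs = [p for p in Path(models_dir).iterdir() if p.is_dir()]
    # Newest first, by modification time.
    model_dirs.sort(key=lambda p: p.stat().st_mtime, reverse=True)
    stale = model_dirs[keep:]
    if not dry_run:
        for path in stale:
            shutil.rmtree(path)
    return [p.name for p in stale]
```

Defaulting to `dry_run=True` means a first call only reports what would be removed, which suits the "only carefully delete items" concern raised earlier.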
pristine__
Cool. So maybe until we are sure about it, I will manually delete all the models that are created while testing.
Yes. Sounds good.
ruaok
ok
pristine__
What should X be?
ruaok
7?
pristine__
Ummm. It actually depends on how much data we are using for training. Like for around 6 months, consider one GB. 1*7 = 7 GB
7*3 = 21 GB
After replication.
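The arithmetic above, written out as a small sketch. The 1 GB per model and the replication factor of 3 are the rough figures from the conversation, not measurements, and the function name is illustrative.

```python
def model_storage_gb(models_kept, gb_per_model=1.0, replication=3):
    """Raw disk usage in GB for `models_kept` models after replication."""
    return models_kept * gb_per_model * replication


print(model_storage_gb(7))  # 7 models * 1 GB * 3 replicas = 21.0
```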
Also, we should have a JSON or Parquet file for storing metadata about models (which data it was trained on, when it was trained, size, etc.)
Maybe 4 to start with.
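A minimal sketch of the model metadata idea, using JSON. The field names (`model_id`, `trained_at`, `training_data_range`, `size_bytes`) and the `metadata.json` file name are assumptions made for illustration; the conversation only says the file should record what data the model was trained on, when, and its size.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def write_model_metadata(model_dir, training_data_range, size_bytes):
    """Write metadata.json inside a saved model directory and return its path."""
    metadata = {
        "model_id": Path(model_dir).name,
        "trained_at": datetime.now(timezone.utc).isoformat(),
        "training_data_range": training_data_range,
        "size_bytes": size_bytes,
    }
    path = Path(model_dir) / "metadata.json"
    path.write_text(json.dumps(metadata, indent=2))
    return path
```

Keeping the metadata next to each model means that deleting a model directory also removes its record, so the two cannot drift apart.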
ruaok
:)
pristine__
I was just thinking, why would we require older models? I could not get to an answer.
ruaok
we may find a model that works well and put it into production.
but at the same time we will want to continue evolving models.
pristine__
Oh. Right.
ruaok
we need to keep at least one around for production. possibly keep more around for various production scenarios.
pristine__
This project is wow. Everything has to be done from scratch, so much brainstorming. Yay! Thank you <3
ruaok
I know the feeling. part of it is exciting, part of it is tiring. but it has been tiring for 20 years, so I am used to it.
pristine__
I have never done something like this before. I like it.