in #metabrainz

16:27 PM
Freso

Gentlecat: Are you adding -195 and -196 as tasks?
16:27 PM
Gentlecat

no, I'm in the class
16:28 PM
Freso

Alright. Want me to do it or should I leave them for you to add later? :)
16:32 PM
Gentlecat

do it! 🐱
16:34 PM
Freso: and I don't know if we are going to allow to review releases
16:34 PM
that's why it's an open question
16:34 PM
though I think ideally you should be able to review all entities
16:44 PM
TwistedFate joined the channel
16:47 PM
zas

search servers again, i'm on it
16:48 PM
reosarevok

Thanks :)
16:50 PM
Freso

Gentlecat: I agree. There I several URLs I want to write a review for! :U
16:50 PM
*There are
16:51 PM
Gentlecat

Freso: I'd review "https://beta.musicbrainz.org/user/freso"
16:53 PM
Freso

But users aren't entities yet. :/
16:55 PM
regagain_ has quit
16:58 PM
Gentlecat

but that's a URL, no?
16:59 PM
ruaok reads backscroll of loads of messages from zas
16:59 PM
ruaok

moin zas, yay for the weekend being over?
16:59 PM
ruaok sighs at search servers
17:00 PM
zas

:)
17:00 PM
nothing new on this side, they still crash
17:01 PM
last one is interesting, have a look at http://stats.musicbrainz.org/dashboard/db/searc...
17:01 PM
check close-wait tcp conns (at the bottom)
17:02 PM
both search servers stopped to answer at the same time @16:27 utc
17:03 PM
stanislas

Freso: i think i finally solved the issue about installing my plugin
17:03 PM
Freso: and i updated it so you might take a look
17:03 PM
Freso: restarting calibre (but not shutting it using ctrl-c) helps
17:06 PM
ruaok wishes the color between left and right were the same
17:07 PM
ruaok

zas: the CLOSE_WAIT... I still can't decide if that is a cause or a symptom.
17:08 PM
zas

it is a symptom
17:08 PM
regagain_ joined the channel
17:08 PM
ruaok

yeah, I think so too
17:08 PM
zas

search search still accepts connections while answering threads are blocked or smt
17:09 PM
ruaok nods
17:09 PM
looks at established
17:09 PM
ruaok

do you have the stack trace for when things are borked?
17:09 PM
zas

i have one
17:09 PM
ruaok

I think we should continue on the path of creating the google doc that we started last week.
17:09 PM
zas

on ernie in /home/zas/
17:09 PM
ruaok

update with everything that has changed -- now we're able to really ask for help since we're not using Fred Flintstone's tools anymore.
17:10 PM
zas

yes
17:13 PM
ruaok

the stacktrace is much more varied this time.
17:13 PM
last time they were all stuck in icu code, now a lot are stuck throwing an exception.
17:14 PM
ok, the plot is thickening
17:14 PM
a lot of threads are blocked in writing network IO.
17:15 PM
which suggests a gateway (related) issue, not a search issue.
17:15 PM
yeeeargh joined the channel
17:17 PM
zas: around the time of search server crashes, have you looked at syslog on the active gateway?
17:19 PM
ruaok sees nothing of interest.
17:20 PM
so, if I read the stackdump correctly, it looks like it is dying trying to write the results to the caller.
17:20 PM
which is nginx
17:22 PM
zas: I think we may want to examine our nginx setup and see if we're running out of ... something.
17:26 PM
dpmittal has left the channel
17:28 PM
zas: ping me when you're back please.
17:29 PM
opatel99 joined the channel
17:29 PM
opatel99

Mineo: I have to admit, I am stumped...
17:31 PM
Mineo

if you tell me why, maybe we can fix that :)
17:33 PM
opatel99

ELI5, what should I do? The threading seems straight forward, but the first portion of your comments was crypticto me. Should albums with MBIDs be clustered?
17:34 PM
Mineo

imho only if the option to ignore mbids is true
17:35 PM
the automatic clustering would be most useful for files that are not associated with anything in MB yet
17:36 PM
if there are already MBIDs in the files, the MBIDs are much better information than can be provided by the clustering
17:36 PM
opatel99

Ok... what if there is a combination?
17:36 PM
Mineo

of options?
17:37 PM
opatel99

of files with MBID and no MBID. Should I cluster the ones with no MBID and leave the MBIDs alone?
17:38 PM
Mineo

ah, I had not actually thought of that yet
17:38 PM
opatel99

:o
17:39 PM
Mineo

in that case, I think it would make sense to have an additional method on the tagger object like 'cluster_non_mbid_files' or something that goes through all unmatched files and collects the ones without MBIDs and clusters those
17:39 PM
tl;dr: yes
17:40 PM
opatel99

Okay. Now what about that in combination with ignore MBIDs? If that option is selected, do everything?
17:40 PM
Mineo

yes, just cluster all files in that case
17:41 PM
opatel99

Cool. Giving it a shot.
17:42 PM
Got any more Picard tasks up your sleeve btw? I am kinda useless here without Picard...
17:42 PM
typhoe

Hello again, when trying to import dumps for the first time with the command "./admin/InitDb.pl -- --createdb --import /tmp/dumps/mbdump*.tar.bz2 --echo", I get an error "psql: FATAL: role "musicbrainz" does not exist"
17:43 PM
zas

ruaok: i checked the nginx conf with bitmap, and we saw nothing wrong with it (that doesnt mean nothing is wrong)
17:44 PM
typhoe

Should I create a role or create a clean db (--createdb --clean) before?
17:44 PM
ruaok

understood.
17:44 PM
there is an admin interface that we can get current stats from, yes?
17:44 PM
Mineo

opatel99: I was thinking of making a task to improve/rewrite https://picard.musicbrainz.org/docs/scripting/ because a lot of people struggle with scripting, but I'm not yet sure what exactly needs to be improved
17:44 PM
ruaok

I wonder if we should graph the number of buffers, number of connections, anything for the search* configurations.
17:45 PM
this latest stacktrace really suggests that this is an internal configuration issue and not a lucene/java issue.
17:45 PM
zas

we can add whatever we can get numbers for
17:45 PM
Mineo

is the stacktrace available somewhere?
17:46 PM
ruaok

Mineo: hang on
17:48 PM
Mineo: https://www.dropbox.com/s/gqqd0g13terr1fm/stack...
17:48 PM
I'd love to get your read on the current state of things.
17:49 PM
akirom has quit
17:50 PM
stanislas

LordSputnik, Leftmost: I've done my second plugin. I would be grateful if you review my work. I've not submitted it on gci yet, I just want to know your opinion at this stage. Link to my repo : https://github.com/stasszczesniak/CalibreBookBr...
17:50 PM
ruaok

zas: this doesn't seem detailed enough for my desires, but let's start graphing:
17:50 PM
http://nginx.org/en/docs/http/ngx_http_stub_sta...
17:58 PM
opatel99

Mineo: Where do I get 900 files from? :P
17:59 PM
Freso

opatel99: You could expand your horizons! I'm about to add two more CB tasks, and there's a bunch of unclaimed beets tasks too, as I mentioned previously.
17:59 PM
zas

http://stats.musicbrainz.org/dashboard/db/all-n...
17:59 PM
Mineo

opatel99: https://archive.org/details/Best_of_8_Bit_Colle... should contain more than that
17:59 PM
LordSputnik

stanislas: I won't be able to try it out until tomorrow, but one thing you could do to improve would be to split out the bits of code that initialize the UI into separate functions, so the large methods you have at the moment become smaller and easier to maintain
18:00 PM
Freso

^ +1 (even if I haven't actually looked at the code :))
18:00 PM
zas

ruaok: this is collected since some time already, if possible (=module enabled)
18:03 PM
Mineo

regarding the stackdump: I wonder why a lot of threads are in some EOFException, all coming from eclipse-persistence's JSONWriterRecord
18:03 PM
stanislas

LordSputnik. Ok, i will try to clean my code. Thanks. Maybe Leftmost is willing to review it today.
18:05 PM
LordSputnik

stanislas: hopefully! I'm willing, but universitry deadlines mean that I've not got the time this evening :(
18:05 PM
opatel99

I have exams this entire week... Gonna be so behind once I am done..
18:06 PM
stanislas

LordSputnik: Oh i understand, i have geometry exam tomorrow :)
18:07 PM
reosarevok

stanislas: less coding, more studying! ;)
18:07 PM
regagain_ has quit
18:08 PM
LordSputnik

stanislas: I could add a day onto the task if you like, just in case we don't have it wrapped up by tomorrow evening?
18:10 PM
Mineo

regarding the bb plugin for calibre: you're aware that you're working around Qts event model by using urllib, right?
18:10 PM
ruaok

Mineo: yes, exactly that.
18:11 PM
stanislas

LordSputnik: seems like a good idea
18:11 PM
Mineo: What do you mean ?
18:11 PM
Freso

stanislas: I'll give it a whirl. :)
18:11 PM
ruaok

JSONWriterRecord sounds like the bog standard send the response to the caller and it gets stuck somehow.
18:11 PM
and that somehow would be caused by nginx, since that is what is on the other end.
18:12 PM
thus leading me to think we need to examine our nginx config.
18:12 PM
stanislas

Mineo: No, I don't.
18:12 PM
ruaok

Mineo: does that line of thinking make sense to you?
18:13 PM
stanislas

Mineo: I don't even understand what do you mean by "working areound Qts event model by using urllib"
18:13 PM
Mineo

ruaok: what's surprising to me is that none of the EOFExceptions are related to xml responses getting written, although I suspect the number of those to be way higher than the json ones
18:14 PM
stanislas

Mineo: Are you talking about my plugin ?
18:14 PM
Mineo

oh, wait, the website uses json as well, right?
18:14 PM
ruaok

yes.
18:14 PM
IIRC
18:14 PM
still, that is an interesting observations.
18:14 PM
-s
18:15 PM
Mineo

stanislas: sorry, I didn't want to try having two conversations at once :-)
18:15 PM
stanislas

Mineo; ok
18:16 PM
Mineo

stanislas: calibre seems to be built on Qt which models everything i/o-related (reading files, sending data over the network etc.) as events with callbacks attached to them
18:17 PM
this allows it to do other things while i/o is happening in the background without having to spawn a new thread for every action
18:17 PM
by using urllib to request data from bookbrainz, everything else is blocked while the http request is in progress
18:18 PM
this works if bookbrainz is responding fast, but doesn't work quite so well if the bookbrainz servers take a long time to respond or are completely offline
18:19 PM
stanislas

Mineo: Would doing all https requests in some other thread solve the problem ?
18:23 PM
Mineo

yes and no :P I would expect there to be some helper methods for plugins in calibre
18:27 PM
opatel99

Mineo: Can you explain why the upload for many files fails with auto cluster, but not without?
18:30 PM
regagain_ joined the channel
18:33 PM
bitmap

ruaok: we've been getting ISEs for cut-off JSON response from the search server since at least 2013, probably longer
18:35 PM
Mineo

opatel99: no, I don't really know why that happens, but I think the clustering engine is not meant to be called from multiple threads
18:36 PM
ruaok

bitmap: that is interesting.
18:36 PM
opatel99

So QSemaphore reserves threads?
18:36 PM
ruaok

how frequent are those?
18:37 PM
I wonder if they happen due to index rotation or if they happen when the servers choke
18:37 PM
can we graph the occurance of those, bitmap, zas?
18:37 PM
bitmap

we usually get a couple/few a day I think
18:38 PM
Mineo

opatel99: no, it allows you to count the number of active threads (which you can't just do by incrementing a normal variable in multiple threads)
18:38 PM
bitmap

they include the JSON and show where it gets cut off (then mbserver fails to parse it, hence the ISE)
18:39 PM
ruaok

ok, a few a day really seems to be related to the index rotation.
18:40 PM
not too much we can do about that with the current setup
18:40 PM
opatel99

Mineo: Would http://doc.qt.io/qt-4.8/qwaitcondition.html not work?
18:41 PM
JefftheBest joined the channel
18:43 PM
Mineo

I did not explore every possible option - if you think it works, give it a try :)
18:43 PM
bitmap

typhoe: what version of node.js do you have? I don't think Map() was added officially until 4.0
18:45 PM
LordSputnik: the website says Map() needs a polyfill, not just the code transform http://babeljs.io/docs/learn-es2015/#map-set-we...
18:46 PM
I think I'll just replace that with a plain object