(There were a number of diversity discussions about music festivals in Denmark last year. MB would be a perfect fit to codify this data and make reports from.)
Hobbyboy joined the channel
hugo
@fresco: here is an overview of the styles https://www.youtube.com/watch?v=I0zEAQTnUrU not the music i listen to daily, it is just folk but this radio station just plays those sounds
drsaunders joined the channel
Sophist-UK
samj1912: Ok - I give up. I have a test_metadata.pyc file and cannot find a matching .py file anywhere.
samj1912
lol :P
I think you made it sometime and it stayed
we have a test_scripts and a test_formats
that we use for similar tests
you can add it to test_scripts maybe?
Sophist-UK
Well the .pyc file would not get deleted by git checkout, only if I do it manually. So it will hang around. But at some point I must have had a matching .py file on my PC and run tests.
Perhaps I attempted to create a set of tests for metadata and then decided it was too much effort. Who knows. It was 3 years ago and I can't remember what I had for breakfast.
Mineo
there's a module called decompyle (I think) that can turn a .pyc back into a .py file if you want to have a look at it :)
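(As an aside, stray .pyc files like the one Sophist-UK found can be located mechanically. A minimal sketch, assuming the standard library only; the function name and the two layout cases handled are illustrative, not from this discussion:)

```python
from pathlib import Path

def orphaned_pyc(root):
    """Yield .pyc files under root whose matching .py source no longer exists."""
    for pyc in Path(root).rglob("*.pyc"):
        if pyc.parent.name == "__pycache__":
            # Python 3 layout: pkg/__pycache__/mod.cpython-XY.pyc -> pkg/mod.py
            src = pyc.parent.parent / (pyc.name.split(".")[0] + ".py")
        else:
            # Legacy layout: mod.pyc sits next to mod.py
            src = pyc.with_suffix(".py")
        if not src.exists():
            yield pyc
```

(Run over a working tree, this would flag any compiled file whose source was deleted, since `git checkout` leaves untracked .pyc files behind.)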
ruaok: will do my initial project pitch to a couple of profs tomorrow
I was thinking about extending MsB to an NLP-based metadata extractor (I think I have a very very vague idea how it currently works on last fm data)
couldn't find much on the MsB-server repo
I don't even know much about what the scope and future plans for MsB are
SothoTalker_
MusicBrainz? :D
samj1912
messybrainz
SothoTalker_
:-)
Quesito
hey gcilou! I added a few things to the UX survey doc, about to call it a night, any q's or thoughts before I go?
gcilou
Hey Quesito :) I looked over it, looks nice!
Quesito
hopefully getting somewhere....:P let me know your thoughts / comments, etc!
gcilou
Yeah, definitely! I'll look more thoroughly soon
Quesito
thank you! any advice, thoughts, etc are needed :)
Sophist-UK
samj1912: I decompiled the test_metadata.pyc file and it doesn't look to me like code I would have written. But it's relatively short - I will try to build a comprehensive test_metadata.py.
regagain has quit
samj1912
okay
samj1912 sleeps
SothoTalker_
nighty
is there any way to circumvent 503 errors, other than being a paying supporter? ^^
kepstin
SothoTalker_: what are you trying to do?
SothoTalker_
just fetch stuff from the webservice
kepstin
sure, but what kind of stuff, and how much?
SothoTalker_
all releases of a label
which is 16 requests in total, to get everything.
kepstin
so, the way to not get 503 errors is to follow the rate limiting requirements...
SothoTalker_
i currently make one request every 5 seconds
kepstin
hmm, that should be ok; the actual limit is ~1/second
per ip address, for properly configured apps (set user agent, etc.)
SothoTalker_
it's just a simple script :-)
kepstin
if you're using some generic user agent rather than specifying one that identifies your tool, you hit much harsher rate limits
so that's the first thing to do - set a user agent, call it "SothoTalker's Label Scraper" or whatever, doesn't matter - then you should be able to use the webservice at 1 request/second
if you want faster than 1 req/second, then you basically have two options - look into getting an api key with raised rate limit for the musicbrainz.org ws, or run your own replicated server locally which you can query as fast as you like (or even just read info direct from the postgres db)
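(kepstin's advice boils down to two things: identify your client with a proper User-Agent, and pace requests to about one per second. A minimal sketch with the standard library; the user-agent string, contact address, and function names here are placeholders, not from this log:)

```python
import time
import urllib.request

# Placeholder identity - any string that names your tool and a contact works.
USER_AGENT = "SothoLabelScraper/1.0 (contact@example.com)"

def build_request(url):
    """Return a urllib Request carrying the identifying User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})

def fetch_all(urls, delay=1.0):
    """Fetch each URL in turn, pausing to stay under the ~1 req/s per-IP limit."""
    results = []
    for url in urls:
        with urllib.request.urlopen(build_request(url)) as resp:
            results.append(resp.read())
        time.sleep(delay)
    return results
```

(With a generic user agent the request would instead fall into the much stricter anonymous bucket kepstin describes below.)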
SothoTalker_
well, i don't want faster access, i just don't want that 503 error to happen to me :D
i suspect the WS is just hit heavily, since this didn't occur until about 2 weeks ago.
kepstin
right now I assume you're being thrown into the "anonymous" group because of the user agent on your webservice requests, and due to the global 50/s limit, that's unpredictable.
so fix the user agent first, then see if that helps :/
SothoTalker_
i tried a few ua strings, and it didn't help :)
i even made my script wait 10 seconds between each request
kepstin
hmm. I guess the webservice might just be overloaded overall, then.
SothoTalker_
that's my guess, too.
kepstin
I don't know what the current global limit is, but I thought it was raised with the switch to newhost
SothoTalker_
well, that problem has been happening for about 2 weeks now. before that i had no problems.
kepstin
well, the solution is to "try again later" - If you want to encode that into your script, look at having a backoff algorithm so it's not making the overload worse.
SothoTalker_
and it's not like my script permanently hammers the server. i just run it once or twice a day to update my spreadsheet
i might have to handle the 503 error and add some 'try again after a few seconds' logic :x
kepstin
if you're doing any sort of query like that, try to run it at a random time each day rather than the same time every day
well, you shouldn't retry the entire series of requests - just have it wait for a while, then continue where it left off...
SothoTalker_
that's what i meant
kepstin
you probably want to do an exponential backoff - so if you're e.g. waiting 1s between requests, then if one fails, wait 2s before trying again, if it fails again, wait 4s, and so on.
and if the wait gets too long, maybe decide to give up with an error message ;)
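(The retry strategy kepstin describes, retrying only the failed request with exponentially growing waits and giving up once the wait gets too long, might look like this. A sketch only; the fetch callable and the delay limits are illustrative:)

```python
import time

def fetch_with_backoff(fetch, url, base_delay=1.0, max_delay=60.0):
    """Retry fetch(url) with exponentially growing waits: 1s, 2s, 4s, ...

    Only the failed request is retried, not the whole series of requests;
    once the wait would exceed max_delay, give up and re-raise the error.
    """
    delay = base_delay
    while True:
        try:
            return fetch(url)
        except Exception:
            if delay > max_delay:
                raise  # waited too long: give up with an error
            time.sleep(delay)
            delay *= 2  # double the wait after each failure
```

(The caller keeps its own position in the list of URLs, so a transient 503 only delays one request instead of restarting everything.)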
SothoTalker_
heh
well, now it went through, luckily
alastairp
gcilou: I don't think alpha import should have as much focus. Ideally we want people to ignore their alpha account
gcilou
Ok! I can definitely adjust to that. Rob sent me a comment about his ideal layout for the page, so the whiteboard was following that.
alastairp
Ah, I didn't see that
Anyway, that's my opinion
gcilou
yeah that's cool. I'm trying a few things out to get a feel for it