it says "For "python-musicbrainz/0.7.3": we allow through (on average) 50 requests per second, and decline the rest (though recently this has not been hit). For "anonymous" user-agents (see below): we allow through (on average) 50 requests per second, and decline (http 503) the rest. For other user-agents: allow through."
bitmap
yeah, I don't think that's true for the new rate limiting that's in place since yesterday
pingupingu
so you mean "For other user-agents: allow through." is actually 1 req/sec?
bitmap
I'll have to ask zas how the new stuff works. since headphones uses up the majority of our traffic it seems to be less fair to the rest
pingupingu
bitmap, I tried to download mbdump, mbedit and mb-something else and load them into my postgresql instance, and then I ran out of space at 25gb
how much disk space do I need for that db dump?
bitmap
you probably don't need the edit dump, which is the vast majority of that space
exclude that and you should be fine, I think
JesseW
from my reading of the rate limit page, it's 50 req/sec for *all the clients with that user-agent*, divided up among IP addresses with 1 req/sec for each one (so effectively, up to 50 different IP addresses using the same user-agent will be allowed through simultaneously, but no more). And for other user-agents, there's no maximum limit, but still a limit of 1 req/sec from the IP address.
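The reading above can be sketched as two stacked limiters. This is only a toy model of that interpretation, not how the MusicBrainz servers are implemented: `WindowLimiter`, `status_for`, and `CAPPED_AGENTS` are hypothetical names, and a fixed one-second window is an assumed simplification of the "on average" wording on the page.

```python
from collections import defaultdict

# Hypothetical set of user agents sharing one cap; the string comes from the quoted page.
CAPPED_AGENTS = {"python-musicbrainz/0.7.3"}


class WindowLimiter:
    """Allow at most `limit` requests per key within each one-second window."""

    def __init__(self, limit):
        self.limit = limit
        self.counts = defaultdict(int)  # (key, second) -> requests seen so far

    def allow(self, key, now):
        window = (key, int(now))
        if self.counts[window] >= self.limit:
            return False
        self.counts[window] += 1
        return True


def status_for(user_agent, ip, now, ip_limiter, ua_limiter):
    """Return the HTTP status a request would get under this reading of the limits."""
    # Every client is held to 1 req/sec per IP address.
    if not ip_limiter.allow(ip, now):
        return 503
    # Capped user agents additionally share 50 req/sec across all their IPs.
    if user_agent in CAPPED_AGENTS and not ua_limiter.allow(user_agent, now):
        return 503
    return 200


ip_limiter = WindowLimiter(limit=1)
ua_limiter = WindowLimiter(limit=50)
# 60 different IP addresses using the capped user agent within the same second.
statuses = [
    status_for("python-musicbrainz/0.7.3", f"10.0.0.{i}", 0.0, ip_limiter, ua_limiter)
    for i in range(60)
]
print(statuses.count(200), "allowed,", statuses.count(503), "declined")
```

With 60 distinct addresses in one second, the model lets 50 through and declines the rest, while any other user agent is bounded only by the per-IP limit.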
bitmap
zas: I think the rate at which people receive 503s has changed because the 'global' limit is now applied before the user-agent limits, whereas previously it was applied after?
pingupingu
righto.. I just checked, I actually used mb-editor instead of mb-edit (which is 3.7gb)
the ones I am trying to import are mbdump-derived.tar.bz2, mbdump-editor.tar.bz2 and mbdump.tar.bz2 (2.xgb)
but I ran out of disk at about 20gb
bitmap
try doubling the size of the disk
pingupingu
so what is required for a minimal dump for me to start using the python api and get some results?
CallerNo6
JesseW, yeah, that was my understanding too
bitmap
those should be enough
pingupingu
hmm righto.. I will have to request more disk space
righto.. then I should not be bound by the 1 req/sec rate limit etc when I am running a local musicbrainz instance
bitmap
right
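For a client that still talks to the public web service rather than a local instance, the 1 req/sec per-IP limit discussed above can be respected by spacing out calls on the client side. The following is a minimal sketch using only the standard library; `Throttle` is a hypothetical helper, not part of any MusicBrainz client library, and the one-second interval is taken from the discussion rather than from official documentation.

```python
import time


class Throttle:
    """Block so that successive calls to wait() are at least min_interval apart."""

    def __init__(self, min_interval=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self.clock = clock      # injectable so the class can be tested without real delays
        self.sleep = sleep
        self._last = None       # time of the previous permitted call

    def wait(self):
        now = self.clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self.sleep(remaining)
                now = self.clock()
        self._last = now


throttle = Throttle(min_interval=1.0)
for mbid in ["first", "second"]:   # placeholder identifiers, not real MBIDs
    throttle.wait()                # call this before each web service request
    print("would request", mbid)
```

Against a local instance the same loop could use `min_interval=0.0`, since no shared limit applies there.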
pingupingu
good
bitmap, ya seem to know quit a bit about mb, what do you do?
bitmap
I'm one of the devs :)
pingupingu
ahh great :)
JesseW wonders about the possibility of leveraging one of the distributed content systems that are popular these days to distribute the load of serving musicbrainz content
JesseW
maybe IPFS, or DAT, or one of those.
alastairp
"7:19 AM <pingupingu> it says "For "python-musicbrainz/0.7.3": we allow through (on average) 50 requests per second, "
regagain joined the channel
note that python-musicbrainz is not the same as python-musicbrainzngs
that text is wayyyyy out of date and should be deleted
mihaitish joined the channel
pingupingu has quit
xps2 has quit
xps2 joined the channel
diana_olhovyk joined the channel
JesseW has quit
yeeeargh joined the channel
Freso would certainly hope that bitmap knows "quit a bit about mb" :p
Freso
alastairp: WP:BB :)
mihaitish has quit
JonnyJD joined the channel
ariscop has quit
kartikgupta0909 joined the channel
kartikgupta0909
alastairp: We can start whenever you're ready
zas
bitmap: you're right, the global rate limit is now applied before the UA or IP ones, and it is less fair to small users. Though we don't have much choice.
I'll see how to improve things in the next few days. On the positive side, the MB website hasn't been this responsive in months, and such a drastic restriction on WS will encourage big users to use their own db.
And the search servers haven't crashed since the change was made.
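Why the ordering matters for small users can be illustrated with a toy simulation of a single second of traffic. It rests on an assumption that is not confirmed anywhere in this discussion: a request that passes the global check uses up global budget even if a later user-agent check declines it. The function name `simulate`, the limits, and the agent names are all made up for illustration.

```python
from collections import Counter


def simulate(requests, global_limit, ua_limits, global_first):
    """Return the user agents whose requests were served within one second."""
    global_used = 0
    ua_used = Counter()
    served = []
    for ua in requests:
        over_ua = ua in ua_limits and ua_used[ua] >= ua_limits[ua]
        if global_first:
            if global_used >= global_limit:
                continue
            global_used += 1      # assumed: budget is spent before the UA check runs
            if over_ua:
                continue
        else:
            if over_ua:
                continue          # declined early, global budget untouched
            if global_used >= global_limit:
                continue
            global_used += 1
        ua_used[ua] += 1
        served.append(ua)
    return served


# One heavy client sends 20 requests, then one small client sends a single request.
traffic = ["heavy"] * 20 + ["small"]
limits = {"heavy": 5}

print("global first:", Counter(simulate(traffic, 10, limits, global_first=True)))
print("UA first:    ", Counter(simulate(traffic, 10, limits, global_first=False)))
```

In this model the heavy client is served five times either way, but only the user-agent-first ordering leaves global budget for the small client.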
ariscop joined the channel
alastairp
kartikgupta0909: hey, I'm here
let me find your document
OK
kartikgupta0909
let's start
alastairp
I don't think that we need to change much in the proposal. The only thing that I see currently is that we could possibly target the latest ubuntu, 16.04, instead of 14.04
kartikgupta0909
yes
alastairp
zas: can you explain to me a little about chef - does it prefer to "own" the entire machine?
kartikgupta0909
also authentication
alastairp
we're looking at an automated way to install a bunch of packages and do configuration
except people will probably run this on an existing server
kartikgupta0909: right. Gentlecat started a patch for API keys for users
kartikgupta0909
oh
alastairp
I think this is a good start for us, we may not need anything else
kartikgupta0909
yes, will do that. Also I have no idea what chef is. I was thinking of writing installation scripts similar to the ones already included in acousticbrainz server
yes, that's what I was talking about
alastairp
chef is a management tool for setting up servers
it's a bit more structured than just writing a script
and has features to easily deploy to many machines at a time