#metabrainz

      • JonnyJD has quit
      • QuoraUK has quit
      • JesseW joined the channel
      • Slurpee joined the channel
      • Slurpee has quit
      • Slurpee joined the channel
      • D4RK-PH0ENiX has quit
      • D4RK-PH0ENiX joined the channel
      • Slurpee has quit
      • pingupingu joined the channel
      • kurros has quit
      • kurros joined the channel
      • mihaitish joined the channel
      • mihaitish has quit
      • pingupingu
        hello folks
      • I tried requesting json responses, but I get a 503 response, whereas the same request with xml gives me the result: http://musicbrainz.org/ws/2/artist/05cbaf37-6dc...
      • change the &fmt=xml to &fmt=json and I get 503, any ideas why?
      • bitmap
        503 means the web service is over capacity, you'll just have to try again
      • pingupingu
        hey there again bitmap
      • howdy?
      • I thought it was essentially the same service but just doing the xml to json translation
      • bitmap
        it is :) the &fmt= isn't having any effect
      • you probably just got unlucky, I only get 503s for the XML for example
      • pingupingu
        isnt having any effect?
      • do you have a sample query for the json results?
      • bitmap
        it's not affecting the rate at which you get 503s at all
      • you already linked one, just change fmt=xml to fmt=json
      • refresh until the server isn't busy
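The "change fmt and refresh until the server isn't busy" advice above can be automated. The following is a minimal sketch, not an official client: `with_format` and `fetch_with_retry` are hypothetical helper names, and the exponential backoff starting at one second is an assumption rather than a documented MusicBrainz requirement.

```python
import time
import urllib.error
import urllib.request
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_format(url, fmt):
    """Return url with its fmt= query parameter set to fmt (hypothetical helper)."""
    parts = urlsplit(url)
    # Drop any existing fmt= value, keep every other parameter in order.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k != "fmt"]
    query.append(("fmt", fmt))
    return urlunsplit(parts._replace(query=urlencode(query)))


def fetch_with_retry(url, open_url=urllib.request.urlopen, max_tries=5,
                     base_delay=1.0, sleep=time.sleep):
    """Fetch url, retrying only on HTTP 503 with exponential backoff.

    open_url and sleep are injectable so the logic can be exercised
    without touching the network. The delay schedule is an assumption.
    """
    for attempt in range(max_tries):
        try:
            with open_url(url) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # Any error other than 503, or a 503 on the last try, is re-raised.
            if err.code != 503 or attempt == max_tries - 1:
                raise
            sleep(base_delay * 2 ** attempt)
```

Usage would look like `fetch_with_retry(with_format(url, "json"))`, where `url` is the full web service URL being requested.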
      • pingupingu has left the channel
      • pingupingu joined the channel
      • pingupingu
        bugger i quit accidentally
      • now I get 503 on xml too. are the servers usually this busy?
      • bitmap
        lately yes, but zas changed the rate limiting yesterday and that's having an effect too
      • pingupingu
        ha I managed to get to beta.musicbrainz.org with both jsons and xml responses
      • I just wanted to take a look at the responses
      • bitmap
        it seems to be less fair if you're only making a few requests now. hmph
      • pingupingu
        so the 50 responses per second per app query is changed now?
      • I didn't expect to get a 503 immediately on the first json request
      • bitmap
        afaik our rate limit has always been 1 req/s, not sure where 50 req/s per app comes from
      • but it seems we don't have the capacity to honor 1 req/s for everyone right now
      • pingupingu
        ha I just read that 50req/sec let me get that url for you
      • bitmap
        well, we have separate limits per-ip and per-user-agent, so that could be it
      • pingupingu
      • yeah
      • bitmap
        ah, right then
      • pingupingu
        it says "For "python-musicbrainz/0.7.3": we allow through (on average) 50 requests per second, and decline the rest (though recently this has not been hit). For "anonymous" user-agents (see below): we allow through (on average) 50 requests per second, and decline (http 503) the rest. For other user-agents: allow through."
      • bitmap
        yeah, I don't think that's true for the new rate limiting that's in place since yesterday
      • pingupingu
        so you mean "For other user-agents: allow through." is actually 1 req/sec?
      • bitmap
        I'll have to ask zas how the new stuff works. since headphones uses up the majority of our traffic it seems to be less fair to the rest
      • pingupingu
        bitmap, I tried to download the mbdump, mbedit and mb-something else and load them on my postgresql instance, and then I ran out of space at 25gb
      • how much disc space do I need for that db dump?
      • bitmap
        you probably don't need the edit dump, which is the vast majority of that space
      • exclude that and you should be fine, I think
      • JesseW
        from my reading of the rate limit page, it's 50 req/sec for *all the clients with that user-agent*, divided up among IP addresses with 1 req/sec for each one (so effectively, up to 50 different IP addresses using the same user-agent will be allowed through simultaneously, but no more). And for other user-agents, there's no maximum limit, but still a limit of 1 req/sec from the IP address.
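The reading of the rate limit page given above can be written down as a toy model. This is only a sketch of that interpretation, not how the MusicBrainz servers implement it: `WindowLimiter` and `make_checker` are hypothetical names, and counting hits per whole-second window is a simplifying assumption.

```python
from collections import defaultdict


class WindowLimiter:
    """Allow at most `limit` hits per key within each whole-second window."""

    def __init__(self, limit):
        self.limit = limit
        # Counts are never pruned; acceptable for a short illustration only.
        self.counts = defaultdict(int)

    def allow(self, key, now):
        window = (key, int(now))
        if self.counts[window] >= self.limit:
            return False
        self.counts[window] += 1
        return True


def make_checker(ua_limits, per_ip_limit=1):
    """Build a check(ip, user_agent, now) function returning 200 or 503.

    ua_limits maps a user-agent string to its shared requests-per-second cap.
    User-agents not listed have no shared cap, only the per-IP limit.
    """
    per_ip = WindowLimiter(per_ip_limit)
    per_ua = {ua: WindowLimiter(limit) for ua, limit in ua_limits.items()}

    def check(ip, user_agent, now):
        if not per_ip.allow(ip, now):
            return 503
        limiter = per_ua.get(user_agent)
        if limiter is not None and not limiter.allow(user_agent, now):
            return 503
        return 200

    return check
```

Under this model, 60 different IP addresses sharing a capped user-agent within the same second would see 50 requests allowed and 10 declined, while a single IP address is held to one request per second regardless of user-agent.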
      • bitmap
        zas: I think the rate at which people receive 503s has changed because the 'global' limit is now applied before the user-agent limits, whereas previously it was applied after?
      • pingupingu
        righto.. i just checked i actually used the mb-editor instead of mb-edit (which is 3.7gb)
      • those ones I am trying to import mbdump-derived.tar.bz2 mbdump-editor.tar.bz2 mbdump.tar.bz2 (2.xgb)
      • but I ran out of disk at about 20gb
      • bitmap
        try doubling the size of the disk
      • pingupingu
        so what is required for the minimal dump for me to start using the python api and get some results?
      • CallerNo6
        JesseW, yeah, that was my understanding too
      • bitmap
        those should be enough
      • pingupingu
        hmm righto.. I will have to request more disk space
      • righto.. then I should not be bound by the 1 req/sec rate limit etc when I am running a local musicbrainz instance
      • bitmap
        right
      • pingupingu
        good
      • bitmap, ya seem to know quit a bit about mb, what do you do?
      • bitmap
        I'm one of the devs :)
      • pingupingu
        ahh great :)
      • JesseW wonders about the possibilities of leveraging one of the distributed content systems that are popular these days to distribute the load of serving musicbrainz content
      • JesseW
        maybe IPFS, or DAT, or one of those.
      • alastairp
        "7:19 AM <pingupingu> it says "For "python-musicbrainz/0.7.3": we allow through (on average) 50 requests per second, "
      • regagain joined the channel
      • note that python-musicbrainz is not the same as python-musicbrainzngs
      • that text is wayyyyy out of date and should be deleted
      • mihaitish joined the channel
      • pingupingu has quit
      • xps2 has quit
      • xps2 joined the channel
      • diana_olhovyk joined the channel
      • JesseW has quit
      • yeeeargh joined the channel
      • Freso would certainly hope that bitmap knows "quit a bit about mb" :p
      • Freso
        alastairp: WP:BB :)
      • mihaitish has quit
      • JonnyJD joined the channel
      • ariscop has quit
      • kartikgupta0909 joined the channel
      • kartikgupta0909
        alastairp: We can start whenever you're ready
      • zas
        bitmap: you're right, the global rate limit is now applied before the UA or IP ones, and it is less fair to small users. Though we don't have much choice.
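Why applying the global limit first can be less fair to small users is easier to see in a toy simulation. This is a sketch under loud assumptions, not the real server configuration: `serve` is a hypothetical function, everything happens in a single one-second window, and a request declined by a later check is assumed to still use up a slot in an earlier one.

```python
from collections import Counter


def serve(requests, global_limit, per_client_limit, global_first):
    """Count served requests per client within one one-second window.

    requests is the arrival-ordered list of client names.
    """
    served = Counter()
    seen = Counter()  # requests counted against each client's own limit
    admitted = 0      # requests counted against the global limit
    for client in requests:
        if global_first:
            if admitted >= global_limit:
                continue
            admitted += 1  # slot is used even if the client check declines
            if seen[client] >= per_client_limit:
                continue
            seen[client] += 1
        else:
            if seen[client] >= per_client_limit:
                continue
            seen[client] += 1
            if admitted >= global_limit:
                continue
            admitted += 1
        served[client] += 1
    return served
```

With one heavy client sending 100 requests ahead of a single request from a small client, a global limit of 10 and a per-client limit of 1, the global-first order serves only the heavy client, whereas the per-client-first order serves one request from each.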
      • MBJenkins
        Project acousticbrainz-server build #52: SUCCESS in 4 min 44 sec: https://ci.metabrainz.org/job/acousticbrainz-se...
      • kanha has quit
      • zas
        I'll see how to improve things in the next few days. On the positive side, the MB website hasn't been this responsive in months, and such a drastic restriction on the WS will encourage big users to use their own db.
      • And the search servers haven't crashed since the change was made.
      • ariscop joined the channel
      • alastairp
        kartikgupta0909: hey, I'm here
      • let me find your document
      • OK
      • kartikgupta0909
        lets start
      • alastairp
        I don't think that we need to change much in the proposal. The only thing that I see currently is that we could possibly target the latest ubuntu instead - 16.04 instead of 14.04
      • kartikgupta0909
        yes
      • alastairp
        zas: can you explain to me a little about chef - does it prefer to "own" the entire machine?
      • kartikgupta0909
        also authentication
      • alastairp
        we're looking at an automated way to install a bunch of packages and do configuration
      • except people will probably run this on an existing server
      • kartikgupta0909: right. Gentlecat started a patch for API keys for users
      • kartikgupta0909
        oh
      • alastairp
        I think this is a good start for us, we may not need anything else
      • kartikgupta0909
        for acousticbrainz users?
      • alastairp
      • right
      • zas
        alastairp: chef is more about ensuring things are in a well defined state
      • kartikgupta0909
        yes, API key might be sufficient for authentication
      • alastairp
        zas: our goal is to install some dependencies, and some source code
      • it sounds like something that chef is well suited for, except I wonder if it's a very large stick
      • and we just need a shell script
      • zas
        Chef can do it of course
      • alastairp
        kartikgupta0909: I propose that you develop on a Vagrant system, this means that it's easy for us to test the entire setup process
      • your first milestones are to set up the software deployment
      • kartikgupta0909
        sure, will do that. Even my acousticbrainz server runs on vagrant
      • yes
      • alastairp
        perfect, so you can probably start by copying the acousticbrainz vagrant configuration
      • zas
        You may start to write a shell script, then convert it to chef recipes. Chef isn't easy to learn.
      • alastairp
        hmm, right
      • zas
        Chef + vagrant works well
      • alastairp
        kartikgupta0909: OK, based on zas' feedback, let's stay with shell scripts for now
      • https://github.com/metabrainz/acousticbrainz-se... is the main entry point for the AB install scripts
      • kartikgupta0909
        yes, will do that. Also I have no idea what chef is. I was thinking of writing installation scripts similar to the ones already included in acousticbrainz server
      • yes thats what i was talking about
      • alastairp
        chef is a management tool for setting up servers
      • it's a bit more structured than just writing a script
      • and has features to easily deploy to many machines at a time
      • kartikgupta0909
        oh okay, I will look into that and try to learn it. For now I will start working on the scripts.
      • alastairp
        however, don't worry too much about this for now. If we have time we could look at changing to it
      • as zas says, it's a lot to learn
      • regarding installation of dependencies, I don't want to install gaia from source
      • kartikgupta0909
        also you were talking about maintaining our own repo for 16.04, which will contain all the dependencies
      • yes
      • alastairp
        right, exactly
      • kartikgupta0909
        although I don't know how those repositories work. But I will learn that before the start of the coding period.
      • alastairp
        We have tools to build debian/ubuntu packages for essentia: https://github.com/MTG/essentia/tree/master/debian
      • kartikgupta0909
        do we need essentia for this?
      • alastairp
        there are also tools for gaia, but they are more out of date: https://github.com/MTG/gaia/tree/master/debian
      • kartikgupta0909
        i doubt that
      • alastairp
        no, we only need gaia
      • kartikgupta0909
        yes
      • alastairp
        I suggest you start looking at debian package building. There are lots of tutorials about how to do it: https://wiki.debian.org/HowToPackageForDebian
      • kartikgupta0909
        other things are yaml, protobuf and maybe Screen
      • yes I will follow the tutorials over this weekend itself.