#musicbrainz-devel

/

      • JonnyJD_
        ianmcorvidae: goodies can't do web requests, so they don't make sense for MB
      • 2013-11-06 31049, 2013

      • ruaok
        er responses.
      • 2013-11-06 31017, 2013

      • ianmcorvidae
        JonnyJD_: no, but point stands -- I think longtail is probably the main one that's workable for MB data
      • 2013-11-06 31035, 2013

      • ianmcorvidae
        JonnyJD_: only included it because it's keyword-oriented, like all the others that aren't longtail
      • 2013-11-06 31052, 2013

      • navap
        Responses? What do they have to do with the logs?
      • 2013-11-06 31025, 2013

      • ianmcorvidae
        hm, possibly the longtail ones are also keyword-based
      • 2013-11-06 31036, 2013

      • misterswag joined the channel
      • 2013-11-06 31004, 2013

      • ruaok
        navap: I thought you were talking about responses, since you said "compact"
      • 2013-11-06 31019, 2013

      • ianmcorvidae
        I'd still go with fathead or longtail, anyway, since I don't know how they throttle their requests to our WS, if they do -- though we'd need to find a source that could be used for it (I *guess* the data dumps, but that'd be hairy)
      • 2013-11-06 31033, 2013

      • njh
        Twitter URLs seems to be stored in MusicBrainz using a mixture of http and https. Is this a known issue? Can I raise a bug?
      • 2013-11-06 31018, 2013

      • navap
        ruaok: I was thinking about the possibility of keeping logs for a longer time period so we have more to analyze. 1 web visit -> several hits, 1 ws visit -> 1 hit. That's what I meant by more compact
      • 2013-11-06 31020, 2013

      • ianmcorvidae
        njh: they changed it at some point (twitter, I mean); afaik the notion is that https is preferred but they need to be changed over time
      • 2013-11-06 31047, 2013

      • nikki
        ianmcorvidae: no, they've already been fixed
      • 2013-11-06 31051, 2013

      • ruaok
        navap: ah.
      • 2013-11-06 31006, 2013

      • rvedotrc
        I can't get used to ruaok being in the same(ish) time zone as me.
      • 2013-11-06 31007, 2013

      • nikki
        there's exactly two left, which probably slipped past the cleanup
      • 2013-11-06 31014, 2013

      • ianmcorvidae
        ah, lol
      • 2013-11-06 31018, 2013

      • ruaok
        rvedotrc: :)
      • 2013-11-06 31027, 2013

      • rvedotrc
        It's like: replies, but faster! :-D
      • 2013-11-06 31051, 2013

      • nikki
        haha
      • 2013-11-06 31057, 2013

      • nikki
        yeah, it's still weird to me too
      • 2013-11-06 31005, 2013

      • ruaok
        navap: we could double the number of days we keep, but then we'd be outta space on that server.
      • 2013-11-06 31020, 2013

      • ruaok
        nikki: even after 6 months?
      • 2013-11-06 31042, 2013

      • nikki
        njh: anyway, there's your answer :P our data was already fixed quite a while ago, if yours isn't then it's out of date
      • 2013-11-06 31049, 2013

      • ruaok
        ok, I gotta head out. getting dark here in b-town.
      • 2013-11-06 31019, 2013

      • nikki
        ruaok: yes. 6 months isn't much after 8 years or so of knowing you were in california :P
      • 2013-11-06 31026, 2013

      • ruaok
        ha, true that
      • 2013-11-06 31037, 2013

      • ruaok
        bbiab
      • 2013-11-06 31033, 2013

      • JonnyJD_
        ianmcorvidae: sounds like the duckduckhack stuff will again be exposed in an api, so one could use the duckduckgo api to query musicbrainz indirectly. Sounds like the number of requests can be considerable.
      • 2013-11-06 31000, 2013

      • ianmcorvidae
        JonnyJD_: yeah -- looking further they do cache results for a period
      • 2013-11-06 31014, 2013

      • JonnyJD_
        https://api.duckduckgo.com/api : Our long-term goal is for all of our instant answers to be available through this open API.
      • 2013-11-06 31024, 2013

      • ianmcorvidae
        (which you can change/configure -- but we might still have to give them a special ratelimit bucket)
      • 2013-11-06 31030, 2013

      • misterswag joined the channel
      • 2013-11-06 31058, 2013

      • MBJenkins joined the channel
      • 2013-11-06 31001, 2013

      • ianmcorvidae
        yeah, it is
      • 2013-11-06 31007, 2013

      • ianmcorvidae
        (going off [off], doesn't matter much)
      • 2013-11-06 31050, 2013

      • ianmcorvidae
        some things using fathead are e.g. the arch packages thing, which downloads a list of packages and processes it into their tab-separated format
      • 2013-11-06 31037, 2013

      • ianmcorvidae
        so I guess their intention is to use spice where there's an API to use and fathead when you need to process a downloaded file, but
      • 2013-11-06 31040, 2013

      • JonnyJD_
        yep, I also found the arch thing
      • 2013-11-06 31058, 2013

      • JonnyJD_
        but the code is just some metadata, couldn't find the rest
      • 2013-11-06 31005, 2013

      • MBJenkins joined the channel
      • 2013-11-06 31006, 2013

      • ianmcorvidae
        it's in the 'share' folder
      • 2013-11-06 31008, 2013

      • ianmcorvidae
        for the actual code
      • 2013-11-06 31012, 2013

      • ianmcorvidae
        the lib folder just has the metadata bit
      • 2013-11-06 31017, 2013

      • ianmcorvidae
      • 2013-11-06 31039, 2013

      • JonnyJD_
        D'oh..
      • 2013-11-06 31054, 2013

      • JonnyJD_
        also just found that
      • 2013-11-06 31021, 2013

      • ruaok joined the channel
      • 2013-11-06 31022, 2013

      • ruaok joined the channel
      • 2013-11-06 31031, 2013

      • JonnyJD_
        so yes, we should use fathead/longtail, because then this is load on the DDG servers, not ours
      • 2013-11-06 31001, 2013

      • ianmcorvidae
        yeah
      • 2013-11-06 31011, 2013

      • ianmcorvidae
        but then I guess it just gets to process data dumps
      • 2013-11-06 31017, 2013

      • ianmcorvidae
        which will be a fun game :P
      • 2013-11-06 31006, 2013

      • JonnyJD_
        a sane thing to do would be to let DDG host a MB mirror (+ search server)..
      • 2013-11-06 31026, 2013

      • JonnyJD_
        though I guess this is exactly what they don't want to do
      • 2013-11-06 31048, 2013

      • ruaok
        that wasn't discussed.
      • 2013-11-06 31007, 2013

      • ianmcorvidae
        yeah, I suppose it is mostly search traffic too
      • 2013-11-06 31012, 2013

      • ianmcorvidae
        which is at least much nicer on us
      • 2013-11-06 31020, 2013

      • JonnyJD_
        well, hosting a MB server is a lot of effort, but so is re-inventing "having some kind of dump to query against"
      • 2013-11-06 31054, 2013

      • ruaok
        crap.
      • 2013-11-06 31059, 2013

      • ruaok
        hobbes is in bad shape.
      • 2013-11-06 31004, 2013

      • ruaok
        its second drive is failing too.
      • 2013-11-06 31028, 2013

      • ruaok
        that's why it keeps kicking over into read-only mode.
      • 2013-11-06 31016, 2013

      • nikki
        what's on hobbes?
      • 2013-11-06 31021, 2013

      • ruaok
        jenkins
      • 2013-11-06 31036, 2013

      • ruaok
        used to be test/beta/search index updater.
      • 2013-11-06 31057, 2013

      • ruaok
        but iirc jenkins is the only real thing that is being used.
      • 2013-11-06 31003, 2013

      • ruaok
        the machine could use a refresh anyway.
      • 2013-11-06 31009, 2013

      • ianmcorvidae
        seems accurate, yes
      • 2013-11-06 31015, 2013

      • ruaok
        ocharles: still around?
      • 2013-11-06 31032, 2013

      • ruaok
        are you familiar with jenkins?
      • 2013-11-06 31002, 2013

      • ruaok
        (that was directed at ianmcorvidae)
      • 2013-11-06 31006, 2013

      • ianmcorvidae
        me? only roughly, ocharles has done the majority of the setup
      • 2013-11-06 31013, 2013

      • ruaok
        are the configs checked into git?
      • 2013-11-06 31023, 2013

      • ruaok
        yeah, I figured that.
      • 2013-11-06 31024, 2013

      • ianmcorvidae
        they aren't, of that much I'm sure
      • 2013-11-06 31032, 2013

      • ruaok
        ok, we need to save them.
      • 2013-11-06 31033, 2013

      • ianmcorvidae
        unless they got snuck in very craftily somehow, but
      • 2013-11-06 31038, 2013

      • ruaok
        like RIGHT NOW.
      • 2013-11-06 31034, 2013

      • ruaok
        it already is very unhappy.
      • 2013-11-06 31043, 2013

      • ianmcorvidae
        looking at it
      • 2013-11-06 31049, 2013

      • ruaok
        k, thanks.
      • 2013-11-06 31057, 2013

      • ruaok
        I can check to see if that part is backed up.
      • 2013-11-06 31011, 2013

      • ianmcorvidae
        I'm having a reasonable deal of trouble even getting into hobbes to look, but it's /var/lib/jenkins that should mostly matter
      • 2013-11-06 31023, 2013

      • ruaok
        I'm in and looking at the dir.
      • 2013-11-06 31032, 2013

      • ianmcorvidae
        okay
      • 2013-11-06 31044, 2013

      • ruaok
        I'm trying to get an idea how much data this is.
      • 2013-11-06 31059, 2013

      • ianmcorvidae
        I'm copying what I can to lenny as we speak, using http://www.clausconrad.com/blog/backup-jenkins-co… as a model for what exactly needs preserving
      • 2013-11-06 31033, 2013

      • ianmcorvidae
        some of it's giving me permissions issues , but
      • 2013-11-06 31008, 2013

      • ruaok
        no backups of the jenkins config
      • 2013-11-06 31039, 2013

      • ruaok
        ok, work to get what you can off it.
      • 2013-11-06 31051, 2013

      • ruaok
        then I'm going to turn it off.
      • 2013-11-06 31002, 2013

      • ruaok
        and ask dwni to put the mini into their fridge.
      • 2013-11-06 31025, 2013

      • ruaok
        with fat "Do not eat" note on it. :)
      • 2013-11-06 31057, 2013

      • reosarevok joined the channel
      • 2013-11-06 31005, 2013

      • ianmcorvidae
        last step of what that mentions in progress, as soon as hobbes figures out which way is up :)
      • 2013-11-06 31027, 2013

      • ruaok
        good luck with that.
      • 2013-11-06 31038, 2013

      • ianmcorvidae
        well, it's done fine with the rest
      • 2013-11-06 31041, 2013

      • ruaok
        anyways, my point was to put hobbes in the fridge so that drives can cool.
      • 2013-11-06 31047, 2013

      • ianmcorvidae
        yeah
      • 2013-11-06 31055, 2013

      • ruaok
        with a cold hobbes we can connect him again and pull more stuff off.
      • 2013-11-06 31002, 2013

      • ruaok
        but, please keep working
      • 2013-11-06 31010, 2013

      • ruaok
        once you're done, shut him down and ping me.
      • 2013-11-06 31032, 2013

      • ianmcorvidae
        will do
      • 2013-11-06 31040, 2013

      • ruaok
        thx
      • 2013-11-06 31047, 2013

      • ijabz joined the channel
      • 2013-11-06 31020, 2013

      • ianmcorvidae
        ruaok: shutting down now
      • 2013-11-06 31054, 2013

      • ianmcorvidae
        ruaok: slowly :P
      • 2013-11-06 31004, 2013

      • andreypopp joined the channel
      • 2013-11-06 31002, 2013

      • ruaok
        k, thanks
      • 2013-11-06 31037, 2013

      • MiX-MaN joined the channel
      • 2013-11-06 31045, 2013

      • murdos joined the channel
      • 2013-11-06 31048, 2013

      • ruaok
        ianmcorvidae: do you feel that we got enough stuff off the mini?
      • 2013-11-06 31004, 2013

      • ruaok
        should I ask them to put it into the fridge so we can make another pass at it tomorrow morning?
      • 2013-11-06 31020, 2013

      • ianmcorvidae
        ruaok: well, I only looked at jenkins -- I have no idea if there's other things that need looking at
      • 2013-11-06 31038, 2013

      • ianmcorvidae
        I think we have a fairly reconstructable jenkins backup
      • 2013-11-06 31041, 2013

      • ruaok
        ok, I'll have them pre-emptively put it into the frdge.
      • 2013-11-06 31044, 2013

      • ruaok
        ok, great.
      • 2013-11-06 31013, 2013

      • Guest50805 joined the channel
      • 2013-11-06 31016, 2013

      • alastairp joined the channel
      • 2013-11-06 31017, 2013

      • navap
        What's with "mini"?
      • 2013-11-06 31051, 2013

      • nikki
        presumably hobbes is on a mac mini
      • 2013-11-06 31003, 2013

      • navap
        heh really?
      • 2013-11-06 31008, 2013

      • ruaok
        yep.
      • 2013-11-06 31026, 2013

      • ruaok
        low power consumption and low space usage makes it fairly attractive for use in a colo.
      • 2013-11-06 31028, 2013

      • navap
        Is it just thrown on top in the rack?
      • 2013-11-06 31033, 2013

      • ruaok
        bottom.
      • 2013-11-06 31038, 2013

      • navap
        heh
      • 2013-11-06 31020, 2013

      • ianmcorvidae
        I've seen custom-made shelves that put four of them in a... 2U, I think
      • 2013-11-06 31041, 2013

      • ruaok
        only 4?
      • 2013-11-06 31052, 2013

      • ruaok
        I'd think you could get 6 or even 8.
      • 2013-11-06 31009, 2013

      • ianmcorvidae
        the thing I saw was only four on a row, to have ample cabling space and things, I think
      • 2013-11-06 31044, 2013

      • ianmcorvidae
        or possibly I mis-saw it, since this was a while ago :) point being it's a common thing for people to use as a server, anyway
      • 2013-11-06 31033, 2013

      • CallerNo6 believes ianmcorvidae. room for cables should never be underestimated :-)
      • 2013-11-06 31059, 2013

      • jdamcd joined the channel
      • 2013-11-06 31059, 2013

      • ruaok
        says the cable head. :)
      • 2013-11-06 31008, 2013

      • Freso
        ianmcorvidae: Ah, okay. Sorry for the noise then. :)
      • 2013-11-06 31043, 2013

      • Freso
        ruaok: Heh. I've actually looked at those (DDG) docs before and thought "someone" should make a MusicBrainz "hook". If only it wasn't Perl based... :(
      • 2013-11-06 31019, 2013

      • CallerNo6
        +1
      • 2013-11-06 31046, 2013

      • ianmcorvidae
        goodies and spice use javascript, fathead and longtail can be written in any language other than a very basic file with some metadata
      • 2013-11-06 31051, 2013

      • ianmcorvidae
        for example, the arch packages thing is python, since that seems to be the popular nearly-equivalent language :P
      • 2013-11-06 31043, 2013

      • MightyJay joined the channel
      • 2013-11-06 31009, 2013

      • jdamcd joined the channel
      • 2013-11-06 31058, 2013

      • Freso
        Hm.
      • 2013-11-06 31017, 2013

      • Freso
        I may poke more at it next week, with no more exams hanging over my head.
      • 2013-11-06 31054, 2013

      • Freso
        (Well, except for the one re-exam I think I'll have to make due with - but that's not until next year anyway.)
      • 2013-11-06 31047, 2013

      • jdamcd joined the channel
      • 2013-11-06 31056, 2013

      • misterswag joined the channel
      • 2013-11-06 31029, 2013

      • nikki
        nikki has changed the topic to: Arctic week! http://youtu.be/5Y6TqxLmxIo | http://musicbrainz.org/#devel | Agenda: MBS-3841 - relationships & url formatting (ocharles), spam accounts (nikki) | AcoustID replication (ex. MBS-6694)
      • 2013-11-06 31044, 2013

      • Freso
        Yes. We should definite spam accounts more.