#musicbrainz-devel

/

      • ruaok
        ok, I'll go investigate it.
      • 2010-11-15 31914, 2010

      • ocharles
        at least the basics of that role
      • 2010-11-15 31917, 2010

      • murdos
        ocharles: ah, ok...
      • 2010-11-15 31919, 2010

      • ruaok
        ocharles: ok, can you make that change available to me?
      • 2010-11-15 31930, 2010

      • ruaok
        push a branch?
      • 2010-11-15 31930, 2010

      • ocharles
        ruaok: I think it might have been part of my Fey rewrite stuff, so I dunno if it's much use
      • 2010-11-15 31937, 2010

      • ocharles
        I'll look into it
      • 2010-11-15 31940, 2010

      • ruaok
        ok.
      • 2010-11-15 31956, 2010

      • ruaok
        if it doesn't impact too much stuff, then I'll make that change and fix the iso_ change.
      • 2010-11-15 31909, 2010

      • ocharles
        And do a new data dump?
      • 2010-11-15 31910, 2010

      • ruaok
        then its FINITO. BASTA. SCHLUSS.
      • 2010-11-15 31913, 2010

      • ruaok
        yes.
      • 2010-11-15 31915, 2010

      • ruaok
        and reload test.
      • 2010-11-15 31918, 2010

      • ocharles
        ok, I'll do the isco change tomorrow
      • 2010-11-15 31931, 2010

      • ruaok
        ocharles: worry about your bug list.
      • 2010-11-15 31936, 2010

      • ocharles
        ok
      • 2010-11-15 31937, 2010

      • ruaok
        I'll just take care of that one change myself.
      • 2010-11-15 31943, 2010

      • ocharles
        sure, that makes sense
      • 2010-11-15 31947, 2010

      • ruaok
        in the interest of keeping the branches simple.
      • 2010-11-15 31951, 2010

      • ocharles nods
      • 2010-11-15 31955, 2010

      • ruaok
        its got a little fusterclicky last time.
      • 2010-11-15 31900, 2010

      • ruaok
        cliky?
      • 2010-11-15 31901, 2010

      • ocharles
        I cleared all my branches for that now that it's merged
      • 2010-11-15 31905, 2010

      • ruaok
        ok.
      • 2010-11-15 31910, 2010

      • ruaok
        moving on.
      • 2010-11-15 31914, 2010

      • ruaok
        the test server....
      • 2010-11-15 31920, 2010

      • ruaok
        one drive in the test server has failed.
      • 2010-11-15 31925, 2010

      • warp noticed that.
      • 2010-11-15 31931, 2010

      • ruaok
        DO NOT STORE ANYTHING IMPORTANT ON THE TEST SERVER.
      • 2010-11-15 31941, 2010

      • ruaok apologizes for shouting
      • 2010-11-15 31950, 2010

      • djce feels it's an important point,
      • 2010-11-15 31951, 2010

      • ruaok
        I am NOT (!) going to fix that drive.
      • 2010-11-15 31954, 2010

      • djce
        even when the drives work.
      • 2010-11-15 31900, 2010

      • ruaok
        djce: indeed!
      • 2010-11-15 31915, 2010

      • ruaok
        test should always be considered dispensable at any point in time.
      • 2010-11-15 31923, 2010

      • ruaok
        test is also an older server that is sucking a lot of power.
      • 2010-11-15 31940, 2010

      • ruaok
        and power is limited in mountain view (I'm guessing google gets all the power up there)
      • 2010-11-15 31949, 2010

      • ruaok
        so, we've been asked to decommision this server.
      • 2010-11-15 31902, 2010

      • djce
        timeline?
      • 2010-11-15 31917, 2010

      • ruaok
        we're either: purchasing a mac mini to run test, or purchasing a slice of a larger VM server.
      • 2010-11-15 31924, 2010

      • ruaok
        djce: unclear at this point in time.
      • 2010-11-15 31932, 2010

      • ruaok
        hopefully that will shake out soon.
      • 2010-11-15 31942, 2010

      • ruaok
        the mini option is $1k, the VM maybe $500.
      • 2010-11-15 31944, 2010

      • ocharles
        I think test is so low traffic a slice probably makes the most sense
      • 2010-11-15 31946, 2010

      • warp
        ruaok: if you intend to move to the cloud eventually, you might as well start with test.
      • 2010-11-15 31951, 2010

      • ruaok
        I'm not jazzed about either.
      • 2010-11-15 31957, 2010

      • ruaok
        ocharles: agreed.
      • 2010-11-15 31901, 2010

      • ruaok
        warp: its not cost effective.
      • 2010-11-15 31906, 2010

      • warp
        ruaok: agreed.
      • 2010-11-15 31919, 2010

      • ruaok
        both of the others are one time costs.
      • 2010-11-15 31931, 2010

      • ruaok
        whereas test is a recurring cost for a VERY low utlization server.
      • 2010-11-15 31948, 2010

      • ruaok
        I personally like the mini solution, but I dont very much like the cost of it.
      • 2010-11-15 31907, 2010

      • ocharles
        You could build the equiv of a mini for much less
      • 2010-11-15 31912, 2010

      • ruaok
        regardless, test may be unavailable for some period of time. I'm working to have that be as little as possible.
      • 2010-11-15 31922, 2010

      • ruaok
        ocharles: its about space AND power.
      • 2010-11-15 31931, 2010

      • ocharles
        and mini's are that efficient?
      • 2010-11-15 31933, 2010

      • ruaok
        I can't build something as small as a mini.
      • 2010-11-15 31936, 2010

      • ruaok
        ocharles: yes.
      • 2010-11-15 31941, 2010

      • ocharles
        interesting
      • 2010-11-15 31947, 2010

      • ruaok
        remember the setup of minis in the MB rack above MB?
      • 2010-11-15 31956, 2010

      • ocharles
        ya, I guess it has reason :)
      • 2010-11-15 31959, 2010

      • ruaok
        ding.
      • 2010-11-15 31903, 2010

      • ruaok
        ok, onward.
      • 2010-11-15 31903, 2010

      • pbryan
        Heh.
      • 2010-11-15 31922, 2010

      • ruaok
        the next issue is how we store edit data in the edit tables.
      • 2010-11-15 31935, 2010

      • ruaok
        XML was slow and is very verbose, which causes the DB size to go up.
      • 2010-11-15 31946, 2010

      • ruaok
        and makes it MUCH harder to keep the edit table in RAM.
      • 2010-11-15 31953, 2010

      • ruaok
        thus we need to move to something else.
      • 2010-11-15 31958, 2010

      • ruaok
        I personally don't care what.
      • 2010-11-15 31905, 2010

      • ruaok
        JSON and hstore are suggested.
      • 2010-11-15 31910, 2010

      • pbryan
        +1 to JSON
      • 2010-11-15 31914, 2010

      • ruaok
        hstore can be queried, JSON cannot.
      • 2010-11-15 31917, 2010

      • ocharles
        I vote json for the speed of serializing it
      • 2010-11-15 31920, 2010

      • warp
        the current xml has a lot of unneccesary bloat
      • 2010-11-15 31925, 2010

      • ruaok
        warp: +1
      • 2010-11-15 31932, 2010

      • ocharles
        I don't think we should even be considering querying because the entire schema is wrong, in my opinion
      • 2010-11-15 31948, 2010

      • ocharles
        We should just use the fastest store possible, which from what I see is JSON, or maybe YAML::XS
      • 2010-11-15 31954, 2010

      • warp
        so it's not just switching from xml to something else, it's also making sure only the neccesary bits are stored.
      • 2010-11-15 31957, 2010

      • ocharles
        my ยข2
      • 2010-11-15 31902, 2010

      • ruaok
        ocharles: murdos or luks pointed out that for reporting a table scan would be ok. just not for live queries.
      • 2010-11-15 31904, 2010

      • pbryan
        Depending on where you put JSON, it can be queried.
      • 2010-11-15 31908, 2010

      • pbryan
        Example: CouchDB.
      • 2010-11-15 31943, 2010

      • warp
        json sounds nice and simple, I'm not familiar with hstore and yaml::xs, so don't have much opinion on them.
      • 2010-11-15 31944, 2010

      • ocharles
        We could probably make the json queryable in Postgres with an extension too
      • 2010-11-15 31953, 2010

      • ocharles
        yaml::xs is just a c backend to generate yaml
      • 2010-11-15 31904, 2010

      • ruaok
        ocharles: hstore already has that extension.
      • 2010-11-15 31914, 2010

      • ruaok
        which is the reason why hstore was suggested.
      • 2010-11-15 31923, 2010

      • ocharles
        Right, but then we have to write a hstore serializer
      • 2010-11-15 31934, 2010

      • ruaok
        with is another dep.
      • 2010-11-15 31936, 2010

      • ocharles
        so either way, we write something it looks like
      • 2010-11-15 31951, 2010

      • ruaok
        do we have to write anything with json?
      • 2010-11-15 31901, 2010

      • ocharles
        Well, if you want to query into edit data, then probably
      • 2010-11-15 31906, 2010

      • ruaok wonders if murdos is present
      • 2010-11-15 31906, 2010

      • ocharles
        or we just do that querying perl side
      • 2010-11-15 31908, 2010

      • murdos
        ocharles: it's easier to write perl code than C code
      • 2010-11-15 31919, 2010

      • ocharles
        murdos: no, perl is too slow here
      • 2010-11-15 31929, 2010

      • ocharles
        that's why I chose JSON, because serialization is done in C
      • 2010-11-15 31931, 2010

      • pbryan
        It's easier to allow an optimized database to perform a query than perl (or C) code.
      • 2010-11-15 31931, 2010

      • ocharles
        and it's blazingly fast
      • 2010-11-15 31900, 2010

      • ocharles
        (this is respect to the edit migration)
      • 2010-11-15 31904, 2010

      • warp
        if we need to query that edit data... it's probably easier for us to write a serializer than a querying thingy for postgres.
      • 2010-11-15 31930, 2010

      • ruaok
        well, we're never going to query that table.
      • 2010-11-15 31939, 2010

      • ocharles
        but you'll want indexes if you want reports, right?
      • 2010-11-15 31943, 2010

      • ruaok
        nothing that does a seq scan on 10M rows can go live into production.
      • 2010-11-15 31959, 2010

      • ruaok
        but, for reporting where the table is scanned once per 24 hours is acceptable.
      • 2010-11-15 31912, 2010

      • ocharles
        For that, you could probably get away with doing it in Perl then too
      • 2010-11-15 31923, 2010

      • ocharles
        just grab all edits of type x, y, z, call new_from_row and check them there
      • 2010-11-15 31933, 2010

      • ocharles
        it's a bit messy, but it's probably the easiest solution that avoids us writing c
      • 2010-11-15 31952, 2010

      • ruaok
        ok, I'm hearing that more people prefer JSON.
      • 2010-11-15 31904, 2010

      • ruaok
        if JSON really bugs you, speak up now!
      • 2010-11-15 31920, 2010

      • ruaok
        the current code doesn't use JSON, does it?
      • 2010-11-15 31920, 2010

      • ocharles
        My other thought here...
      • 2010-11-15 31924, 2010

      • murdos
        I just find strange to store json in db;..
      • 2010-11-15 31950, 2010

      • ocharles
        murdos: it's just serialization. It's not json that,s wrong, it's the fact we have serialized data
      • 2010-11-15 31953, 2010

      • luks
        well, it's less strange than the mix we have right now :)
      • 2010-11-15 31901, 2010

      • ruaok
        luks: +1
      • 2010-11-15 31918, 2010

      • ocharles
        that is my problem. And I'm wondering if now would be the best time to address it, but I guess we don't have enough time..
      • 2010-11-15 31922, 2010

      • ocharles
        which is a shame
      • 2010-11-15 31934, 2010

      • ruaok
        ocharles: well, we're going to fix the edit system after NGS.
      • 2010-11-15 31938, 2010

      • ruaok
        lets deal with this issue then.
      • 2010-11-15 31945, 2010

      • ocharles
        alright
      • 2010-11-15 31949, 2010

      • ruaok
        so, json in the DB should be a time limited thing.
      • 2010-11-15 31955, 2010

      • ruaok
        (famous last words).
      • 2010-11-15 31957, 2010

      • warp
        lol
      • 2010-11-15 31904, 2010

      • ruaok
        so, JSON it is then.
      • 2010-11-15 31906, 2010

      • ocharles
        It'll never go away, I don't think, historic data will be too tricky
      • 2010-11-15 31910, 2010

      • pbryan
        \o/
      • 2010-11-15 31926, 2010

      • ruaok
        ocharles: is it JSON now? if not, can you make it json very soon?
      • 2010-11-15 31932, 2010

      • ocharles
        it is now
      • 2010-11-15 31937, 2010

      • ruaok
        ruaok has changed the topic to: agenda: collections/lists, replication packets, [got more?]
      • 2010-11-15 31942, 2010

      • ruaok
        ok, case closed.
      • 2010-11-15 31947, 2010

      • ruaok
        on to the next sticky topic.
      • 2010-11-15 31952, 2010

      • ruaok
        collections vs lists.
      • 2010-11-15 31903, 2010

      • pbryan
        Pls add to topic: works ACs.
      • 2010-11-15 31910, 2010

      • ruaok
        there were some interesting cases presented in the mailing list.
      • 2010-11-15 31924, 2010

      • ruaok
        but not anything that clearly stood out.
      • 2010-11-15 31940, 2010

      • ruaok
        murdos: I know you feel strongly about this and are willing to throw some time at this.
      • 2010-11-15 31900, 2010

      • ruaok
        I'm ok, with letting murdos drive this process and fix up the nomenclature to be more clear.
      • 2010-11-15 31913, 2010

      • ruaok
        murdos: are you still interested in working on that?
      • 2010-11-15 31928, 2010

      • warp
        warp has changed the topic to: agenda: collections/lists, replication packets, work ACs, [got more?]
      • 2010-11-15 31942, 2010

      • murdos
        all the feedback I get from users were that they understand 'collection' but not 'list'
      • 2010-11-15 31907, 2010

      • ruaok
        I think I prefer to change the nomenclature back to collections.
      • 2010-11-15 31917, 2010

      • ocharles
        I don't, but I do have more thoughts
      • 2010-11-15 31922, 2010

      • ruaok
        then when we actually add "list" support, then we should expose a new list concept.
      • 2010-11-15 31933, 2010

      • ocharles
        I think collection should be something that is on top of lists. A collection has a list, but it might have more data
      • 2010-11-15 31948, 2010

      • ocharles
        This could let client software do special stuff if it needs to know the user's collection
      • 2010-11-15 31949, 2010

      • ruaok
        for me its the concept of ordering.
      • 2010-11-15 31957, 2010

      • ruaok
        a list should have an ordering and they don't right now.
      • 2010-11-15 31908, 2010

      • ruaok
        thats why they should stay collections, IMHO.
      • 2010-11-15 31910, 2010

      • ocharles
        Want me to call it a set then? :)