#metabrainz

      • alastairp
        neither
      • 2018-03-15 07417, 2018

      • alastairp
        only potential issue here is that replication packets use row ids
      • 2018-03-15 07431, 2018

      • alastairp
        so we'd have to set those specifically too (not a huge problem)
      • 2018-03-15 07407, 2018

      • alastairp
        rsh7: are you following along?
      • 2018-03-15 07424, 2018

      • alastairp
        replication packets are the way that mirrors of the musicbrainz database keep up to date
      • 2018-03-15 07436, 2018

      • alastairp
        it's effectively a file that says "change this row in this table to this value"
      • 2018-03-15 07455, 2018

      • alastairp
        iliekcomputers is suggesting that if we keep the table structure exactly the same, we can just look in the replication packet file and see if the item being updated exists in our mirror, and if so, just copy it in
      • 2018-03-15 07411, 2018

      • alastairp
        this means that the musicbrainz update system takes care of the "last_updated" field for us
      • 2018-03-15 07458, 2018

      • alastairp
        ruaok: bitmap: confirm if you think this sounds like a reasonable idea?
      • 2018-03-15 07426, 2018

      • iliekcomputers
        me and rsh7 talked about this a bit, but we weren't sure how easy / hard it would be to implement
      • 2018-03-15 07449, 2018

      • alastairp
        remember: the reason that we want a copy of some musicbrainz metadata in the ab database is to be able to join against it in a single query to do filtering/stats
      • 2018-03-15 07411, 2018

      • alastairp
        because a second database would require us to do queries to two databases and join in python
      • 2018-03-15 07442, 2018
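
The single-query join described above can be sketched roughly as follows. This is only an illustration using an in-memory SQLite database; the table and column names (`lowlevel`, `recording`, `gid`, `mbid`) are stand-ins chosen for the example and are not taken from the actual AB or MB schemas.

```python
import sqlite3


def recordings_with_metadata(conn):
    """Return (mbid, name) for submissions whose metadata is in the local copy."""
    query = """
        SELECT l.mbid, r.name
          FROM lowlevel AS l
          JOIN recording AS r ON r.gid = l.mbid
      ORDER BY r.name
    """
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE lowlevel (mbid TEXT);             -- hypothetical AB submissions table
    CREATE TABLE recording (gid TEXT, name TEXT);  -- hypothetical copy of MB metadata
    INSERT INTO lowlevel VALUES ('aaa'), ('bbb');
    INSERT INTO recording VALUES ('aaa', 'Song A'), ('ccc', 'Song C');
""")

# Only 'aaa' exists on both sides, so the join returns a single row.
rows = recordings_with_metadata(conn)
```

With the metadata in a second database, the same result would need two queries and a merge in Python, which is the overhead being avoided here.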

      • iliekcomputers
        like the problem of showing only those release-groups that had a review in CB ferbncode was trying to solve
      • 2018-03-15 07407, 2018

      • alastairp
        the only potential problem is that the number of tables we have to copy becomes so big that it's as unwieldy to deal with as querying two databases
      • 2018-03-15 07424, 2018

      • alastairp
        s/the only/at least one/
      • 2018-03-15 07422, 2018

      • kartikeyaSh_ircc
        alastairp: can you tell me why we need the "updated" field in these tables (when will this be used?) https://github.com/metabrainz/messybrainz-server/…
      • 2018-03-15 07423, 2018

      • Freso
        alastairp, iliekcomputers: One problem with replication is that the db will always be behind, and there *will* be users that add something in MB and expect to use/see it in *B straight away.
      • 2018-03-15 07406, 2018

      • Freso
        (This has been an issue for AB previously too, and continues to be an issue with Picard/AcoustID interaction.)
      • 2018-03-15 07454, 2018

      • alastairp
        Freso: it's possible that we could special-case this: if an mbid exists in AB and is not in our metadata tables, do a ws lookup, if that returns data then show it along with an infobox
      • 2018-03-15 07421, 2018
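
A minimal sketch of the special case suggested above, assuming the local metadata can be modelled as a dict and the web service lookup as a callable that returns `None` when nothing is found. Both `local_metadata` and `ws_lookup` are hypothetical names for this example, not existing AB functions.

```python
def lookup_metadata(mbid, local_metadata, ws_lookup):
    """Return (metadata, from_webservice).

    from_webservice is True when the data came from the fallback lookup,
    which is when the page would show the infobox.
    """
    if mbid in local_metadata:
        return local_metadata[mbid], False
    data = ws_lookup(mbid)  # hypothetical web service call
    if data is not None:
        return data, True
    return None, False


# Stand-in data: one mbid already replicated locally, one only known to the ws.
local = {"aaa": {"name": "Song A"}}
remote = {"bbb": {"name": "Song B"}}

hit_local = lookup_metadata("aaa", local, remote.get)
hit_ws = lookup_metadata("bbb", local, remote.get)
miss = lookup_metadata("zzz", local, remote.get)
```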

      • bitmap
        alastairp: if the schema is a superset of mb's, you could probably use the scripts mb provides to just apply the packets? I think they might skip tables that don't exist, though if not it would be trivial to add an option allowing that
      • 2018-03-15 07423, 2018

      • sud0pl0x joined the channel
      • 2018-03-15 07449, 2018

      • samj1912
        bitmap, can I bug you with MB PR reviews? :p
      • 2018-03-15 07400, 2018

      • bitmap
        you may
      • 2018-03-15 07429, 2018

      • samj1912
        Election pages are done from my side
      • 2018-03-15 07449, 2018

      • samj1912
        Ditto with the stash move
      • 2018-03-15 07403, 2018

      • alastairp
        bitmap: right, the only other potential issue is that I don't want to store all MB rows in our local tables
      • 2018-03-15 07442, 2018

      • alastairp
        e.g., we only have 3M unique recordings in AB, no reason to store 18M recording rows
      • 2018-03-15 07407, 2018

      • bitmap
        it definitely already skips UPDATEs and DELETEs for rows that don't exist, but I suppose you want to filter out inserts too
      • 2018-03-15 07432, 2018

      • alastairp
        right
      • 2018-03-15 07408, 2018

      • alastairp
        (I have no idea what timezone you Indian crowd are on)
      • 2018-03-15 07421, 2018

      • alastairp
        if it was 2am here I'd be asleep a long time ago
      • 2018-03-15 07454, 2018

      • rsh7
        it's 1:22 am 😅
      • 2018-03-15 07405, 2018

      • ferbncode[m]1
        :D
      • 2018-03-15 07417, 2018

      • bitmap
        all inserts, or do you want to apply a test to each row to determine if it should be added? (sorry I didn't catch what the exact use case was)
      • 2018-03-15 07419, 2018

      • iliekcomputers
        late sleepers
      • 2018-03-15 07421, 2018

      • alastairp
        all inserts, I think
      • 2018-03-15 07428, 2018

      • iliekcomputers
        so a good way to update the copied data would be:
      • 2018-03-15 07431, 2018

      • alastairp
        we only want to insert a row when that mbid is added to acousticbrainz
      • 2018-03-15 07450, 2018

      • alastairp
        which I think we'd do with our own script that runs against an existing MBS slave db
      • 2018-03-15 07406, 2018
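
One way the "insert only when an mbid is added" script could look, as a rough sketch. It uses two in-memory SQLite databases as stand-ins for the MB mirror and the AB database, and a simplified `recording` table with hypothetical columns. The row `id` is copied unchanged, since replication packets refer to rows by id.

```python
import sqlite3


def copy_recording_on_submit(mirror, ab, mbid):
    """Copy one recording row from the mirror into AB; return True if copied."""
    if ab.execute("SELECT 1 FROM recording WHERE gid = ?", (mbid,)).fetchone():
        return False  # already present locally
    row = mirror.execute(
        "SELECT id, gid, name FROM recording WHERE gid = ?", (mbid,)
    ).fetchone()
    if row is None:
        return False  # the mirror does not know this mbid (yet)
    # Keep the mirror's row id so later id-based updates can find the row.
    ab.execute("INSERT INTO recording (id, gid, name) VALUES (?, ?, ?)", row)
    ab.commit()
    return True


schema = "CREATE TABLE recording (id INTEGER PRIMARY KEY, gid TEXT, name TEXT)"
mirror = sqlite3.connect(":memory:")
ab = sqlite3.connect(":memory:")
mirror.execute(schema)
ab.execute(schema)
mirror.execute("INSERT INTO recording VALUES (42, 'aaa', 'Song A')")

first = copy_recording_on_submit(mirror, ab, "aaa")    # copied
second = copy_recording_on_submit(mirror, ab, "aaa")   # already there
unknown = copy_recording_on_submit(mirror, ab, "zzz")  # not in the mirror
copied_rows = ab.execute("SELECT id, gid, name FROM recording").fetchall()
```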

      • iliekcomputers
        1. download the replication packets on intervals
      • 2018-03-15 07426, 2018

      • iliekcomputers
        2. just apply them (without much modification because it skips rows that don't exist already?)
      • 2018-03-15 07435, 2018

      • iliekcomputers
        seems simple enough?
      • 2018-03-15 07450, 2018

      • iliekcomputers
        or am i missing something?
      • 2018-03-15 07422, 2018

      • bitmap
        we could add a couple options to LoadReplicationChanges to skip tables that don't exist, and skip inserts
      • 2018-03-15 07423, 2018
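
The filtering discussed here (skip tables that don't exist locally, skip inserts, and ignore updates or deletes for rows that aren't present) can be sketched as below. The change records are plain dicts invented for this example; they do not reflect the real replication packet format or the options of LoadReplicationChanges.

```python
def filter_changes(changes, local_tables, existing_ids):
    """Yield only the changes that apply to a partial copy of the database."""
    for change in changes:
        table = change["table"]
        if table not in local_tables:
            continue  # table is not replicated locally
        if change["op"] == "insert":
            continue  # rows are only added when an mbid is submitted
        if change["id"] not in existing_ids.get(table, set()):
            continue  # update/delete for a row we never copied
        yield change


# Hypothetical packet contents and local state.
changes = [
    {"table": "recording", "op": "update", "id": 1},
    {"table": "recording", "op": "insert", "id": 5},
    {"table": "l_url_url", "op": "update", "id": 1},
    {"table": "recording", "op": "delete", "id": 9},
]
local_tables = {"recording"}
existing_ids = {"recording": {1, 2, 3}}

kept = list(filter_changes(changes, local_tables, existing_ids))
```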

      • alastairp
        rsh7: it sounds like the tools that we have available are going to let us do it this way. update your proposal to cover the results of this discussion
      • 2018-03-15 07432, 2018

      • alastairp
        the "difficult" step will be to decide which tables to replicate
      • 2018-03-15 07440, 2018

      • alastairp
        see if you can come up with a list
      • 2018-03-15 07457, 2018

      • alastairp
        iliekcomputers: do you know your way around the mb database?
      • 2018-03-15 07420, 2018

      • iliekcomputers
        alastairp: not very well
      • 2018-03-15 07429, 2018

      • alastairp
        great, you can help :)
      • 2018-03-15 07439, 2018

      • iliekcomputers
        I mean, I have a basic idea, but I wouldn't be pop quiz ready
      • 2018-03-15 07442, 2018

      • alastairp
        and on that note, given that it's 9pm, I'm going to bed
      • 2018-03-15 07449, 2018

      • alastairp
        have a good night/morning all
      • 2018-03-15 07451, 2018

      • iliekcomputers
        (i don't know what l_url_url does)
      • 2018-03-15 07456, 2018

      • ferbncode[m]1
        alastairp: there will be a slave MB db connected to AB (or we can use WS too) and we update tables in AB whenever a new recording is added (inserts case). Right?
      • 2018-03-15 07407, 2018

      • alastairp
        yes
      • 2018-03-15 07413, 2018

      • iliekcomputers
        alastairp: good night 😴
      • 2018-03-15 07415, 2018

      • ferbncode[m]1
        It's empty :D
      • 2018-03-15 07441, 2018

      • iliekcomputers
        ferbncode[m]1: the idea is to use a similar connection to the one in CB to insert data into the AB database
      • 2018-03-15 07442, 2018

      • rsh7
        alastairp: Alright! I will update it.
      • 2018-03-15 07452, 2018

      • bitmap
        it links two urls together in another universe
      • 2018-03-15 07407, 2018

      • iliekcomputers
        and then use replication packets to update the db
      • 2018-03-15 07437, 2018

      • iliekcomputers
        bitmap: 🙇🏽‍♀️
      • 2018-03-15 07406, 2018

      • Freso
        iliekcomputers: `l_url_url`: [https://www.wikidata.org/wiki/Special:EntityPage/…] is the Wikidata item for [https://www.facebook.com/]
      • 2018-03-15 07446, 2018

      • Freso
        (Except no URL-URL relationships are defined in MB: https://musicbrainz.org/relationships/url-url )
      • 2018-03-15 07408, 2018

      • sud0pl0x has quit
      • 2018-03-15 07406, 2018

      • bitmap
        adhawkins: I reverted the commit that caused LCA-7, so it can be closed now (unless you still want to fix the code to carry over the query params, just to future-proof things)
      • 2018-03-15 07406, 2018

      • BrainzBot
        LCA-7: No longer able to fetch index.json https://tickets.metabrainz.org/browse/LCA-7
      • 2018-03-15 07438, 2018

      • bitmap
        and about cors headers for index.json, they were testing changes to make that happen yesterday, so I guess they should be deploying it any day now
      • 2018-03-15 07419, 2018

      • bitmap
        once that's done we can close CAA-4
      • 2018-03-15 07419, 2018

      • BrainzBot
        CAA-4: Enable CORS (Cross-Origin Resource Sharing) https://tickets.metabrainz.org/browse/CAA-4
      • 2018-03-15 07423, 2018

      • github joined the channel
      • 2018-03-15 07423, 2018

      • github
        [listenbrainz-server] paramsingh opened pull request #381: LB-329: Add supported keys to Influx rows explicitly (master...spotify-id-bug) https://git.io/vxqJm
      • 2018-03-15 07423, 2018

      • github has left the channel
      • 2018-03-15 07420, 2018

      • iliekcomputers
        ruaok: if you get a minute or two, could you review https://github.com/metabrainz/critiquebrainz/pull…, it is a small PR fixing cron in CB, mostly taken from LB
      • 2018-03-15 07422, 2018

      • ruaok clicks approve
      • 2018-03-15 07431, 2018

      • iliekcomputers
        😂
      • 2018-03-15 07408, 2018

      • iliekcomputers
        thanks
      • 2018-03-15 07411, 2018

      • ruaok
        :)
      • 2018-03-15 07425, 2018

      • ruaok
        how flexible is your schedule next week?
      • 2018-03-15 07430, 2018

      • iliekcomputers
        depends, tbh. I'd like to be back in college for the weekend but other than that, I could make weekdays work most probably.
      • 2018-03-15 07452, 2018

      • iliekcomputers understands that it should probably be the opposite
      • 2018-03-15 07456, 2018

      • iliekcomputers
        🤷🏽
      • 2018-03-15 07404, 2018

      • ruaok
        made perfect sense to me.
      • 2018-03-15 07410, 2018

      • iliekcomputers
        lol
      • 2018-03-15 07435, 2018

      • ruaok
        I'd like to see if we can still meet up. things are still up in the air. but we certainly lost the chance to meet samj1912
      • 2018-03-15 07418, 2018

      • iliekcomputers
        when do you plan to come? I'd love to meet up still
      • 2018-03-15 07438, 2018

      • ruaok
        with some luck, I should arrive on the 21st. same flights.
      • 2018-03-15 07401, 2018

      • ruaok
        k, I think a plan will congeal tomorrow.
      • 2018-03-15 07423, 2018

      • iliekcomputers
        ruaok: 21st should be okay :)
      • 2018-03-15 07445, 2018

      • ruaok
        maybe 22nd is better so I can get some sleep. :)
      • 2018-03-15 07401, 2018

      • ruaok
        and there will be no chhavi_ to chase me to sunrise yoga.
      • 2018-03-15 07404, 2018

      • ruaok
        (phew)
      • 2018-03-15 07442, 2018

      • samj1912
        🤣
      • 2018-03-15 07407, 2018

      • iliekcomputers
        ferbncode[m]1: could probably chase you instead, he wakes up early too ;)
      • 2018-03-15 07420, 2018

      • ruaok
        gah!
      • 2018-03-15 07450, 2018

      • ruaok books a Delhi hotel through the weekend
      • 2018-03-15 07426, 2018

      • ferbncode[m]1
        iliekcomputers: :D
      • 2018-03-15 07411, 2018

      • Leo_Verto
        "he wakes up early too"
      • 2018-03-15 07418, 2018

      • Leo_Verto
        "current time in India is 2:39 am"
      • 2018-03-15 07418, 2018

      • Leo_Verto
        do you just not sleep, ferbncode ?
      • 2018-03-15 07439, 2018

      • ruaok
        thank you Leo_Verto, take a bow. I was thinking the same.
      • 2018-03-15 07448, 2018

      • ruaok
        how about you ferbncode[m]1? meet up next week?
      • 2018-03-15 07407, 2018

      • ruaok
        (don't book anything until I've checked in!!)
      • 2018-03-15 07425, 2018

      • ferbncode[m]1
        ruaok: yes, works for me perfectly
      • 2018-03-15 07437, 2018

      • ferbncode[m]1
        :)
      • 2018-03-15 07406, 2018

      • ruaok
        k, a new plan will be hatched tomorrow.
      • 2018-03-15 07418, 2018

      • ferbncode[m]1
        Leo_Verto: I really need to fix my sleep schedule :D
      • 2018-03-15 07424, 2018

      • iliekcomputers
        🎉
      • 2018-03-15 07456, 2018

      • iliekcomputers
        hopefully the plan involves food 😄
      • 2018-03-15 07424, 2018

      • ruaok
        that's the plan, no?
      • 2018-03-15 07431, 2018

      • ruaok
        is there anything other than food?
      • 2018-03-15 07445, 2018

      • iliekcomputers
        maybe drinks ;)
      • 2018-03-15 07411, 2018

      • ferbncode[m]1
        Hehe :D
      • 2018-03-15 07440, 2018

      • ruaok
        I certainly owe you a few.
      • 2018-03-15 07450, 2018

      • ruaok
        at least three for the data dumps torture.
      • 2018-03-15 07457, 2018

      • ruaok
        I'm very happy with the result though.
      • 2018-03-15 07409, 2018

      • iliekcomputers
        me too :)
      • 2018-03-15 07422, 2018

      • ruaok
        incremental dumps are going to be a bit easier too.
      • 2018-03-15 07436, 2018

      • ruaok
        a lot of questions have been answered
      • 2018-03-15 07433, 2018

      • iliekcomputers
        next thing on the list.
      • 2018-03-15 07448, 2018

      • iliekcomputers
        I'd like to discuss the BQ table partitioning when we meet too
      • 2018-03-15 07421, 2018

      • ruaok
        great idea.
      • 2018-03-15 07430, 2018

      • ruaok
        we should find a whiteboard for that.
      • 2018-03-15 07447, 2018

      • ruaok
        in fact, we should really have a talk about LB charts.
      • 2018-03-15 07456, 2018

      • ruaok
        and really. we need chhavi for that.
      • 2018-03-15 07421, 2018

      • ruaok
        I think if we talk about what graphs we want, we get use cases for BQ.
      • 2018-03-15 07432, 2018

      • ruaok
        once we have use cases for BQ, we can talk about what the structure needs to be
      • 2018-03-15 07459, 2018

      • ruaok
        and we need her brainz on which d3 graph types would be suitable for which graphs
      • 2018-03-15 07447, 2018

      • iliekcomputers
        hmm, nice train of thought. Maybe we could do something over Hangouts, if chhavi can't be physically there
      • 2018-03-15 07403, 2018

      • iliekcomputers
        I like the idea of going from graphs to schema, nice.
      • 2018-03-15 07444, 2018

      • ruaok
        I'll offer whatever assistance I can to make it worthwhile for her to come.
      • 2018-03-15 07408, 2018

      • ruaok
        I think this discussion is key for the next 6 months of LB
      • 2018-03-15 07419, 2018

      • iliekcomputers
        indeed
      • 2018-03-15 07452, 2018

      • ruaok
        and bonus round, perhaps over drinks, how do we kick start the recommendation engine?
      • 2018-03-15 07423, 2018

      • ferbncode[m]1
        iliekcomputers, ruaok : Probably we can also discuss something regarding Recommend-brainz? I am really excited about that too.
      • 2018-03-15 07428, 2018

      • iliekcomputers
        yes please
      • 2018-03-15 07432, 2018

      • ruaok
        rent a machine with too much ram (tm) for an hour or two and load LB into reggomend?
      • 2018-03-15 07435, 2018

      • ferbncode[m]1
        Oh, :)
      • 2018-03-15 07444, 2018

      • iliekcomputers
        ferbncode[m]1: ruaok: great minds ;)
      • 2018-03-15 07444, 2018

      • ruaok
        yes, please.
      • 2018-03-15 07452, 2018

      • ruaok
        I see, mini summit shaping up.
      • 2018-03-15 07406, 2018

      • iliekcomputers
        \o/
      • 2018-03-15 07428, 2018

      • ruaok
        with uni on holidays does that mean we could sneak into an empty class somewhere and commandeer a room in the name of open source?
      • 2018-03-15 07400, 2018

      • iliekcomputers
        I have my doubts on how feasible that would be.
      • 2018-03-15 07418, 2018

      • ruaok
        bummer.
      • 2018-03-15 07439, 2018

      • ruaok
        that would be a safe bet at my old uni. just look like students and be busy.
      • 2018-03-15 07446, 2018

      • ruaok
        almost as good as a white lab coat
      • 2018-03-15 07402, 2018

      • reosarevok
        ruaok: I bet it doesn't work in Spain because the "be busy" thing is a dead giveaway? :p
      • 2018-03-15 07422, 2018

      • ruaok
        lol