#metabrainz

/

      • ruaok
        let me try and give an example.
      • 2017-12-22 35648, 2017

      • ruaok
        you familiar with ram drives and caching in the linux kernel?
      • 2017-12-22 35600, 2017

      • ruaok
        just superficially understanding the concepts is all you need to know.
      • 2017-12-22 35612, 2017

      • samj1912
        yup
      • 2017-12-22 35630, 2017

      • ruaok
        often times there would be things I want to go fast.
      • 2017-12-22 35643, 2017

      • ruaok
        like, load an index and rotate it -- perfect example.
      • 2017-12-22 35658, 2017

      • ruaok
        I know, I'll get a ram disk to do that. that guarantees that everything stays in ram in will be fast.
      • 2017-12-22 35609, 2017

      • ruaok
        I tried it and it never was fast.
      • 2017-12-22 35616, 2017

      • ruaok
        because I could not see the whole picture.
      • 2017-12-22 35639, 2017

      • ruaok
        instead, letting linux do the caching automatically by literally doing nothing special, I always got the best performance.
      • 2017-12-22 35652, 2017

      • ruaok
        because linux knows what it needs and can see the whole picture, instantaneously.
      • 2017-12-22 35648, 2017

      • samj1912 nods
      • 2017-12-22 35649, 2017

      • ruaok
        the point I am getting to is this: not sure you need to have a "large indexer backup". I think a secondary or tertiary indexer is all you need.
      • 2017-12-22 35659, 2017

      • samj1912
        yeah I guess
      • 2017-12-22 35619, 2017

      • samj1912
        talking to you I realized, we could have 2 instances, 1 with an index limit and one without
      • 2017-12-22 35629, 2017

      • ruaok
        and even then there is a fine line -- this is about the old "premature optimization" problem.
      • 2017-12-22 35640, 2017

      • samj1912
        yes
      • 2017-12-22 35658, 2017

      • ruaok
        and it is tricky. clearly we want to avoid having indexing break down because of bottlenecks.
      • 2017-12-22 35608, 2017

      • samj1912
        let me make an artificial call for indexing VA?
      • 2017-12-22 35614, 2017

      • ruaok
        so we want a certain amount of thinking of optimization, but not too much.
      • 2017-12-22 35628, 2017

      • ruaok
        indexing VA == reindex
      • 2017-12-22 35643, 2017

      • ruaok
        should things like that ever be needed, just reindex.
      • 2017-12-22 35609, 2017

      • ruaok
        "1 with an index limit and one without"
      • 2017-12-22 35613, 2017

      • ruaok
        not sure even that is needed.
      • 2017-12-22 35626, 2017

      • ruaok
        ensure that having more than 1 indexer works.
      • 2017-12-22 35631, 2017

      • ruaok
        and leave it at that.
      • 2017-12-22 35648, 2017

      • ruaok
        when real usecases come, we can throw more indexers at the problem.
      • 2017-12-22 35607, 2017

      • ruaok
        and if those are a problem, we've identified an actual problematic use case that we should optimize for.
      • 2017-12-22 35617, 2017

      • ruaok
        but that day may never come, and that is fine too.
      • 2017-12-22 35621, 2017

      • ruaok
        so, come to a close on what you're doing with the indexer, fixing bugs, but don't plan too many more improvements.
      • 2017-12-22 35630, 2017

      • ruaok
        keep watching the logs and then soon let's test this on beta.
      • 2017-12-22 35645, 2017

      • ruaok
        and perhaps reward yourself by playing with picard for a little while. :)
      • 2017-12-22 35620, 2017

      • samj1912
        Hehe okay :p
      • 2017-12-22 35634, 2017

      • samj1912
        I was thinking of taking a crack at MB instead :p
      • 2017-12-22 35616, 2017

      • padraic joined the channel
      • 2017-12-22 35644, 2017

      • Ashish_ joined the channel
      • 2017-12-22 35619, 2017

      • naiveai joined the channel
      • 2017-12-22 35657, 2017

      • naiveai has quit
      • 2017-12-22 35601, 2017

      • drsaunders has quit
      • 2017-12-22 35605, 2017

      • MusicbrainzB0T joined the channel
      • 2017-12-22 35641, 2017

      • MusicbrainzB0T1 has quit
      • 2017-12-22 35619, 2017

      • Ashish_ has quit
      • 2017-12-22 35635, 2017

      • culinko has left the channel
      • 2017-12-22 35625, 2017

      • zas
        ruaok: current search-server on prince is eating a lot of disk space, i had a look; and it appears there are plenty of /tmp/jetty/jetty-0.0.0.0-8080-ROOT.war-_-any-<somenumber>.dir (12455 atm), each of 30M or more. I'll prolly just rebuild a fresh container, but any idea why ?
      • 2017-12-22 35623, 2017

      • zas
        those are leftover on each index reloading...
      • 2017-12-22 35644, 2017

      • ruaok
        when the container restarts, I think jetty is rebuilding shit and not cleaning up.
      • 2017-12-22 35605, 2017

      • ruaok
        given how close samj1912 is, can we just mitigate it and hope for a migration soon?
      • 2017-12-22 35618, 2017

      • zas
        yes, i just clean leftovers
      • 2017-12-22 35626, 2017

      • ruaok
        great, thanks.
      • 2017-12-22 35607, 2017

      • zas
        and those containers logs are big too, truncating as well
      • 2017-12-22 35640, 2017

      • samj1912
        ruaok: can you take a look at https://github.com/metabrainz/sir/pull/67, although I have been running this PR since last night, there might be some things I might've overlooked
      • 2017-12-22 35614, 2017

      • ruaok
        let me find how I am going to be blocked on deploying LB today, before I do sir stuff.
      • 2017-12-22 35625, 2017

      • samj1912
        okay
      • 2017-12-22 35657, 2017

      • ruaok
        I've been doing 10 minutes of work on LB, get stuck, then submit PR, wait, have attention drawn away, repeat.
      • 2017-12-22 35658, 2017

      • zas
        reclaimed 70% of disk space on prince & boingo ...
      • 2017-12-22 35605, 2017

      • ruaok
        zas: great.
      • 2017-12-22 35649, 2017

      • samj1912
        zas: any idea why https://stats.metabrainz.org/dashboard/db/log-fil… doesn't work anymore?
      • 2017-12-22 35650, 2017

      • zas
        samj1912: please take care of leftovers for new sir/solr stuff ;)
      • 2017-12-22 35603, 2017

      • samj1912
        I cleaned up the logs
      • 2017-12-22 35607, 2017

      • samj1912
        and reduced the verbosity
      • 2017-12-22 35625, 2017

      • Ashish_ joined the channel
      • 2017-12-22 35627, 2017

      • zas
        i do, but i didn't fix it yet, something has changed in telegraf stuff
      • 2017-12-22 35611, 2017

      • ruaok
        ok, all websites are about to go down!
      • 2017-12-22 35618, 2017

      • samj1912
        lol :P
      • 2017-12-22 35620, 2017

      • ruaok
        ok, hopefully not.
      • 2017-12-22 35636, 2017

      • iliekcomputers
        😟
      • 2017-12-22 35659, 2017

      • samj1912
        zas: what's the current index rotation frequency for the search server?
      • 2017-12-22 35638, 2017

      • ruaok
        all indexes but recording as fast as possible.
      • 2017-12-22 35642, 2017

      • zas
        every 40 mins
      • 2017-12-22 35643, 2017

      • ruaok
        recording once every 24 hours.
      • 2017-12-22 35607, 2017

      • zas
        but for recordings, 2 hours
      • 2017-12-22 35635, 2017

      • zas
      • 2017-12-22 35607, 2017

      • samj1912
        zas: is it possible to set something like this for sir as well? https://stats.metabrainz.org/dashboard/db/alerts-…
      • 2017-12-22 35616, 2017

      • samj1912
        ill add a ticket?
      • 2017-12-22 35611, 2017

      • zas
        it uses rabbitmq right ?
      • 2017-12-22 35617, 2017

      • samj1912
        yup
      • 2017-12-22 35623, 2017

      • samj1912
        on the same server as CAA
      • 2017-12-22 35629, 2017

      • samj1912
        same container infact
      • 2017-12-22 35609, 2017

      • zas
        which queues ?
      • 2017-12-22 35637, 2017

      • samj1912
        search.index, search.delete, search.failed, search.retry
      • 2017-12-22 35656, 2017

      • zas
      • 2017-12-22 35625, 2017

      • samj1912
        thanks o/
      • 2017-12-22 35657, 2017

      • zas
        alerts caa is for alerts mainly, and uses partly metrics from rabbitmq
      • 2017-12-22 35612, 2017

      • zas
        rabbitmq stats are more generic
      • 2017-12-22 35633, 2017

      • zas
        if you want specific stats on sir behavior, just ask me
      • 2017-12-22 35654, 2017

      • samj1912
        cool
      • 2017-12-22 35602, 2017

      • ruaok
      • 2017-12-22 35620, 2017

      • ruaok
        schema needs updating, and the other beta containers are not running yet.
      • 2017-12-22 35624, 2017

      • ruaok
        but still, progress!
      • 2017-12-22 35601, 2017

      • iliekcomputers
        🎉 🎉
      • 2017-12-22 35608, 2017

      • iliekcomputers
        slow and steady :)
      • 2017-12-22 35652, 2017

      • samj1912
      • 2017-12-22 35604, 2017

      • skGCI joined the channel
      • 2017-12-22 35604, 2017

      • samj1912
        is this supposed to happen
      • 2017-12-22 35623, 2017

      • ruaok
        expected, not supposed. :)
      • 2017-12-22 35645, 2017

      • iliekcomputers
        samj1912: schema changes need to be made still
      • 2017-12-22 35607, 2017

      • samj1912
        ah okay :)
      • 2017-12-22 35630, 2017

      • ruaok
        iliekcomputers: remind me, which schema changes do we need to apply?
      • 2017-12-22 35634, 2017

      • ruaok
        07-03 onward?
      • 2017-12-22 35618, 2017

      • alastairp
        for MeB, AB, LB, is it worth working out a way to do nice schema upgrades?
      • 2017-12-22 35621, 2017

      • KassOtsimine has quit
      • 2017-12-22 35630, 2017

      • alastairp
        I mean, knowing what things have been applied, and what needs to be applied to make a release
      • 2017-12-22 35635, 2017

      • iliekcomputers
        ruaok: let me check
      • 2017-12-22 35644, 2017

      • KassOtsimine joined the channel
      • 2017-12-22 35644, 2017

      • KassOtsimine has quit
      • 2017-12-22 35644, 2017

      • KassOtsimine joined the channel
      • 2017-12-22 35607, 2017

      • alastairp
        [in a way that is consistent over all of our flask apps]
      • 2017-12-22 35620, 2017

      • ruaok
        alastairp: we've got the beginning of it in LB, with the admin/sql/updates dir
      • 2017-12-22 35651, 2017

      • iliekcomputers
        ruaok: 2017-07-03 onward seems correct to me
      • 2017-12-22 35656, 2017

      • alastairp
        sure, we have that in AB too
      • 2017-12-22 35613, 2017

      • alastairp
        my point is that you're all having a chat about which migration you think should be applied first
      • 2017-12-22 35620, 2017

      • alastairp
        it'd be nice to have a record of that somewhere
      • 2017-12-22 35627, 2017

      • alastairp
        I've run into the same issues in AB
      • 2017-12-22 35637, 2017

      • ruaok
        yes and there isn't a comprehensive script that does it. I need to run them by hand.
      • 2017-12-22 35640, 2017

      • alastairp
        especially if there are 2 independent PRs with overlapping dated migrations
      • 2017-12-22 35659, 2017

      • alastairp
        anyway, might be something to consider
      • 2017-12-22 35637, 2017

      • iliekcomputers
        there is a db.SCHEMA_VERSION in the code, maybe we keep a table (schema_version, date_updated) to keep track ?
      • 2017-12-22 35600, 2017

      • ruaok
        iliekcomputers: that would be good.
      • 2017-12-22 35600, 2017

      • iliekcomputers
        and a script which checks the schema version and applies all the updates that haven't been applied yet
      • 2017-12-22 35604, 2017

      • alastairp
        yeah, that might be a good start
      • 2017-12-22 35611, 2017

      • alastairp
        I made a start on almost exactly that for AB
      • 2017-12-22 35613, 2017

      • ruaok
        alastairp: yes, that is exactly the case which is causing me to ask. :)
      • 2017-12-22 35615, 2017

      • alastairp
        but I think I made it too complex
      • 2017-12-22 35632, 2017

      • ruaok has a backup of the PG db now
      • 2017-12-22 35646, 2017

      • alastairp
      • 2017-12-22 35648, 2017

      • alastairp
        I like this
      • 2017-12-22 35612, 2017

      • ruaok
        ok, yes. indeed. nice. :)
      • 2017-12-22 35622, 2017

      • alastairp
        we had a situation in AB where migration required 1 sql file, 1 py file, 1 sql file for a single migration
      • 2017-12-22 35620, 2017

      • alastairp
        also, not sure of the case in LB, but in AB we have public and private tables, so theoretically we could have 2 schema numbers?? I had a bit of a chat with bitmap about how MB does that, but I didn't come to a good solution
      • 2017-12-22 35631, 2017

      • ruaok
        iliekcomputers: schema changes applied fine.
      • 2017-12-22 35605, 2017

      • ruaok
        2 schema numbers sounds nothing but trouble.
      • 2017-12-22 35601, 2017

      • ruaok
        iliekcomputers: I should be able to trigger a dump of all the data now, right?
      • 2017-12-22 35605, 2017

      • ruaok
        let me do that before I move on.
      • 2017-12-22 35634, 2017

      • iliekcomputers
        ruaok: `python manage.py dump create` would work
      • 2017-12-22 35639, 2017

      • iliekcomputers
        ideally
      • 2017-12-22 35649, 2017

      • alastairp
        ruaok: yeah, 2 does
      • 2017-12-22 35610, 2017

      • ruaok
        iliekcomputers: do we have a means for copying the dumps out of the containers yet? :)
      • 2017-12-22 35621, 2017

      • iliekcomputers
        ruaok: no, not yet
      • 2017-12-22 35636, 2017

      • ruaok
        that should be a next PR then.
      • 2017-12-22 35647, 2017

      • alastairp
        isn't that "just" a docker volume?
      • 2017-12-22 35607, 2017

      • ruaok
        could be, but it isn't setup yet.
      • 2017-12-22 35610, 2017

      • alastairp
        sure
      • 2017-12-22 35617, 2017

      • alastairp
        I put just in "" for a reason :)
      • 2017-12-22 35638, 2017

      • naiveai joined the channel
      • 2017-12-22 35651, 2017

      • ruaok
        ok, the dumper is trying to connect to the wrong DB.
      • 2017-12-22 35609, 2017

      • ruaok
        I suspect that we duplicated the config file reading and it is reading the wrong config files.
      • 2017-12-22 35624, 2017

      • ruaok
        let me commit the patch I have now, then we can consolidate how we connect.
      • 2017-12-22 35603, 2017

      • iliekcomputers
        ruaok: if it's importing the configuration files directly (which it probably is) then that would happen, yes.
      • 2017-12-22 35623, 2017

      • D4RK-PH0ENiX has quit
      • 2017-12-22 35645, 2017

      • iliekcomputers
        Me and alastairp have worked out a good way to manage the configurations using the flask app