#musicbrainz-devel

      • kuno agrees
      • 2015-02-10 04157, 2015

      • darthanubis joined the channel
      • 2015-02-10 04131, 2015

      • Mineo
        that seems unlikely at the moment - I'll probably have my last exam in a few weeks, so I could write my master's thesis over the summer
      • 2015-02-10 04129, 2015

      • ruaok
        nooooeos!
      • 2015-02-10 04138, 2015

      • ruaok
        we really need the SOLR stuff finished! :(
      • 2015-02-10 04142, 2015

      • Mineo
        https://wiki.musicbrainz.org/User:Mineo/SolrSearc… is the stuff you can currently search for on solr, which should cover everything on https://beta.musicbrainz.org/doc/Development/XML_…
      • 2015-02-10 04145, 2015

      • ruaok
        is your code deployable?
      • 2015-02-10 04140, 2015

      • Mineo
        the larger part that's currently missing is porting the analysis step to solr, but I think I've outlined that in an email last year
      • 2015-02-10 04110, 2015

      • Mineo
        it's deployable in that it produces valid xml if you search for stuff and getting data from the db into solr works fine
      • 2015-02-10 04146, 2015

      • ruaok
        deployable, but ... == not deployable.
      • 2015-02-10 04145, 2015

      • alastairp
        ruaok: let's talk about that further
      • 2015-02-10 04157, 2015

      • ruaok
        mentoring?
      • 2015-02-10 04100, 2015

      • alastairp
        yeah
      • 2015-02-10 04104, 2015

      • ruaok
        k.
      • 2015-02-10 04110, 2015

      • alastairp
        AB, as part of MB? in collaboration with here?
      • 2015-02-10 04153, 2015

      • ruaok
        well, UPF could apply, but that would be a kinda pain, I suspect.
      • 2015-02-10 04106, 2015

      • ruaok
        I'm happy to have AB under the metabrainz umbrella.
      • 2015-02-10 04115, 2015

      • Mineo
        well, I'm not sure what deployable means - if you want to change the production search over to it right now, then no, that wouldn't be a good idea. if you want to set it up on rika for example, so people can use it on their sandboxes, that would certainly work (the only change required in MBS is https://bitbucket.org/mineo/musicbrainz-server/co…)
      • 2015-02-10 04158, 2015

      • ruaok
        the former is what I really care about.
      • 2015-02-10 04121, 2015

      • ruaok
        sounds like the SOLR project needs to have a CB styled continuation project behind it.
      • 2015-02-10 04126, 2015

      • ruaok
        part coding/ part sysadmin
      • 2015-02-10 04136, 2015

      • kepstin-laptop
        could drop a 'devops' keyword in there to be all hip and cool and stuff.
      • 2015-02-10 04105, 2015

      • ruaok
      • 2015-02-10 04136, 2015

      • rk29 joined the channel
      • 2015-02-10 04136, 2015

      • JesseW joined the channel
      • 2015-02-10 04105, 2015

      • ijabz2 joined the channel
      • 2015-02-10 04148, 2015

      • ijabz2 joined the channel
      • 2015-02-10 04113, 2015

      • diana_olhovik_ joined the channel
      • 2015-02-10 04157, 2015

      • LordSputnik joined the channel
      • 2015-02-10 04158, 2015

      • LordSputnik
        damn, just as I was about to message him...
      • 2015-02-10 04104, 2015

      • Freso
        LordSputnik: rob [at mb] ;)
      • 2015-02-10 04143, 2015

      • LordSputnik
        Freso: nah, I've updated the JIRA ticket I was going to talk to him about, so no need now ;)
      • 2015-02-10 04121, 2015

      • ruaok joined the channel
      • 2015-02-10 04123, 2015

      • rk29 joined the channel
      • 2015-02-10 04113, 2015

      • yeeeargh joined the channel
      • 2015-02-10 04114, 2015

      • Freso
        LordSputnik: Btw, bb.o is 502'ing.
      • 2015-02-10 04124, 2015

      • LordSputnik
        Freso: yeah I noticed a minute ago
      • 2015-02-10 04130, 2015

      • LordSputnik
        looking into it
      • 2015-02-10 04139, 2015

      • Freso
        "Sorry, that means this server's off right now — check back another time or use the main site at http://musicbrainz.org!"
      • 2015-02-10 04151, 2015

      • Freso
        :)
      • 2015-02-10 04103, 2015

      • ianmcorvidae
        yeah, not writing custom 502 pages for non-MBS sandboxes :P
      • 2015-02-10 04108, 2015

      • Freso
        ;)
      • 2015-02-10 04132, 2015

      • LordSputnik
        Freso: hmm, it seems I actually stopped the server manually and forgot to restart it last night...
      • 2015-02-10 04138, 2015

      • LordSputnik
        fixed
      • 2015-02-10 04139, 2015

      • xram joined the channel
      • 2015-02-10 04126, 2015

      • chirlu` joined the channel
      • 2015-02-10 04119, 2015

      • alastairp
        Mineo: thanks for the comments on that ticket
      • 2015-02-10 04129, 2015

      • alastairp
        annoying that it’s not “easy”
      • 2015-02-10 04141, 2015

      • alastairp
        maybe this should push us towards requests again :-P
      • 2015-02-10 04123, 2015

      • chirlu`
        ianmcorvidae: The subscriptions emails failed again, this time for an SMTP problem (“Temporary local problem - please try later”).
      • 2015-02-10 04132, 2015

      • chirlu`
        a) Was this really just temporary?
      • 2015-02-10 04144, 2015

      • ianmcorvidae
        they regularly do that
      • 2015-02-10 04154, 2015

      • chirlu`
        b) Should I change the script to continue if sending one message fails?
      • 2015-02-10 04117, 2015

      • ianmcorvidae
        if it doesn't already do that then probably :/ I'd understood it already did that
      • 2015-02-10 04150, 2015

      • ianmcorvidae
        but I've barely touched the subscriptions stuff, I think you have a better idea than I do at this point
      • 2015-02-10 04140, 2015

      • chirlu`
        No, apparently such a problem results in an exception being thrown, and the script doesn’t catch it.
      • 2015-02-10 04146, 2015
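
A minimal sketch of option (b) from the exchange above: keep going when sending one message fails. It is written in Python for illustration only (the actual script is Perl), and `send_all`, `send_one`, and the `(recipient, body)` pairs are hypothetical names, not anything taken from the MusicBrainz code.

```python
import smtplib


def send_all(messages, send_one):
    """Try every message; collect failures instead of aborting the run.

    `messages` is an iterable of (recipient, body) pairs and `send_one` is
    any callable that raises smtplib.SMTPException on failure. Both are
    assumptions made for this sketch.
    """
    failed = []
    for recipient, body in messages:
        try:
            send_one(recipient, body)
        except smtplib.SMTPException as exc:
            # Record the failure and move on to the next recipient.
            failed.append((recipient, str(exc)))
    return failed
```

Returning the failures rather than only logging them leaves room for a retry pass at the end of the run, which may suit errors that really are temporary.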

      • ianmcorvidae
        it appears it was killed
      • 2015-02-10 04157, 2015

      • ianmcorvidae
        so the memory may still be the issue
      • 2015-02-10 04107, 2015

      • ianmcorvidae
        before the memory stuff was happening it would screw up on multiple things sending emails
      • 2015-02-10 04132, 2015

      • ianmcorvidae
        multiple temporary local problem/unroutable address/etc.
      • 2015-02-10 04151, 2015

      • ianmcorvidae
        so somehow that bit isn't killing the script
      • 2015-02-10 04108, 2015

      • ianmcorvidae
        but yeah, the one from this morning has the same /home/musicbrainz/musicbrainz-server/admin/cron/daily.sh: line 86: 14133 Killed
      • 2015-02-10 04105, 2015

      • chirlu`
        Hm, I’ll check closer what happens in case of an SMTP error.
      • 2015-02-10 04101, 2015

      • chirlu`
        Could we run it manually with “--dry-run --verbose”?
      • 2015-02-10 04137, 2015

      • ianmcorvidae
        sure
      • 2015-02-10 04144, 2015

      • ianmcorvidae
        starting that now
      • 2015-02-10 04116, 2015

      • ianmcorvidae
        it's already the thing using the most memory on the system, also. watching it to see if it drops at some point
      • 2015-02-10 04120, 2015

      • ianmcorvidae
        (right now 8%)
      • 2015-02-10 04143, 2015

      • ijabz2 joined the channel
      • 2015-02-10 04120, 2015

      • ianmcorvidae
        ah, no, apparently it finished before dying
      • 2015-02-10 04131, 2015

      • chirlu`
        As far as I understand, Perl never returns memory to the system.
      • 2015-02-10 04148, 2015

      • chirlu`
        It only reuses it internally if possible.
      • 2015-02-10 04113, 2015

      • ianmcorvidae
        right
      • 2015-02-10 04103, 2015

      • ianmcorvidae
        last night wouldn't have been the weekly, anyway
      • 2015-02-10 04116, 2015

      • ianmcorvidae
        shall I send you the log of the run? it seems to have gone through it and finished fine
      • 2015-02-10 04145, 2015

      • ruaok
        huh.
      • 2015-02-10 04100, 2015

      • ruaok
        maybe the invoking script has a resource limit set?
      • 2015-02-10 04141, 2015

      • ianmcorvidae
        hm, or it killed it
      • 2015-02-10 04148, 2015

      • chirlu`
        Or it’s the mailer (--dry-run avoids it).
      • 2015-02-10 04113, 2015

      • ianmcorvidae
        the end of dmesg currently has [8885062.052366] Killed process 27816 (perl) total-vm:1546348kB, anon-rss:1315228kB, file-rss:972kB
      • 2015-02-10 04119, 2015

      • chirlu`
        re: log, the interesting parts would probably just be the “End of batch” and “Completed” lines.
      • 2015-02-10 04132, 2015

      • chirlu`
        “End of batch” for the cache statistics.
      • 2015-02-10 04110, 2015

      • ianmcorvidae
        it didn't finish a batch.
      • 2015-02-10 04115, 2015

      • chirlu`
        1.5 GB total-vm looks rather much.
      • 2015-02-10 04124, 2015

      • ianmcorvidae
        so yeah, I think it got killed
      • 2015-02-10 04127, 2015

      • ianmcorvidae
        memory usage is really close to the limit on that system, I guess
      • 2015-02-10 04146, 2015

      • chirlu`
        Perhaps send me the log anyway, I could look how far it got within the first batch at least.
      • 2015-02-10 04111, 2015

      • ianmcorvidae
        sure. I'm poking around the system a bit to see if I can make the memory usage less nasty
      • 2015-02-10 04120, 2015

      • ianmcorvidae
        I don't think MBS should need 14GB of memory, so
      • 2015-02-10 04115, 2015

      • chirlu`
        But neither should the script need 1.5 GB just for one 1000-editor batch.
      • 2015-02-10 04123, 2015

      • ianmcorvidae
        yeah
      • 2015-02-10 04158, 2015

      • ianmcorvidae
        I think there were some extra starman workers lying around
      • 2015-02-10 04103, 2015

      • ianmcorvidae
        so I killed those, might help
      • 2015-02-10 04137, 2015

      • ianmcorvidae
        chirlu`: what email should I send to? PM if you prefer
      • 2015-02-10 04130, 2015

      • ianmcorvidae
        doing another test run as well with the system less memory-heavy, in case that provides more insight
      • 2015-02-10 04129, 2015

      • chirlu`
        I’ll wait for that, then. :) Otherwise any local part at chirlu de should work.
      • 2015-02-10 04137, 2015

      • ianmcorvidae
        k
      • 2015-02-10 04138, 2015

      • chirlu`
        I normally use ulrich.
      • 2015-02-10 04116, 2015

      • ianmcorvidae
        currently memory use is still climbing, but the system is only using ~5G of memory, not ~14 of its 16. so if it levels out at some point hopefully we'll know what that point is, this run
      • 2015-02-10 04115, 2015

      • ianmcorvidae
        still does seem high though, it's at almost 2G of VM
      • 2015-02-10 04158, 2015

      • ianmcorvidae
        heh, and it's currently on the user it died on last time, interesting :)
      • 2015-02-10 04124, 2015

      • chirlu`
        Does that user have lots of subscriptions or something?
      • 2015-02-10 04136, 2015

      • ianmcorvidae
        I don't know, it hasn't gotten far enough to actually print out the username
      • 2015-02-10 04154, 2015

      • ianmcorvidae
        it's whoever it'll get to after DosX
      • 2015-02-10 04159, 2015

      • ianmcorvidae
        there we go, it did pass it
      • 2015-02-10 04101, 2015

      • ianmcorvidae sees who it was
      • 2015-02-10 04129, 2015

      • ianmcorvidae
        maybe it was just not flushing or something. odd
      • 2015-02-10 04109, 2015

      • ianmcorvidae
        ojnkpjg with 12440 subscriptions wasn't too far after though, so that might be it
      • 2015-02-10 04102, 2015

      • chirlu`
        Yes, that sounds like much.
      • 2015-02-10 04136, 2015

      • chirlu`
        In particular when subscription mails haven’t been sent for a while, so there may be many edits piled up.
      • 2015-02-10 04154, 2015

      • ianmcorvidae
        sure
      • 2015-02-10 04118, 2015

      • ianmcorvidae
        I notice it goes to the trouble of finding subscriptions before checking if there's a confirmed email, that seems like a possibly minor optimization that could happen
      • 2015-02-10 04155, 2015

      • chirlu`
        Hm, possibly.
      • 2015-02-10 04133, 2015
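
The reordering suggested above could look roughly like this: check for a confirmed email first, and only then pay for the subscription lookup. This is a Python sketch under assumptions; the `email_confirmed` key, the `id` key, and `find_subscriptions` are hypothetical stand-ins for whatever the real script uses.

```python
def editors_to_notify(editors, find_subscriptions):
    """Yield (editor, subscriptions) pairs for editors who can receive mail.

    `editors` is an iterable of dicts and `find_subscriptions` is a callable
    taking an editor id; both shapes are assumptions for this sketch.
    """
    for editor in editors:
        # Cheap check first: skip editors without a confirmed address
        # before doing the potentially expensive subscription lookup.
        if not editor.get("email_confirmed"):
            continue
        subscriptions = find_subscriptions(editor["id"])
        if subscriptions:
            yield editor, subscriptions
```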

      • ianmcorvidae
        however, memory usage has leveled off now, also, it seems. 18% memory, heh. 3105M of VM
      • 2015-02-10 04156, 2015

      • ianmcorvidae
        yeah, it finished a batch: End of batch: removing 53338 entities from the cache (out of 54214)
      • 2015-02-10 04130, 2015

      • ianmcorvidae
        I get the impression we may want to make batches an order of magnitude smaller, perhaps
      • 2015-02-10 04144, 2015

      • chirlu`
        That sounds good. So it still caches around 1000 entities that were used multiple times.
      • 2015-02-10 04153, 2015
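
One way to picture the end-of-batch pruning being discussed, as a Python sketch rather than the script's actual Perl: entities requested more than once during a batch are kept, the rest are dropped. The `BatchCache` class, its `loader` callable, and the exact eviction rule are assumptions inferred from the "End of batch" log line, not confirmed behaviour.

```python
from collections import Counter


class BatchCache:
    """Entity cache that is pruned at the end of each batch of editors."""

    def __init__(self, loader):
        self._loader = loader  # hypothetical callable: entity id -> entity
        self._entities = {}
        self._hits = Counter()

    def get(self, entity_id):
        """Return the entity, loading it on first use and counting the request."""
        self._hits[entity_id] += 1
        if entity_id not in self._entities:
            self._entities[entity_id] = self._loader(entity_id)
        return self._entities[entity_id]

    def end_of_batch(self):
        """Drop entities requested at most once; return (removed, total)."""
        total = len(self._entities)
        kept = {k: v for k, v in self._entities.items() if self._hits[k] > 1}
        self._entities = kept
        self._hits.clear()  # request counts start fresh for the next batch
        return total - len(kept), total
```

With this rule a smaller batch keeps fewer entities, since fewer of them are requested twice within the batch, which is the trade-off raised a few messages later.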

      • chirlu`
        Yes, perhaps 1000 is too much.
      • 2015-02-10 04118, 2015

      • ianmcorvidae
        will be interesting to see how it does with subsequent batches, obviously I won't change it until our test run is done, but :)
      • 2015-02-10 04119, 2015

      • chirlu`
        Though reducing the size will also reduce the benefit of the cache, because only very popular entities will appear twice in a 100-editor batch (say).
      • 2015-02-10 04155, 2015

      • ianmcorvidae
        true
      • 2015-02-10 04127, 2015

      • chirlu`
        Anyway, the second batch should reuse the memory from the 53338 entities that were removed, so memory usage should stay mostly constant now.
      • 2015-02-10 04139, 2015

      • ianmcorvidae
        yeah
      • 2015-02-10 04157, 2015

      • chirlu`
        Unless there is another editor with zillions of subscriptions.
      • 2015-02-10 04104, 2015

      • ianmcorvidae
        reosarevok :P
      • 2015-02-10 04107, 2015

      • ianmcorvidae
        and probably nikki
      • 2015-02-10 04129, 2015

      • ianmcorvidae
        I doubt ojnkpjg is the worst as far as that
      • 2015-02-10 04158, 2015

      • chirlu`
        Hm, perhaps some per-editor limit of how much to put into the cache?
      • 2015-02-10 04148, 2015

      • chirlu`
        “Cache only the 100 most relevant subscriptions”, for some definition of relevant.
      • 2015-02-10 04108, 2015

      • ianmcorvidae
        hm, maybe
      • 2015-02-10 04115, 2015

      • ianmcorvidae
        btw, for artists at least ojnkpjg is fifth
      • 2015-02-10 04145, 2015

      • ianmcorvidae
        drsaunde has 53751 subscriptions looking only at artists :)
      • 2015-02-10 04106, 2015

      • chirlu`
        I had also thought about asking the database beforehand about the most common subscriptions, and then only cache those.
      • 2015-02-10 04117, 2015

      • ianmcorvidae
        drsaunde is the one who'll break this nice and hard for us, I think, if he does :) 53751 artists and 5283 labels
      • 2015-02-10 04133, 2015

      • chirlu`
        I.e. “give me all entities that have at least 3 subscribers” or something.
      • 2015-02-10 04149, 2015

      • ianmcorvidae
        yeah, could work
      • 2015-02-10 04134, 2015
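
The "at least 3 subscribers" idea maps onto a `GROUP BY ... HAVING` query. The sketch below uses Python with SQLite purely so it can run standalone; the table `editor_subscribe_artist` and its `editor` and `artist` columns are assumptions for illustration and may not match the production schema.

```python
import sqlite3


def popular_artists(conn, min_subscribers=3):
    """Return artist ids with at least `min_subscribers` subscribers.

    Assumes a table editor_subscribe_artist(editor, artist); the name and
    columns are hypothetical stand-ins for this sketch.
    """
    rows = conn.execute(
        "SELECT artist, COUNT(*) AS n "
        "FROM editor_subscribe_artist "
        "GROUP BY artist "
        "HAVING COUNT(*) >= ? "
        "ORDER BY n DESC, artist",
        (min_subscribers,),
    )
    return [artist for artist, _count in rows]
```

The returned ids could then serve as an allow-list for the cache, so that entities with only one or two subscribers are never stored in the first place.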

      • ianmcorvidae
        second batch removed 24795 of 26225 entities
      • 2015-02-10 04139, 2015

      • ianmcorvidae
        so about the same amount of growth I guess
      • 2015-02-10 04106, 2015

      • ianmcorvidae
        (in the cache, I mean. I doubt it even got close to filling that 3G of memory with half as many entities)
      • 2015-02-10 04155, 2015

      • diana_olhovik joined the channel
      • 2015-02-10 04143, 2015

      • ariscop joined the channel
      • 2015-02-10 04157, 2015

      • ianmcorvidae
        yeah, memory usage is climbing again :(
      • 2015-02-10 04108, 2015

      • ianmcorvidae
        up to 27.1% with 4588M (virt)
      • 2015-02-10 04129, 2015

      • ianmcorvidae
        (4435 resident as of when I started typing this line :P)
      • 2015-02-10 04153, 2015

      • chirlu`
        Is it at reosarevok now? :)
      • 2015-02-10 04113, 2015

      • ianmcorvidae
        dunno :)
      • 2015-02-10 04148, 2015

      • chirlu`
        Would be editor #326637.
      • 2015-02-10 04152, 2015

      • Freso
        Just make drsaunde his own batch.
      • 2015-02-10 04105, 2015

      • Freso
        >_>
      • 2015-02-10 04110, 2015

      • ianmcorvidae
        Freso: what I'm wondering is if that wouldn't still get killed by itself :P