#metabrainz

/

      • abhishekpanwar
        Or simply just remove all the relationships from all the relaionship_sets where the deleted work is the target?
      • 2020-01-28 02857, 2020

      • ruaok
        iliekcomputers: my schedule is getting a bit mixed up for tonight. I'm free now, but 19h is more complicated. can we talk now?
      • 2020-01-28 02834, 2020

      • iliekcomputers
        Sure, I'm walking home, we can do it now
      • 2020-01-28 02851, 2020

      • iliekcomputers
        So
      • 2020-01-28 02854, 2020

      • ruaok
        shit, I didn't get a chance yet to read your PR. :(
      • 2020-01-28 02801, 2020

      • iliekcomputers
        Np
      • 2020-01-28 02827, 2020

      • iliekcomputers
        So there was a problem with us sending huge rmq messages
      • 2020-01-28 02846, 2020

      • iliekcomputers
        Which I fixed and deployed on the cluster
      • 2020-01-28 02854, 2020

      • sarthak_jain joined the channel
      • 2020-01-28 02803, 2020

      • sarthak_jain
        Hi pristine__
      • 2020-01-28 02804, 2020

      • iliekcomputers
        I also added sentry and more exception handling.
      • 2020-01-28 02835, 2020

      • iliekcomputers
        At this point, I'm happy enough with the code to calculate and ship around stats
      • 2020-01-28 02845, 2020

      • ruaok
        excellent.
      • 2020-01-28 02858, 2020

      • iliekcomputers
        However the cluster doesn't get updated with new data regularly
      • 2020-01-28 02859, 2020

      • ruaok
        that means we're a step closer to a release, yes?
      • 2020-01-28 02808, 2020

      • iliekcomputers
        Yes
      • 2020-01-28 02821, 2020

      • sarthak_jain
        Since, I did not have an actual dataframe to check with, I created my own dataframe using the schema provided, and used the pyspark sql module queries to check if I am getting the desired results, which I am able to get.
      • 2020-01-28 02829, 2020

      • ruaok
        fortunately pristine__ has a PR that loads new data.
      • 2020-01-28 02835, 2020

      • ruaok
        not incrementals, but still.
      • 2020-01-28 02844, 2020

      • sarthak_jain
        But, when I run `spark_test.sh', I get errors.
      • 2020-01-28 02847, 2020

      • iliekcomputers
        Right
      • 2020-01-28 02856, 2020

      • iliekcomputers
        Here's what I got for now
      • 2020-01-28 02857, 2020

      • sarthak_jain
        Should I shoot a PR, so that you could check ?
      • 2020-01-28 02803, 2020

      • iliekcomputers
        We make dumps twice a month.
      • 2020-01-28 02816, 2020

      • iliekcomputers
        We should reload the data into the cluster twice a month too
      • 2020-01-28 02827, 2020

      • iliekcomputers
        From the full dumps
      • 2020-01-28 02828, 2020

      • ruaok nods
      • 2020-01-28 02839, 2020

      • ruaok
        ideally load before running stats. :)
      • 2020-01-28 02811, 2020

      • iliekcomputers
        That way we can release stats while I look into what the best way to handle deleted users in the incremental dumps is.
      • 2020-01-28 02836, 2020

      • ruaok
        O_O
      • 2020-01-28 02852, 2020

      • ruaok
        I hadn't considered that wrinkle. but yes, that plan is good.
      • 2020-01-28 02826, 2020

      • iliekcomputers
        I realized the consistency problems in incremental dumps was probably due to someone deleting their account.
      • 2020-01-28 02829, 2020

      • iliekcomputers
        Ok
      • 2020-01-28 02839, 2020

      • iliekcomputers
        So action items for now are
      • 2020-01-28 02856, 2020

      • iliekcomputers
        Add cron jobs to the cluster which import new dumps
      • 2020-01-28 02858, 2020

      • ruaok
        I guess we need to add a "user deleted" message into the incrementals...
      • 2020-01-28 02813, 2020

      • iliekcomputers
        Add cron job to lemmy which requests stats
      • 2020-01-28 02856, 2020

      • abhishekpanwar has quit
      • 2020-01-28 02800, 2020

      • iliekcomputers
        I'm not sure if the cluster needs to be "productionized" any more than just creating a new account from which to run prod services
      • 2020-01-28 02829, 2020

      • ruaok
        likely, but...
      • 2020-01-28 02835, 2020

      • iliekcomputers
        Once the cron jobs are added, we let it run for a month in beta and then do a release.
      • 2020-01-28 02803, 2020

      • ruaok
        a proper production cluster should run on a VLAN with a different firewall rules and the like.
      • 2020-01-28 02821, 2020

      • iliekcomputers
        Ah
      • 2020-01-28 02829, 2020

      • ruaok
        but, our cluster only really contains public data, so if that gets hacked, its not the end of the world.
      • 2020-01-28 02842, 2020

      • ruaok
        at this point I would suggest that we add cron jobs and move on.
      • 2020-01-28 02801, 2020

      • iliekcomputers
        Sure, makes life easier.
      • 2020-01-28 02807, 2020

      • ruaok
        I've asked zas to take a look at how to restrict the docker daemon to run on only one specific nic.
      • 2020-01-28 02833, 2020

      • ruaok
        that would allow me to easily setup a new cluster with more consistent hardware, rather than the hodgepodge I cobbled together a few weeks ago.
      • 2020-01-28 02854, 2020

      • iliekcomputers
        Other than that, I'm gonna look into observability a bit. If we could get notified somewhere that cron jobs are firing and working well, that'd be pretty nice.
      • 2020-01-28 02833, 2020

      • ruaok
        can we use sentry for that?
      • 2020-01-28 02841, 2020

      • ruaok
        I suppose that is an inverted use case.
      • 2020-01-28 02823, 2020

      • iliekcomputers
        I'll probably just add some code to send us an email
      • 2020-01-28 02828, 2020

      • ruaok
        but, I agree. we should be keeping a closer eye on things for the first few months.
      • 2020-01-28 02834, 2020

      • iliekcomputers
        Haven't thought about it much.
      • 2020-01-28 02844, 2020

      • ruaok
        ok, that is what we did with MB for... a decade?
      • 2020-01-28 02801, 2020

      • iliekcomputers
        Haha. Then, it works? :D
      • 2020-01-28 02813, 2020

      • ruaok
        if the humans read the email, yes.
      • 2020-01-28 02831, 2020

      • iliekcomputers
        Sure. I'll read em.
      • 2020-01-28 02838, 2020

      • iliekcomputers sending mails to himself
      • 2020-01-28 02804, 2020

      • ruaok
        copy me. if we make it clear in the subject line if the thing failed, then it is much more likely get to get proper attention.
      • 2020-01-28 02804, 2020

      • iliekcomputers
        So that's the basic gist of the progress and the plan forward.
      • 2020-01-28 02859, 2020

      • ruaok
        sounds good. ping me if you need help with the setup of the cron jobs.
      • 2020-01-28 02807, 2020

      • ruaok
        I am not sure if the cluster is setup to email right now.
      • 2020-01-28 02835, 2020

      • ruaok
        zas: how difficult is it to get one machine to be able to send mail as meb?
      • 2020-01-28 02844, 2020

      • iliekcomputers
        Mhmm, will probably need help with that stuff.
      • 2020-01-28 02811, 2020

      • ruaok
        let me work with zas on getting email setup on the cluster.
      • 2020-01-28 02833, 2020

      • zas
        ruaok: you need to add IP to gmail to allow relay, and ofc set up a sender
      • 2020-01-28 02853, 2020

      • ruaok
        ok, doesn't sound too bad.
      • 2020-01-28 02808, 2020

      • ruaok
        I've got it on my list and when I work on it later this week, I'll ping ya, ok?
      • 2020-01-28 02809, 2020

      • iliekcomputers
        ruaok: can you update trello with the tasks for cron and email
      • 2020-01-28 02814, 2020

      • ruaok
        k
      • 2020-01-28 02855, 2020

      • iliekcomputers
        Thanks!
      • 2020-01-28 02801, 2020

      • ruaok
        > Look into why number of listens in incremental dumps do not match up.
      • 2020-01-28 02803, 2020

      • ruaok
        that is done now?
      • 2020-01-28 02812, 2020

      • iliekcomputers
        I think so, yes.
      • 2020-01-28 02847, 2020

      • iliekcomputers
        Should be replaced with "allow for deletion of users/listens in incremental dumps"
      • 2020-01-28 02852, 2020

      • c1e0_ has quit
      • 2020-01-28 02815, 2020

      • zas
        ruaok: about docker network thing, I planned to have a look tomorrow, though I'm not sure about your exact goals, I'll ping you to clarify
      • 2020-01-28 02846, 2020

      • zas
        I have to run off, gf 's birthday cannot be skipped
      • 2020-01-28 02830, 2020

      • ruaok
        zas:perfect, do that.
      • 2020-01-28 02853, 2020

      • ruaok
        iliekcomputers: done.
      • 2020-01-28 02801, 2020

      • iliekcomputers
        Thanks!
      • 2020-01-28 02817, 2020

      • prabal has quit
      • 2020-01-28 02811, 2020

      • reosarevok
        yvanzo: around again, will check your comments :)
      • 2020-01-28 02830, 2020

      • Freso
        outsidecontext: MusicBottle used Weblate.
      • 2020-01-28 02827, 2020

      • Mr_Monkey is always Webontime
      • 2020-01-28 02811, 2020

      • reosarevok is neither weblate nor webearly, it's just webwhenhemeanstoarrive
      • 2020-01-28 02830, 2020

      • ruaok
        so, webspanish?
      • 2020-01-28 02823, 2020

      • yvanzo
        reosarevok: btw, you should be able to update beta now.
      • 2020-01-28 02817, 2020

      • reosarevok
        yvanzo: as in, we should do a beta release now?
      • 2020-01-28 02833, 2020

      • CatQuest
        zas: re transaltinos. right now i cant really get transifex to even *work* so a move t oa differnt platform would *add* my ability to translate (not that norwegian translation is super useful ,but)
      • 2020-01-28 02835, 2020

      • sarthak_jain has quit
      • 2020-01-28 02826, 2020

      • yvanzo
        reosarevok: iirc, we should update beta as soon as possible, except when it’s frozen.
      • 2020-01-28 02800, 2020

      • reosarevok
        Ok
      • 2020-01-28 02807, 2020

      • reosarevok
        Is there anything that can be merged already, I wonder
      • 2020-01-28 02834, 2020

      • reosarevok
        yvanzo: ideally you'd check https://github.com/metabrainz/musicbrainz-server/… but it's big
      • 2020-01-28 02843, 2020

      • reosarevok
        I'd really love to have it on beta for as long as possible though
      • 2020-01-28 02847, 2020

      • reosarevok
        (not mine!)
      • 2020-01-28 02854, 2020

      • reosarevok
        It's on test at the moment FWIW
      • 2020-01-28 02855, 2020

      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #345 (master…fix-BB-348/340): Fix : BB #348/340 https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-01-28 02856, 2020

      • BrainzBot
        BB-348: Revision diffs don't handle multiple entities well https://tickets.metabrainz.org/browse/BB-348
      • 2020-01-28 02829, 2020

      • CatQuest
        transifex doesnt evne work in chrome (and it's the latest there cna be on this computer)
      • 2020-01-28 02848, 2020

      • CatQuest
        you really, really really shouldnt need a bleeding edge super computer for translating strings :/
      • 2020-01-28 02857, 2020

      • yvanzo
        reosarevok: I mean there are many changes in master already, but we missed updating beta since last prod release.
      • 2020-01-28 02852, 2020

      • travis-ci joined the channel
      • 2020-01-28 02852, 2020

      • travis-ci
        Project bookbrainz-site build #2570: passed in 2 min 22 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…
      • 2020-01-28 02852, 2020

      • travis-ci has left the channel
      • 2020-01-28 02805, 2020

      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #343 (master…fix-BB-396): Fix- #BB-396 : drop down not working for error page https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-01-28 02806, 2020

      • BrainzBot
        BB-396: Dropdown do not work in error page https://tickets.metabrainz.org/browse/BB-396
      • 2020-01-28 02851, 2020

      • Freso
        CatQuest: How far can you get? It is possible to download strings to translate from Transifex too, work on them locally, and upload them to Transifex. That might be an option if you want to translate but can’t use Transifex’s translation UI
      • 2020-01-28 02852, 2020

      • yvanzo
        started to review that one already but it's big indeed!
      • 2020-01-28 02829, 2020

      • Freso
        bitmap: Ping? Did you see my question regarding your last message to me?
      • 2020-01-28 02859, 2020

      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #329 (master…delete-page-improvements): fix(deletion): Delete page improvements https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-01-28 02832, 2020

      • CatQuest
        Freso: hmm....
      • 2020-01-28 02804, 2020

      • travis-ci joined the channel
      • 2020-01-28 02804, 2020

      • travis-ci
        Project bookbrainz-site build #2571: passed in 2 min 24 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…
      • 2020-01-28 02804, 2020

      • travis-ci has left the channel
      • 2020-01-28 02823, 2020

      • reosarevok
        yvanzo: I'm making a few changes to my PRs as requested, and after that I can release beta
      • 2020-01-28 02851, 2020

      • travis-ci joined the channel
      • 2020-01-28 02851, 2020

      • travis-ci
        Project bookbrainz-site build #2572: passed in 2 min 10 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…
      • 2020-01-28 02851, 2020

      • travis-ci has left the channel
      • 2020-01-28 02850, 2020

      • reosarevok
        yvanzo: how does this sound?
      • 2020-01-28 02851, 2020

      • reosarevok
      • 2020-01-28 02835, 2020

      • reosarevok
      • 2020-01-28 02840, 2020

      • reosarevok
        Better?
      • 2020-01-28 02849, 2020

      • bitmap
        Freso: You have to run that on the database before pg_upgrade (so probably need to start it back up with the old version of pg)
      • 2020-01-28 02809, 2020

      • yvanzo
        reosarevok: IIUYC, it prevents data loss but it doesn’t solve a less-severe related issue about pregap?
      • 2020-01-28 02819, 2020

      • reosarevok
        What's the less severe issue?
      • 2020-01-28 02850, 2020

      • reosarevok
        I mean, I don't think being able to move from track 0 to track 1 when there's no discID is an issue
      • 2020-01-28 02805, 2020

      • reosarevok
        It's just the user can do something wrong
      • 2020-01-28 02824, 2020

      • reosarevok
        The user can also add a data track as a normal track and that will also block adding a discID, but that's because they're doing it wrong :)
      • 2020-01-28 02833, 2020

      • yvanzo
        Ok, and +1 for the comments.
      • 2020-01-28 02833, 2020

      • reosarevok
        "If possible, help message should give legitimate use cases for these risky options."
      • 2020-01-28 02854, 2020

      • reosarevok
        You mean "you should only uncheck the pregap checkbox if the disc doesn't have a pregap track"?
      • 2020-01-28 02833, 2020

      • yvanzo
        Maybe it would be possible to make it easier to attach a DiscID to tracklist with erroneous pregap/data track. (about users doing it wrong) (but that is a separate issue)
      • 2020-01-28 02845, 2020

      • reosarevok
        Re: https://github.com/metabrainz/musicbrainz-server/… - that's actually an interesting usecase that we should maybe support too, wanna add a ticket?
      • 2020-01-28 02805, 2020

      • reosarevok
        But I think this is good enough for now - I'll change the error though.
      • 2020-01-28 02840, 2020

      • yvanzo
        reosarevok: yes, probably something like that. I’m not completely familiar with pregap tracks, probably never edited one myself.
      • 2020-01-28 02850, 2020

      • reosarevok
        I don't think I've ever seen one either tbh
      • 2020-01-28 02840, 2020

      • yvanzo
        Another example, you just opened a PR to add genre MBIDs which won't be found either.
      • 2020-01-28 02800, 2020

      • reosarevok
        Then I should amend that PR :)
      • 2020-01-28 02817, 2020

      • reosarevok
      • 2020-01-28 02828, 2020

      • yvanzo
        (But it becomes difficult to follow three topics at the same time here...)
      • 2020-01-28 02805, 2020

      • CatQuest
        genre's have mbids? :O
      • 2020-01-28 02813, 2020

      • reosarevok
        `No MusicBrainz entities match the MBID {mbid}. Either it’s incorrect, it was for an entity that has since been deleted, or it is for something else than an entity (for example, a relationship type).`,
      • 2020-01-28 02822, 2020

      • yvanzo
        I had the same reaction at first :O
      • 2020-01-28 02836, 2020

      • CatQuest
        erhmmmmm reosarevok I can "remove " it
      • 2020-01-28 02841, 2020

      • reosarevok
        yvanzo, you reviewed the code that meant they had MBIDs :p
      • 2020-01-28 02802, 2020

      • yvanzo
        reosarevok: please reply in the PR, I cannot follow all conversations at the same time here :D
      • 2020-01-28 02812, 2020

      • reosarevok
        Ok