#metabrainz

      • supersandro2000 has quit
      • 2020-05-26 14715, 2020

      • supersandro20000 joined the channel
      • 2020-05-26 14718, 2020

      • d4rkie has quit
      • 2020-05-26 14748, 2020

      • D4RK-PH0ENiX joined the channel
      • 2020-05-26 14752, 2020

      • ephemer0l_ joined the channel
      • 2020-05-26 14738, 2020

      • KindOne has quit
      • 2020-05-26 14739, 2020

      • KindOne joined the channel
      • 2020-05-26 14707, 2020

      • KindTwo joined the channel
      • 2020-05-26 14753, 2020

      • KindOne has quit
      • 2020-05-26 14740, 2020

      • KindTwo has quit
      • 2020-05-26 14715, 2020

      • KindOne joined the channel
      • 2020-05-26 14742, 2020

      • Gore has quit
      • 2020-05-26 14701, 2020

      • Gore joined the channel
      • 2020-05-26 14735, 2020

      • Chinmay3199 joined the channel
      • 2020-05-26 14739, 2020

      • bitmap
        zas: not urgent but you can remove musicbrainz-redis-store-beta from the redis stats whenever you get a chance, it's been merged into musicbrainz-redis-store
      • 2020-05-26 14721, 2020

      • bitmap
        btw, if anyone was logged out of beta let me know, out of curiosity (my session was preserved but there might've been race conditions during the move)
      • 2020-05-26 14748, 2020

      • yvanzo
        bitmap: can I help you with beta or redis?
      • 2020-05-26 14757, 2020

      • bitmap
        yvanzo: I finished merging the beta redis store into the prod one, so we can deploy #1529 to beta now if you're up for that
      • 2020-05-26 14737, 2020

      • bitmap
        off to sleep for now
      • 2020-05-26 14708, 2020

      • yvanzo
        'night!
      • 2020-05-26 14741, 2020

      • BrainzGit
        [musicbrainz-server] yvanzo merged pull request #1529 (master…mbs-10845): MBS-10845: User lookup tools for account admins https://github.com/metabrainz/musicbrainz-server/…
      • 2020-05-26 14742, 2020

      • BrainzBot
        MBS-10845: Tool to allow account admins to look up accounts by e-mail https://tickets.metabrainz.org/browse/MBS-10845
      • 2020-05-26 14742, 2020

      • Rotab has quit
      • 2020-05-26 14703, 2020

      • Rotab joined the channel
      • 2020-05-26 14757, 2020

      • v6lur_ joined the channel
      • 2020-05-26 14713, 2020

      • Sophist-UK has quit
      • 2020-05-26 14749, 2020

      • BrainzGit
        [musicbrainz-server] yvanzo merged pull request #1507 (master…edit-label): MBS-10817: Convert Edit Label edit to React https://github.com/metabrainz/musicbrainz-server/…
      • 2020-05-26 14750, 2020

      • BrainzBot
        MBS-10817: Convert Edit Label edit to React https://tickets.metabrainz.org/browse/MBS-10817
      • 2020-05-26 14703, 2020

      • Mr_Monkey
        Mornin'!
      • 2020-05-26 14754, 2020

      • Wizzup has quit
      • 2020-05-26 14703, 2020

      • yokel has quit
      • 2020-05-26 14725, 2020

      • yokel joined the channel
      • 2020-05-26 14758, 2020

      • Wizzup joined the channel
      • 2020-05-26 14749, 2020

      • diru1100
        Mooin'!!
      • 2020-05-26 14738, 2020

      • supersandro20000 is now known as supersandro2000
      • 2020-05-26 14738, 2020

      • _lucifer
        SomalRudra: you added to the wrong .gitignore file. add the contents to the gitignore file in the root directory.
      • 2020-05-26 14737, 2020

      • _lucifer
        after that follow the steps here to enforce the changes https://stackoverflow.com/questions/7075923/resyn…
      • 2020-05-26 14701, 2020

      • Gazooo has quit
      • 2020-05-26 14742, 2020

      • Gazooo joined the channel
      • 2020-05-26 14732, 2020

      • jmp_music joined the channel
      • 2020-05-26 14734, 2020

      • jmp_music
        moooin
      • 2020-05-26 14700, 2020

      • ruaok
        moin!
      • 2020-05-26 14717, 2020

      • ruaok
        iliekcomputers: ping me when you have a minute.
      • 2020-05-26 14705, 2020

      • shivam-kapila
        Morning
      • 2020-05-26 14737, 2020

      • iliekcomputers
        ruaok: hey
      • 2020-05-26 14704, 2020

      • yvanzo
        updating beta.mb.o
      • 2020-05-26 14715, 2020

      • ruaok
        iliekcomputers: so, I've gotten very little to no feedback on the dups. everyone seems to be fixated on listen counts, rather than doing some digging to find dups.
      • 2020-05-26 14750, 2020

      • ruaok
        I think the listen counts in production are wrong too and I don't want to spend time fixing them.
      • 2020-05-26 14718, 2020

      • iliekcomputers
        The listen counts in production as in the user listen counts or the global listen count?
      • 2020-05-26 14719, 2020

      • ruaok
        I think the thing to do is actually start a migration that could make its way to production.
      • 2020-05-26 14731, 2020

      • ruaok
        both probably
      • 2020-05-26 14754, 2020

      • iliekcomputers
        The global listen count is definitely incorrect
      • 2020-05-26 14703, 2020

      • iliekcomputers
        I remember opening an issue about it.
      • 2020-05-26 14723, 2020

      • ruaok
        so, by getting a production ready setup, we can create a more realistic comparison, that people can then ignore.
      • 2020-05-26 14702, 2020

      • ruaok
        but I wonder if "this is being ignored" is more like "I can't find a problem, but I can't say for sure, so I won't say anything."
      • 2020-05-26 14729, 2020

      • iliekcomputers
        I think it's the second.
      • 2020-05-26 14739, 2020

      • ruaok
        likely.
      • 2020-05-26 14709, 2020

      • ruaok
        so, starting a production ready conversion.... I suppose first step might be for you to review the import script.
      • 2020-05-26 14739, 2020

      • ruaok
        then to do an actual migration -- which will be a bit tricky.
      • 2020-05-26 14746, 2020

      • iliekcomputers
        Yeah, that sounds good. Happy to review.
      • 2020-05-26 14756, 2020

      • iliekcomputers
        Would a doc detailing the steps be helpful?
      • 2020-05-26 14703, 2020

      • ruaok
        I would need to start a new exchange and connect it to the incoming stream.
      • 2020-05-26 14710, 2020

      • ruaok
        yes!
      • 2020-05-26 14723, 2020

      • ruaok
        I'll set up for the review and the doc after this chat.
      • 2020-05-26 14716, 2020

      • ruaok
        once the new exchange starts receiving listens (and the queue will grow significantly) we will need to trigger a new dump.
      • 2020-05-26 14741, 2020

      • ruaok
        and then once the dump is done, then it will take some 12-24 hours to prepare and import the dump.
      • 2020-05-26 14757, 2020

      • ruaok
        then we can connect it live and in theory we should have a clean and consistent database.
      • 2020-05-26 14720, 2020

      • ruaok
        I'm somewhat concerned about the number of listens growing, but I think we should be ok.
      • 2020-05-26 14750, 2020

      • ruaok
        so, right now my migration code is in a separate repo.
      • 2020-05-26 14759, 2020

      • iliekcomputers
        Could we just keep writing the listens in the queue and import simultaneously?
      • 2020-05-26 14714, 2020

      • ruaok
        that is exactly what I want to do.
      • 2020-05-26 14721, 2020

      • iliekcomputers
        Writing the listens to timescale
      • 2020-05-26 14726, 2020

      • iliekcomputers
        I meant to say
      • 2020-05-26 14736, 2020

      • ruaok
        ah, no, not ideal.
      • 2020-05-26 14700, 2020

      • ruaok
        the insert will be most performant when you insert in sequential time order from oldest to newest.
      • 2020-05-26 14717, 2020

      • iliekcomputers
        Okay, that makes sense.
      • 2020-05-26 14718, 2020

      • ruaok
        having listens stream in will break that.
      • 2020-05-26 14750, 2020

      • ruaok
        what I suppose we can try is to stream the listens to timescale *until* the import. then stop it, let the queue grow, do the import, then catch up.
      • 2020-05-26 14709, 2020

      • ruaok
        I *think* that won't impact the import so badly.
      • 2020-05-26 14730, 2020

      • iliekcomputers
        If the rabbitmq queue falls over, we'll just start the process over again with something else I guess. Sounds reasonable to me.
      • 2020-05-26 14707, 2020

      • ruaok
        given that the import code is going to run once, by me, I'm not too keen on importing the code into lb-server and tidying it up to our strict standards.
      • 2020-05-26 14723, 2020

      • ruaok
        can you review two scripts on their own and mainly look for logical errors?
      • 2020-05-26 14742, 2020

      • iliekcomputers
        OK, sure.
      • 2020-05-26 14702, 2020

      • ruaok
      • 2020-05-26 14711, 2020

      • ruaok
      • 2020-05-26 14739, 2020

      • ruaok
        that is the first script. it reads in the listens dump and creates a single file that gnu sort can sort on the command line.
      • 2020-05-26 14751, 2020
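      A minimal sketch of the kind of preprocessing described above. It is not ruaok's actual script (the link was not preserved in this log). It assumes a hypothetical dump format of one JSON listen per line with an integer `listened_at` Unix timestamp, and the function name is made up for illustration. Each listen is written as a zero-padded timestamp, a tab, and the JSON, so that a plain byte-wise sort orders the file from oldest to newest.

```python
import io
import json


def write_sortable_lines(dump_lines, out):
    """Write one 'timestamp<TAB>json' line per listen and return the count.

    Assumes (hypothetically) that each input line is a JSON object with an
    integer 'listened_at' field holding a Unix timestamp.
    """
    count = 0
    for line in dump_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the dump
        listen = json.loads(line)
        # Zero-pad so a byte-wise sort orders the timestamps numerically.
        out.write("%010d\t%s\n" % (listen["listened_at"],
                                   json.dumps(listen, sort_keys=True)))
        count += 1
    return count


dump = [
    '{"listened_at": 1590000300, "user_name": "alice", "track": "B"}',
    '{"listened_at": 1590000100, "user_name": "bob", "track": "A"}',
]
buf = io.StringIO()
written = write_sortable_lines(dump, buf)
# sorted() stands in here for running GNU sort on the output file.
sorted_lines = sorted(buf.getvalue().splitlines())
```

      With a real file, the sorting step could be something like `LC_ALL=C sort sortable.txt -o sorted.txt`; the file names are placeholders.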

      • ruaok
      • 2020-05-26 14700, 2020

      • ruaok
        is the second script that does the actual importing.
      • 2020-05-26 14721, 2020

      • iliekcomputers
        Okay, will read through when I get the time.
      • 2020-05-26 14725, 2020

      • ruaok
        ok, great.
      • 2020-05-26 14730, 2020

      • ruaok
        I'll work up a migration doc next.
      • 2020-05-26 14738, 2020

      • iliekcomputers
        Great, thanks!
      • 2020-05-26 14751, 2020

      • ruaok
        the importer is a bit tricky, since it uses threads to run 5 insertions at the same time.
      • 2020-05-26 14704, 2020

      • ruaok
        tuned to make the import move at a reasonable speed.
      • 2020-05-26 14759, 2020

      • ruaok
        the two most critical functions are check_for_duplicates and import_dump_file
      • 2020-05-26 14709, 2020
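      A rough sketch of running five insertions at once with a thread pool, to illustrate the idea discussed above. It does not reproduce check_for_duplicates or import_dump_file, whose code is not in this log; every name below (make_batches, dedupe_batch, FakeStore, import_rows) is hypothetical, and the in-memory store is a stand-in for the database.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

NUM_WORKERS = 5  # the "5 insertions at the same time" mentioned above


def make_batches(rows, batch_size):
    """Split an already time-sorted list of rows into consecutive batches."""
    return [rows[i:i + batch_size] for i in range(0, len(rows), batch_size)]


def dedupe_batch(batch):
    """Drop rows that repeat an earlier row within the same batch."""
    seen = set()
    unique = []
    for row in batch:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


class FakeStore:
    """Stand-in for the database; a real importer would run INSERTs here."""

    def __init__(self):
        self._lock = threading.Lock()
        self.rows = []

    def insert_batch(self, batch):
        with self._lock:  # protect the shared list across worker threads
            self.rows.extend(batch)
        return len(batch)


def import_rows(rows, store, batch_size=2):
    """Insert de-duplicated batches concurrently; return the rows inserted."""
    batches = [dedupe_batch(b) for b in make_batches(rows, batch_size)]
    with ThreadPoolExecutor(max_workers=NUM_WORKERS) as pool:
        # map() returns results in submission order even if batches
        # finish out of order.
        return sum(pool.map(store.insert_batch, batches))


rows = [(1, "alice", "A"), (1, "alice", "A"), (2, "bob", "B"),
        (3, "alice", "C"), (4, "bob", "D")]
store = FakeStore()
inserted = import_rows(rows, store)
```

      Note that this sketch only removes duplicates inside a single batch; a duplicate pair that straddles a batch boundary would need an extra check.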

      • iliekcomputers
        Thanks, I'll take a look.
      • 2020-05-26 14739, 2020

      • zas
        yvanzo: we can stop search-server (old one) containers right?
      • 2020-05-26 14755, 2020

      • yvanzo
        zas: it is still available for mirrors using LUCENE search, not sure it is still used.
      • 2020-05-26 14726, 2020

      • zas
        we need to sort this out, because those are using quite a lot of resources we may use for something else. What about stopping them for a while and seeing if anyone complains (if they do, we'll see what to do, but unlikely starting those again, rather pointing them at solr)? I remember we had some hacks in preparation to ease the move, but they were never finished, and led to a lot of complexity.
      • 2020-05-26 14707, 2020

      • yvanzo
        are there any stats about search.musicbrainz.org ?
      • 2020-05-26 14708, 2020

      • zas
        the story of the death of old search servers is too long, I want to put an end to it, we have better things to do with those resources
      • 2020-05-26 14704, 2020

      • ruaok
        make the death of those services worth it, ok?
      • 2020-05-26 14711, 2020

      • ruaok
        pick some good bullets.
      • 2020-05-26 14754, 2020

      • yvanzo
        zas: we definitely have to point search.mb.o at solr, that is MBH-502.
      • 2020-05-26 14755, 2020

      • BrainzBot
      • 2020-05-26 14731, 2020

      • zas
        We have some web access stats
      • 2020-05-26 14716, 2020

      • zas
        43k hits yesterday
      • 2020-05-26 14742, 2020

      • zas
        30k were 200s
      • 2020-05-26 14750, 2020

      • ruaok
        no way. that's incredible? sad?
      • 2020-05-26 14729, 2020

      • yvanzo
        The MBS PR for that ticket seems to be mostly ready, it just gathered a bit of dust.
      • 2020-05-26 14708, 2020

      • zas
        can we take some time on this this week, and sort it out?
      • 2020-05-26 14732, 2020

      • yvanzo
        yup!
      • 2020-05-26 14732, 2020

      • BrainzGit
        [bookbrainz-site] prabalsingh24 opened pull request #428 (UserCollection…master): Merge master into UserCollection https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-05-26 14741, 2020

      • zas
        ok, tell me when
      • 2020-05-26 14758, 2020

      • yvanzo
        we need a roadmap at least
      • 2020-05-26 14722, 2020

      • yvanzo
        pointing at solr could be made on Monday along with MBS release.
      • 2020-05-26 14736, 2020

      • zas
        bitmap disabled the search indexer already: https://github.com/metabrainz/docker-server-confi…
      • 2020-05-26 14759, 2020

      • zas
        so I guess the current instances haven't been updated in a while
      • 2020-05-26 14723, 2020

      • yvanzo
        bitmap: I changed staticbrainz project on jenkins to remove previous temporary container, but the actual issue is that this container is not run and thus doesn't have a build/ directory anyway. Doesn’t seem to be in use currently anyway.
      • 2020-05-26 14732, 2020

      • ishaanshah
        iliekcomputers: Hi, please ping me when you get time
      • 2020-05-26 14717, 2020

      • jmp_music has quit
      • 2020-05-26 14758, 2020

      • iliekcomputers
        ishaanshah: hey
      • 2020-05-26 14714, 2020

      • iliekcomputers
        ruaok: thanks, I'll take a look today, after work
      • 2020-05-26 14720, 2020

      • ishaanshah
        hi
      • 2020-05-26 14724, 2020

      • ruaok
        no rush.
      • 2020-05-26 14743, 2020

      • ishaanshah
        I was working on fixing the db tests
      • 2020-05-26 14716, 2020

      • ishaanshah
        However I am running into some weird issue
      • 2020-05-26 14736, 2020

      • ishaanshah
      • 2020-05-26 14752, 2020

      • ishaanshah
        This line should insert the release statistics in the db
      • 2020-05-26 14710, 2020

      • ishaanshah
        However it does not
      • 2020-05-26 14711, 2020

      • ishaanshah
      • 2020-05-26 14731, 2020

      • ishaanshah
        I tried out inserting manually and the function seems to be working fine
      • 2020-05-26 14751, 2020

      • ishaanshah
        artist data is getting inserted, but not release
      • 2020-05-26 14701, 2020

      • iliekcomputers
        That sounds weird.
      • 2020-05-26 14706, 2020

      • ishaanshah
        maybe I am making some silly error
      • 2020-05-26 14714, 2020

      • iliekcomputers
      • 2020-05-26 14726, 2020

      • jmp_music joined the channel
      • 2020-05-26 14747, 2020

      • ishaanshah
      • 2020-05-26 14742, 2020

      • iliekcomputers
        it's probably something in the query.
      • 2020-05-26 14757, 2020

      • iliekcomputers
        i'll have to look into it, can't think of anything off the top of my head.
      • 2020-05-26 14748, 2020

      • ishaanshah
        I tried the query separately in docker container
      • 2020-05-26 14700, 2020

      • ishaanshah
        It works correctly
      • 2020-05-26 14730, 2020

      • ishaanshah
        I will take another look at it
      • 2020-05-26 14701, 2020

      • ishaanshah
        Is there any way to make sqlalchemy log if there is any error in the query?
      • 2020-05-26 14707, 2020

      • iliekcomputers
        sqlalchemy would raise an exception if the query was erroneous (ideally)
      • 2020-05-26 14730, 2020
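      On the logging question: SQLAlchemy writes the SQL it emits to the standard `sqlalchemy.engine` logger, so statement logging can be switched on through Python's logging module. A minimal sketch follows; the helper name is made up for illustration, and it only configures logging, so it is not tied to any particular engine or to the ListenBrainz code.

```python
import logging


def enable_sql_logging(level=logging.INFO):
    """Turn on SQLAlchemy statement logging via the standard logging module.

    At INFO the 'sqlalchemy.engine' logger reports statements and their
    parameters; at DEBUG it also reports result rows.
    """
    logging.basicConfig()  # attach a default handler if none is configured
    logger = logging.getLogger("sqlalchemy.engine")
    logger.setLevel(level)
    return logger


sql_logger = enable_sql_logging()
```

      Passing `echo=True` to `create_engine` is the usual shortcut for the same thing. Seeing the exact statement and bound parameters makes it easier to tell whether the release insert is being issued at all.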

      • iliekcomputers
        i'm suspicious of the query.format
      • 2020-05-26 14744, 2020

      • iliekcomputers
        maybe you could write a specific function for inserting release and see if that works
      • 2020-05-26 14758, 2020

      • iliekcomputers
        if it does, see what the difference between the two is
      • 2020-05-26 14726, 2020

      • ishaanshah
        Hmm, I will do that
      • 2020-05-26 14711, 2020

      • iliekcomputers
        and check that the release data is being passed around into the query correctly
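      The actual query is not shown in this log, so the following is only a generic illustration of the concern about query.format, not a diagnosis. It uses the standard library's sqlite3 with a hypothetical `user_stats` table and hypothetical column names: only a validated column name is formatted into the SQL text, the values are passed as bound parameters, and an unknown stats type fails loudly instead of silently doing nothing.

```python
import json
import sqlite3

# Hypothetical mapping from stats type to column name.
STATS_COLUMNS = {"artist": "artist_stats", "release": "release_stats"}


def insert_user_stats(conn, user_id, stats_type, data):
    """Store JSON stats for a user in the column matching stats_type."""
    if stats_type not in STATS_COLUMNS:
        raise ValueError("unknown stats type: %r" % stats_type)
    column = STATS_COLUMNS[stats_type]
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR IGNORE INTO user_stats (user_id) VALUES (?)",
            (user_id,),
        )
        # Only the whitelisted column name is formatted in; values are bound.
        conn.execute(
            "UPDATE user_stats SET {col} = ? WHERE user_id = ?".format(col=column),
            (json.dumps(data), user_id),
        )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user_stats ("
    "user_id INTEGER PRIMARY KEY, artist_stats TEXT, release_stats TEXT)"
)
insert_user_stats(conn, 1, "artist", {"count": 3})
insert_user_stats(conn, 1, "release", {"count": 7})
row = conn.execute(
    "SELECT artist_stats, release_stats FROM user_stats WHERE user_id = 1"
).fetchone()
```

      Comparing a dedicated release insert against a shared, formatted one in this way is one route to spotting where the two paths differ.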