#metabrainz

/

      • supersandro2000 has quit
      • 2020-09-10 25404, 2020

      • supersandro2000 joined the channel
      • 2020-09-10 25400, 2020

      • iconoclasthero joined the channel
      • 2020-09-10 25413, 2020

      • dseomn1 has quit
      • 2020-09-10 25446, 2020

      • dseomn joined the channel
      • 2020-09-10 25440, 2020

      • ishaanshah
        Morning!
      • 2020-09-10 25424, 2020

      • ishaanshah
        iliekcomputers: About the cronjob for incremental dump, if we are creating an incremental dump everyday then we should import it on the same day too
      • 2020-09-10 25455, 2020

      • ishaanshah
        Otherwise we will never import that dump at all, because before the next day's import a newer incremental dump will be created which will then get imported.
      • 2020-09-10 25438, 2020

      • sumedh joined the channel
      • 2020-09-10 25423, 2020

      • thomasross has quit
      • 2020-09-10 25457, 2020

      • sumedh has quit
      • 2020-09-10 25407, 2020

      • BrainzGit
        [listenbrainz-server] ishaanshah opened pull request #1083 (master…ishaan/listening-activity-range-update): LB-690: Minor improvements to Listening Activity graph https://github.com/metabrainz/listenbrainz-server…
      • 2020-09-10 25408, 2020

      • BrainzBot
        LB-690: Dates on the listening-activity graph don't match with the labels https://tickets.metabrainz.org/browse/LB-690
      • 2020-09-10 25445, 2020

      • nelgin
        yvanzo
      • 2020-09-10 25446, 2020

      • nelgin
        Before replication
      • 2020-09-10 25446, 2020

      • nelgin
        | search.index | 194852 |
      • 2020-09-10 25446, 2020

      • nelgin
        After replication
      • 2020-09-10 25446, 2020

      • nelgin
        | search.index | 451263 |
      • 2020-09-10 25446, 2020

      • nelgin
        There is no way that I can ever catch up.
      • 2020-09-10 25419, 2020

      • nelgin
      • 2020-09-10 25431, 2020

      • v6lur joined the channel
      • 2020-09-10 25400, 2020

      • MusicbrainzB0T joined the channel
      • 2020-09-10 25454, 2020

      • MusicbrainzB0T2 has quit
      • 2020-09-10 25431, 2020

      • diru1100
        Morning!!
      • 2020-09-10 25404, 2020

      • diru1100
        My lecturer dropped a bomb saying college starts on 21st 😬😬😬
      • 2020-09-10 25450, 2020

      • CatQuest
        sep?
      • 2020-09-10 25413, 2020

      • CatQuest
        well school has already been in session here fro 2 weeks :D
      • 2020-09-10 25426, 2020

      • CatQuest
        man am i glad i no longer have it
      • 2020-09-10 25455, 2020

      • SomalRudra
        my college started a week back
      • 2020-09-10 25439, 2020

      • zas
        It seems changes that significantly reduced traffic between gateways and mb backend servers also had an impact on web service (that wasn't obvious at start)
      • 2020-09-10 25445, 2020

      • zas
      • 2020-09-10 25410, 2020

      • zas
        not sure it can only be explained by https://github.com/metabrainz/musicbrainz-server/…
      • 2020-09-10 25442, 2020

      • zas
        bitmap: ^^
      • 2020-09-10 25435, 2020

      • yvanzo
        hi nelgin, thanks, if you want to turn it down: admin/setup-amqp-triggers uninstall && sudo docker-compose stop indexer
      • 2020-09-10 25424, 2020

      • zas
        hmmm wait, perhaps graphs don't tell the truth here, checking again
      • 2020-09-10 25431, 2020

      • zas
        ah ignore, for some reason this graph was set to stack, so visually it shows a drop on ws, but it doesn't exist
      • 2020-09-10 25435, 2020

      • diru1100
        Oh really, I have to physically attend college it seems. Online classes started 5 weeks back
      • 2020-09-10 25422, 2020

      • zas
      • 2020-09-10 25400, 2020

      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #1689 (master…MBS-11086): MBS-11086: Add icon for tabs with errors in release editor https://github.com/metabrainz/musicbrainz-server/…
      • 2020-09-10 25402, 2020

      • BrainzBot
        MBS-11086: Request: option for high-contrast-compatible indication of validation problems https://tickets.metabrainz.org/browse/MBS-11086
      • 2020-09-10 25400, 2020

      • yvanzo
        bitmap, reosarevok: updated blog post with yesterday’s hotfixes, pushed a git tag and fixed previous git tag message. Make sure to delete your local tag: git tag -d v-2020-09-07 # then fetch to get new tag
      • 2020-09-10 25416, 2020

      • nelgin
        yvanzo, well...There has to be a better way to optimize the indexing? Maybe it's time to think of a different type of database? Ever through about NoSQL?
      • 2020-09-10 25415, 2020

      • yvanzo
        nelgin: live indexing works perfectly in production, the issue is more that about setup for mirrors.
      • 2020-09-10 25442, 2020

      • jesus2099 joined the channel
      • 2020-09-10 25402, 2020

      • jesus2099
        Rotab yvanzo bitmap CatQuest: Indeed I don't use Opera 12 any more. ;)
      • 2020-09-10 25421, 2020

      • jesus2099
        reosarevok > "Sigh. I wish github was a bit better at showing where the only change is space
      • 2020-09-10 25424, 2020

      • yvanzo
        nelgin: for example, there is only one Solr instance in musicbrainz-docker whereas we have many nodes in production.
      • 2020-09-10 25401, 2020

      • pristine___
        How can I create current_user object in tests?
      • 2020-09-10 25404, 2020

      • jesus2099
      • 2020-09-10 25411, 2020

      • pristine___
        It fails at current_user.id :(
      • 2020-09-10 25422, 2020

      • reosarevok
        jesus2099: huh!
      • 2020-09-10 25424, 2020

      • reosarevok
        That's neat
      • 2020-09-10 25453, 2020

      • jesus2099
        Yes, once you know it exists, you just have to find it back on the page. :)
      • 2020-09-10 25404, 2020

      • jesus2099
        (the diff settings button)
      • 2020-09-10 25410, 2020

      • jesus2099
        bitmap: I recently added a CAA ticket because it seems the CORS headers are missing when you use the release-group API. The release API CORS headers are OK but if you use the release-group API, that forwards to the release API, then you get the error: has been blocked by CORS policy: No 'Access-Control-Allow-Origin' header is present on the
      • 2020-09-10 25411, 2020

      • jesus2099
        requested resource.
      • 2020-09-10 25417, 2020

      • jesus2099
      • 2020-09-10 25418, 2020

      • BrainzBot
        CAA-122: Enable CORS on API (web services)
      • 2020-09-10 25454, 2020

      • jesus2099
        Las time (CAA-4), the "bug" was forwarded to the (CAA teams?) who fixed it by updating CORS headers.
      • 2020-09-10 25454, 2020

      • BrainzBot
        CAA-4: Enable CORS (Cross-Origin Resource Sharing) https://tickets.metabrainz.org/browse/CAA-4
      • 2020-09-10 25418, 2020

      • jesus2099
        If I understood correctly. I'm not saying I'm 100% sure something should be fixed.
      • 2020-09-10 25411, 2020

      • shivam-kapila
        pristine___: self.temporary_login
      • 2020-09-10 25456, 2020

      • pristine___
        shivam-kapila: tried. Still not working
      • 2020-09-10 25434, 2020

      • iliekcomputers
        ishaanshah: it'll import the full dump with the same id as the incremental dump, so we should be good.
      • 2020-09-10 25459, 2020

      • iliekcomputers
        The full dump command creates a full dump with the ID of the last incremental dump
      • 2020-09-10 25448, 2020

      • iliekcomputers
        What I am worried about is this: suppose incremental dump creation fails, we request import of the newest incremental dump, we might be importing the same incremental dump twice
      • 2020-09-10 25433, 2020

      • ishaanshah
        > it'll import the full dump with the same id as the incremental dump, so we should be good.
      • 2020-09-10 25433, 2020

      • ishaanshah
        I didn't get you
      • 2020-09-10 25427, 2020

      • ishaanshah
        rn, suppose we trigger a full dump on 1st
      • 2020-09-10 25452, 2020

      • iliekcomputers
        1st of month - incremental dump with Id x is created.
      • 2020-09-10 25407, 2020

      • iliekcomputers
        Later on the 1st - full dump with Id x is created
      • 2020-09-10 25427, 2020

      • iliekcomputers
        2 - full dump with Id x is imported
      • 2020-09-10 25417, 2020

      • ishaanshah
        2 - another incremental dump is created right
      • 2020-09-10 25423, 2020

      • iliekcomputers
        We don't need to import the incremental dump with Id x because the full dump will contain the same data
      • 2020-09-10 25431, 2020

      • ishaanshah
        but we arent importing this one
      • 2020-09-10 25438, 2020

      • iliekcomputers
        Oh
      • 2020-09-10 25442, 2020

      • iliekcomputers
        Oof
      • 2020-09-10 25448, 2020

      • iliekcomputers
        My bad
      • 2020-09-10 25458, 2020

      • iliekcomputers
        Yeah, we should be importing that
      • 2020-09-10 25412, 2020

      • ishaanshah
        either we should skip generating that dump or import everyday
      • 2020-09-10 25400, 2020

      • iliekcomputers
        Yeah, I guess we need to import everyday.
      • 2020-09-10 25407, 2020

      • jesus2099 has left the channel
      • 2020-09-10 25412, 2020

      • iliekcomputers
        We need some Id validation on the spark side as well
      • 2020-09-10 25432, 2020

      • iliekcomputers
        Right now it's all dependent on the cron job and brittle
      • 2020-09-10 25459, 2020

      • ishaanshah
        hmm, so the last imported id for incremental
      • 2020-09-10 25416, 2020

      • ishaanshah
        otherwise we might end up importing it twice
      • 2020-09-10 25419, 2020

      • iliekcomputers
        Spark should store the current Id somewhere, check if the dump it's importing is greater than the ID and then import
      • 2020-09-10 25421, 2020

      • iliekcomputers
        Yeah
      • 2020-09-10 25457, 2020

      • ishaanshah
        id can be stored in hdfs only ig, cause we dont have redis in spark cluster
      • 2020-09-10 25418, 2020

      • iliekcomputers
        Hdfs makes sense to me.
      • 2020-09-10 25407, 2020

      • iliekcomputers
        Maybe make it a dataframe with history. (id, imported_timestamp, dump_type, dump_timestamp)
      • 2020-09-10 25459, 2020

      • ishaanshah
        cool, cool, I'll make a PR for it over this weekend...
      • 2020-09-10 25409, 2020

      • iliekcomputers
        Sounds good, thanks!
      • 2020-09-10 25419, 2020

      • ishaanshah
        btw, how much time did the import take?
      • 2020-09-10 25423, 2020

      • iliekcomputers
        Let's see how it runs in the meanwhile, it should still be stable this week
      • 2020-09-10 25428, 2020

      • ishaanshah
        like the copy part
      • 2020-09-10 25440, 2020

      • iliekcomputers
        ishaanshah: the copy command took ~20 min I think
      • 2020-09-10 25404, 2020

      • iliekcomputers
        It's not very scalable, but that's because of the way we store data in hdfs
      • 2020-09-10 25419, 2020

      • ishaanshah
        yep, I saw the ticket
      • 2020-09-10 25426, 2020

      • iliekcomputers
        I figure we'll have to partition the parquet files based on the listen submission timestamps
      • 2020-09-10 25439, 2020

      • iliekcomputers
        That way we can just add a new file to hdfs and we're done
      • 2020-09-10 25447, 2020

      • BrainzGit
        [musicbrainz-server] yvanzo opened pull request #1690 (master…fix-lmsg): Amend d5e3f84e62 to fix localizable string https://github.com/metabrainz/musicbrainz-server/…
      • 2020-09-10 25452, 2020

      • iliekcomputers
        But that's a pretty huge project
      • 2020-09-10 25453, 2020

      • ishaanshah
        I tried to partition the data by inserted at when I was working on incremental dumps
      • 2020-09-10 25413, 2020

      • ishaanshah
        but it ended up taking far more time, so decided against it
      • 2020-09-10 25420, 2020

      • iliekcomputers
        Yeah
      • 2020-09-10 25448, 2020

      • iliekcomputers
        I think we're OK in the short term, but it'll have to get done eventually
      • 2020-09-10 25459, 2020

      • ishaanshah
        yep, agreed
      • 2020-09-10 25411, 2020

      • nelgin
        yvanzo can I start up more instances to try and clear the backlog?
      • 2020-09-10 25419, 2020

      • yvanzo
        nelgin: no, docker compose supports setting a number of 'replicas' per service, but the current image for the 'search' service doesn't support it yet.
      • 2020-09-10 25426, 2020

      • ruaok
      • 2020-09-10 25401, 2020

      • yvanzo
        I agree on “A conversation could change a life.” ;)
      • 2020-09-10 25415, 2020

      • ruaok
        true dat
      • 2020-09-10 25415, 2020

      • pristine___
        Cute
      • 2020-09-10 25453, 2020

      • yvanzo
        nelgin: I’m not convinced this is the main bottleneck either, this is something we will likely be looking at this afternoon.
      • 2020-09-10 25401, 2020

      • nelgin
        OK, let me know if you want me to pull any logs, I'm about to go to bed.
      • 2020-09-10 25456, 2020

      • yvanzo
        Thanks
      • 2020-09-10 25442, 2020

      • nelgin
        Then I'll be back in 6 hours.
      • 2020-09-10 25451, 2020

      • nelgin
        Just finishing up a glass of water
      • 2020-09-10 25405, 2020

      • reosarevok
        yvanzo: does https://github.com/metabrainz/musicbrainz-server/… need spaces at the end / beginning of the segments?
      • 2020-09-10 25421, 2020

      • reosarevok
        (sorry, I forgot we can't have nice things in perl :p )
      • 2020-09-10 25412, 2020

      • yvanzo
        reosarevok: good catch
      • 2020-09-10 25425, 2020

      • yvanzo
        reosarevok: fixed it
      • 2020-09-10 25459, 2020

      • shivam-kapila
        pristine___: can you show the test
      • 2020-09-10 25401, 2020

      • Gazooo794 has quit
      • 2020-09-10 25441, 2020

      • Gazooo794 joined the channel
      • 2020-09-10 25437, 2020

      • BrainzGit
        [musicbrainz-server] yvanzo merged pull request #1690 (master…fix-lmsg): Amend d5e3f84e62 to fix localizable string https://github.com/metabrainz/musicbrainz-server/…
      • 2020-09-10 25408, 2020

      • supersandro2000 has quit
      • 2020-09-10 25428, 2020

      • supersandro2000 joined the channel
      • 2020-09-10 25402, 2020

      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #511 (master…collection-change-default-privacy): Collection: Change default privacy to public https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-09-10 25459, 2020

      • BrainzGit
        [bookbrainz-site] MonkeyDo merged pull request #510 (master…fix-edition-group-bug): fix: convert collectionType to appropiate format before param-validation https://github.com/bookbrainz/bookbrainz-site/pul…
      • 2020-09-10 25432, 2020

      • travis-ci joined the channel
      • 2020-09-10 25432, 2020

      • travis-ci
        Project bookbrainz-site build #3430: passed in 4 min 53 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…
      • 2020-09-10 25432, 2020

      • travis-ci has left the channel
      • 2020-09-10 25414, 2020

      • travis-ci joined the channel
      • 2020-09-10 25414, 2020

      • travis-ci
        Project bookbrainz-site build #3431: passed in 4 min 37 sec: https://travis-ci.org/bookbrainz/bookbrainz-site/…
      • 2020-09-10 25414, 2020

      • travis-ci has left the channel
      • 2020-09-10 25455, 2020

      • pristine___
        ruaok: hey. I have raised InternalServerError if labs.api.listenbrainz.org does not return 200 as status code or if there is any other problem with it. What do you think?
      • 2020-09-10 25449, 2020

      • ruaok
        hmmm.
      • 2020-09-10 25431, 2020

      • ruaok
        if labs returns a 400 error, then the problem is on the caller (your) side of things. Should that be an ISE?
      • 2020-09-10 25404, 2020

      • reosarevok
        "External Server Error" :D
      • 2020-09-10 25432, 2020

      • BrainzGit
        [listenbrainz-server] mayhem merged pull request #1080 (master…distinct-similar-top-artist): [LB-703] Similar and top artist should be distinct for a user https://github.com/metabrainz/listenbrainz-server…
      • 2020-09-10 25432, 2020

      • BrainzBot
        LB-703: Top artists and similar artist should be distinct https://tickets.metabrainz.org/browse/LB-703
      • 2020-09-10 25401, 2020

      • ruaok
        pristine___: should I reload request_consumer since I merged the above?
      • 2020-09-10 25410, 2020

      • pristine___
        Yup.
      • 2020-09-10 25441, 2020

      • pristine___
        that's shouldn't be an ISE
      • 2020-09-10 25453, 2020

      • pristine___
        ruaok: whenever any such error is raised do we get an email?
      • 2020-09-10 25413, 2020

      • ruaok
        which component does your question pertain to?
      • 2020-09-10 25414, 2020

      • pristine___
        ISE, not found any such error. Is it just displayed to the user or do we also get an email?
      • 2020-09-10 25420, 2020

      • pristine___
        I am asking this because