#metabrainz

/

      • Lotheric has quit
      • 2021-04-14 10448, 2021

      • Lotheric joined the channel
      • 2021-04-14 10443, 2021

      • thomasross joined the channel
      • 2021-04-14 10440, 2021

      • d4rkie joined the channel
      • 2021-04-14 10437, 2021

      • D4RK has quit
      • 2021-04-14 10413, 2021

      • _lucifer
        ruaok: yes, all nodes are up and the cluster looks sane. however, unless we submit a request we cannot be sure it works as expected. i did not start the request consumer on the newleader because i have a few doubts about how we should do it. do we want to run it directly on newleader or in a container on it?
      • 2021-04-14 10432, 2021

      • thomasross has quit
      • 2021-04-14 10407, 2021

      • RikkoM has quit
      • 2021-04-14 10416, 2021

      • flamingspinach has quit
      • 2021-04-14 10433, 2021

      • flamingspinach joined the channel
      • 2021-04-14 10434, 2021

      • sumedh joined the channel
      • 2021-04-14 10442, 2021

      • BrainzGit
        [musicbrainz-server] reosarevok merged pull request #2057 (master…edit-preview-id-warning): Fix Catalyst warning: id is missing in edit previews https://github.com/metabrainz/musicbrainz-server/…
      • 2021-04-14 10416, 2021

      • RikkoM joined the channel
      • 2021-04-14 10406, 2021

      • RikkoM has quit
      • 2021-04-14 10441, 2021

      • adhi001 joined the channel
      • 2021-04-14 10452, 2021

      • yef has quit
      • 2021-04-14 10402, 2021

      • yef joined the channel
      • 2021-04-14 10402, 2021

      • yef has quit
      • 2021-04-14 10402, 2021

      • yef joined the channel
      • 2021-04-14 10423, 2021

      • BrainzGit
        [musicbrainz-server] reosarevok merged pull request #2061 (master…empty-review-markdown-warning): Fix Catalyst warning: don't pass undef review text to markdown https://github.com/metabrainz/musicbrainz-server/…
      • 2021-04-14 10459, 2021

      • iliekcomputers has quit
      • 2021-04-14 10409, 2021

      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #2062 (master…missing-link-type-id-warning): Fix Catalyst warning: don't try undef linktype id to access $loaded https://github.com/metabrainz/musicbrainz-server/…
      • 2021-04-14 10422, 2021

      • shivam-kapila has quit
      • 2021-04-14 10442, 2021

      • shivam-kapila joined the channel
      • 2021-04-14 10406, 2021

      • iliekcomputers joined the channel
      • 2021-04-14 10411, 2021

      • sumedh has quit
      • 2021-04-14 10449, 2021

      • RikkoM joined the channel
      • 2021-04-14 10452, 2021

      • shivam-kapila
        _lucifer: good work 🎉
      • 2021-04-14 10415, 2021

      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #2063 (master…EditReleaseEvents-map-keys): Add keys to EditReleaseEvents to stop warning https://github.com/metabrainz/musicbrainz-server/…
      • 2021-04-14 10456, 2021

      • reosarevok
      • 2021-04-14 10406, 2021

      • reosarevok
        yvanzo, bitmap: we only have 4 tickets ready for beta. Do you think we can improve that today? :)
      • 2021-04-14 10429, 2021

      • ruaok
        mooin!
      • 2021-04-14 10440, 2021

      • ruaok
        _lucifer: not sure in this case. what do you think?
      • 2021-04-14 10403, 2021

      • _lucifer
        ruaok, i do not know about the networking concerns in case we use docker for request consumer. if there aren't any, then maybe let's use docker because we can just bring down the image and start a new one when dependencies etc. change, running directly could make these change a bit more involved sometimes.
      • 2021-04-14 10447, 2021

      • _lucifer
        but we'll also need to figure how to make docker talk back and forth with the host. i don't know if that is easy or hard.
      • 2021-04-14 10450, 2021

      • ruaok
        I see no network concerns and I think you have a good point about pulling the container, so lets use docket.
      • 2021-04-14 10429, 2021

      • ruaok
        for that, put the container in host mode -- in host mode the container runs inside the normal port mapping of the host.
      • 2021-04-14 10447, 2021

      • ruaok
        and the container has direct access to the host network layer.
      • 2021-04-14 10402, 2021

      • _lucifer
        interesting, let's try docker then.
      • 2021-04-14 10420, 2021

      • ruaok
      • 2021-04-14 10444, 2021

      • _lucifer
        i just figured out a way to test the cluster without the consumer and found an issue.
      • 2021-04-14 10400, 2021

      • ruaok
        ok.
      • 2021-04-14 10417, 2021

      • _lucifer
        the namenode is having trouble connecting to datanodes. again could be firewall or misconfigration
      • 2021-04-14 10429, 2021

      • _lucifer
        i am rechecking the configurations first
      • 2021-04-14 10432, 2021

      • ruaok
        when you're ready to test for real, just let me know. we need to turn off the old consumer, but then we can start sending requests from lemmy
      • 2021-04-14 10438, 2021

      • ruaok
        k
      • 2021-04-14 10402, 2021

      • ruaok
        have you also change the configuration of the nodes because we have loads more ram?
      • 2021-04-14 10449, 2021

      • _lucifer
        no that can be done in the spark-submit file we have in LB repo
      • 2021-04-14 10437, 2021

      • _lucifer
        the values there will override the defaults anyways so i thought not to meddle with those
      • 2021-04-14 10433, 2021

      • ruaok
        ok, then we'll need to remember those when we install the consumer
      • 2021-04-14 10449, 2021

      • _lucifer
        that script is executed inside the request consumer.
      • 2021-04-14 10450, 2021

      • _lucifer
        yes
      • 2021-04-14 10429, 2021

      • adhi001 has quit
      • 2021-04-14 10404, 2021

      • ruaok does some weeding out of GSoC apps
      • 2021-04-14 10401, 2021

      • ruaok
        hmm. ok. we got one covid project proposal.
      • 2021-04-14 10404, 2021

      • ruaok
        wait what?
      • 2021-04-14 10431, 2021

      • _lucifer
        lol i saw that.
      • 2021-04-14 10403, 2021

      • _lucifer
        one org got a draft of Egypt's Budget 😆
      • 2021-04-14 10417, 2021

      • ruaok
        well, I'm curious to see what the landscape looks like after yvanzo and Mr_Monkey toss out the not worthy MB and BB projects.
      • 2021-04-14 10433, 2021

      • MRiddickW has quit
      • 2021-04-14 10419, 2021

      • ruaok
        alastairp: Mr_Monkey I've been working with timescale last night to suss out a few things while taking a look at the listen counts.
      • 2021-04-14 10432, 2021

      • ruaok
        I've found a lot of interesting stuff to share. let me pack up and head to the office.
      • 2021-04-14 10433, 2021

      • alastairp
        cool
      • 2021-04-14 10435, 2021

      • alastairp
        see you here
      • 2021-04-14 10452, 2021

      • ruaok
        loading a users page on LB should already be faster now: https://listenbrainz.org/user/gracz54
      • 2021-04-14 10427, 2021

      • ruaok
        not that I've seen.
      • 2021-04-14 10415, 2021

      • alastairp
        maybe .de will contact you since it's hosted at hetzner
      • 2021-04-14 10435, 2021

      • ruaok
        did they provide a link of any kind?
      • 2021-04-14 10411, 2021

      • alastairp
        will show you in the office
      • 2021-04-14 10417, 2021

      • ruaok
        k
      • 2021-04-14 10450, 2021

      • _lucifer
        ruaok, tested with a sample jar. cluster is running smoothly, we can proceed on setting up the new request consumer
      • 2021-04-14 10401, 2021

      • ruaok
        great, go for it.
      • 2021-04-14 10443, 2021

      • shivam-kapila
        Covid project. But why?
      • 2021-04-14 10419, 2021

      • shivam-kapila
        it> one org got a draft of Egypt's Budget 😆
      • 2021-04-14 10419, 2021

      • shivam-kapila
        Another got medical chapters as a pdf
      • 2021-04-14 10451, 2021

      • adhawkins has quit
      • 2021-04-14 10445, 2021

      • adhawkins joined the channel
      • 2021-04-14 10447, 2021

      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #2064 (master…normalize-warnings): Fix catalyst warning: don't try to trim/sanitize undef "strings" https://github.com/metabrainz/musicbrainz-server/…
      • 2021-04-14 10423, 2021

      • alastairp
      • 2021-04-14 10454, 2021

      • alastairp
        if you have unicode and want to turn it to text, `errors='ignore'` should skip over any non-encodable characters
      • 2021-04-14 10402, 2021

      • alastairp
        uh, unicode -> bytes I mean
      • 2021-04-14 10416, 2021

      • alastairp
        but I'm not sure if at the point of this error handler the data is as a string or as json
      • 2021-04-14 10453, 2021

      • alastairp
      • 2021-04-14 10423, 2021

      • ruaok
        its an exception object that I cast to str().
      • 2021-04-14 10430, 2021

      • alastairp
        although, now I see that this error message (`raise APIBadRequest("Listen submission contains invalid characters.")`) doesn't include the contents
      • 2021-04-14 10400, 2021

      • alastairp
        try str(message, errors='ignore')
      • 2021-04-14 10447, 2021

      • ruaok
      • 2021-04-14 10403, 2021

      • alastairp
        hmm, now I'm a bit confused by that PR
      • 2021-04-14 10415, 2021

      • ruaok
        sentry agrees.
      • 2021-04-14 10417, 2021

      • alastairp
        _lucifer: did we stop receiving the "enter null into timescale" errors in sentry?
      • 2021-04-14 10446, 2021

      • alastairp
        because PR 1371 wraps `_send_listens_to_queue`, which adds items to rabbitmq, not sending to postgres
      • 2021-04-14 10457, 2021

      • ruaok
        ah, no. the error is now being formatted correctly, but not fixed. see sentry.
      • 2021-04-14 10409, 2021

      • _lucifer
      • 2021-04-14 10436, 2021

      • _lucifer
        last seen 12 days ago.
      • 2021-04-14 10448, 2021

      • alastairp
        yeah, right
      • 2021-04-14 10453, 2021

      • alastairp
      • 2021-04-14 10412, 2021

      • alastairp
        same error, but in timescale writer
      • 2021-04-14 10421, 2021

      • _lucifer
        interesting, to confirm these listens are coming from the API?
      • 2021-04-14 10444, 2021

      • _lucifer
        no i think `_send_listens_to_queue` wraps both queues. https://github.com/metabrainz/listenbrainz-server…
      • 2021-04-14 10401, 2021

      • BrainzGit
        [listenbrainz-server] amCap1712 opened pull request #1388 (master…spark-new-cluster): Setup request consumer for new cluster setup https://github.com/metabrainz/listenbrainz-server…
      • 2021-04-14 10417, 2021

      • _lucifer
        ruaok: can you review the above PR? i'll build and test the request consumer then. i haven't changed memory configurations yet. will do that once request consumer works.
      • 2021-04-14 10432, 2021

      • alastairp
        _lucifer: what do you mean it wraps both queues?
      • 2021-04-14 10422, 2021

      • ruaok
        _lucifer: will try soon. we're in the office and have heaps on.
      • 2021-04-14 10442, 2021

      • alastairp
        we were just talking about this, we probably need to re-address this PR, and check for null characters in the input in the webserver, so that we can return http400 to the user
      • 2021-04-14 10422, 2021

      • alastairp
        because if we have webserver -> push to queue, and then a separate process that reads from queue -> push to database, it's the wrong place to catch the error because now it's disconnected from the user request
      • 2021-04-14 10441, 2021

      • alastairp
        however I'm still really confused as to how we managed to catch a postgres error in the webserver and stop these errors from happening
      • 2021-04-14 10443, 2021

      • _lucifer
        I meant that _send_listens_to_queue checks and send the listens to the appropriate queue. oh ok! i understand now what you meant
      • 2021-04-14 10414, 2021

      • _lucifer
        alastairp: https://sentry.metabrainz.org/metabrainz/listenbr… i see those errors happened in messybrainz
      • 2021-04-14 10420, 2021

      • _lucifer
        not in timescale_writer
      • 2021-04-14 10452, 2021

      • _lucifer
        so they got caught before being sent to the queue.
      • 2021-04-14 10432, 2021

      • RikkoM has quit
      • 2021-04-14 10455, 2021

      • ruaok
      • 2021-04-14 10406, 2021

      • ruaok
      • 2021-04-14 10408, 2021

      • alastairp
        _lucifer: messybrainz!
      • 2021-04-14 10422, 2021

      • alastairp
        that was the missing part of the process that we skipped :D
      • 2021-04-14 10423, 2021

      • alastairp
        thanks
      • 2021-04-14 10437, 2021

      • _lucifer
        alastairp: ruaok: once you have some time, let me know if you have any further comments on https://github.com/metabrainz/listenbrainz-server… . i'd like to deploy it and run the sql updates.
      • 2021-04-14 10453, 2021

      • alastairp
        ruaok: right, that looks OK. I was just unsure about the .rollback()
      • 2021-04-14 10405, 2021

      • alastairp
        this connection exists only for a batch of listens, right?
      • 2021-04-14 10413, 2021

      • alastairp
        not for many batches
      • 2021-04-14 10433, 2021

      • alastairp
        question: this batch will have many listens from many people. if it fails, what happens?
      • 2021-04-14 10444, 2021

      • alastairp
        all listens fail to insert, or just the one with \0 ?
      • 2021-04-14 10400, 2021

      • ruaok
        all fail.
      • 2021-04-14 10420, 2021

      • ruaok
        because this is not the place for us to be validating listens.
      • 2021-04-14 10425, 2021

      • ruaok
        that should happen earlier.
      • 2021-04-14 10428, 2021

      • alastairp
        yeah, sure
      • 2021-04-14 10448, 2021

      • alastairp
        but does this mean that we will loose other users' listens with this fix?
      • 2021-04-14 10458, 2021

      • ruaok
        _lucifer: on 1384 how do you plan to test this migration? run the new music services tables script but not the migrate existing users script?
      • 2021-04-14 10408, 2021

      • _lucifer
        ruaok, i think we can run both as the migrate existing users is just copy existing users.
      • 2021-04-14 10434, 2021

      • _lucifer
        on test.lb it'll read the users from the new table and on prod from the old table
      • 2021-04-14 10436, 2021

      • ruaok
        and then truncate the table before the actual release of this into produtction?
      • 2021-04-14 10442, 2021

      • _lucifer
        yes
      • 2021-04-14 10446, 2021

      • ruaok
        ok, makes sense.
      • 2021-04-14 10448, 2021

      • ruaok
        aight, given the current context and release plan, this is fine by me.
      • 2021-04-14 10437, 2021

      • _lucifer
        awesome, thanks!
      • 2021-04-14 10416, 2021

      • _lucifer
        while i am at it, do you have any PRs you want to include in the release?
      • 2021-04-14 10419, 2021

      • _lucifer
        Mr_Monkey: ^
      • 2021-04-14 10440, 2021

      • ruaok
        the ones that are critical are merged already.
      • 2021-04-14 10447, 2021

      • ruaok
        (user similarity page)
      • 2021-04-14 10401, 2021

      • alastairp
        oh, you know what I think caused this
      • 2021-04-14 10404, 2021

      • ruaok
        reading 1388 right now... I see you're creating newcluster files in an effort to not overwrite the existing cluster setup for now, but that we'll get rid of these when we ditch the old cluster?
      • 2021-04-14 10409, 2021

      • alastairp
        I'm having flashbacks to the id3 spec
      • 2021-04-14 10411, 2021

      • sumedh joined the channel
      • 2021-04-14 10428, 2021

      • _lucifer
        yes
      • 2021-04-14 10405, 2021

      • _lucifer
        we have to configure the test setups as well before removing the old files
      • 2021-04-14 10458, 2021

      • _lucifer
        so i created new ones, we can cleanup once the prod and test up are running well
      • 2021-04-14 10410, 2021

      • alastairp
        tests running
      • 2021-04-14 10410, 2021

      • ruaok
        ok, lgtm for testing purposes.
      • 2021-04-14 10423, 2021

      • alastairp
        there is a VW car called the id3
      • 2021-04-14 10430, 2021

      • alastairp
        and so now it's difficult to search for 'id3 spec'