#metabrainz

/

      • _lucifer
        ah not that one
      • 2021-05-17 13719, 2021

      • _lucifer
      • 2021-05-17 13734, 2021

      • _lucifer
        i downloaded the metadata of this dataset and uploaded it to similarity.bono
      • 2021-05-17 13731, 2021

      • alastairp
        right
      • 2021-05-17 13748, 2021

      • _lucifer
      • 2021-05-17 13757, 2021

      • _lucifer
        reproduced the error just now
      • 2021-05-17 13757, 2021

      • alastairp
        so just from looking at the error message it appears that it might be complaining about such a large number of parameters passed in?
      • 2021-05-17 13734, 2021

      • _lucifer
        probably. here the parameters are recording ids i think?
      • 2021-05-17 13715, 2021

      • alastairp
        yes
      • 2021-05-17 13720, 2021

      • alastairp
        recording id, offset
      • 2021-05-17 13715, 2021

      • alastairp
        so, I think I left a comment on the PR - maybe we need to load this in chunks
      • 2021-05-17 13751, 2021

      • alastairp
        if there are more than x items (10,000?) then load them in chunks
      • 2021-05-17 13737, 2021

      • alastairp
        however, I don't understand the error, because max_stack_depth appears to be related to recursive functions, and `load_many_low_level` doesn't seem to do anything unexpected
      • 2021-05-17 13740, 2021

      • bitmap
        yvanzo: reosarevok: hey, we'll start at meeting time in a couple hours if that's okay. I'll set up a banner message
      • 2021-05-17 13704, 2021

      • reosarevok
        Sure! You mean preparations or release?
      • 2021-05-17 13723, 2021

      • reosarevok
        (as in, do we start the preparations nowish or in 2h)
      • 2021-05-17 13757, 2021

      • bitmap
        we can start preparations nowish, but finish dinner first :)
      • 2021-05-17 13717, 2021

      • _lucifer
        alastairp, yeah right. i haven't dived very deep in the code yet so not i am not sure either what's happening. i'll start looking into the PR and then see how we can chunk it.
      • 2021-05-17 13736, 2021

      • alastairp
        I think chunking is probably going to be the easiest way, the question will be how big can we make the limit to minimise number of round trips to pg. there is a function called `chunks` copied in a few places in that source, we should move it to a utils module so that we can use it from other places
      • 2021-05-17 13751, 2021

      • mglubb joined the channel
      • 2021-05-17 13754, 2021

      • _lucifer
        makes sense
      • 2021-05-17 13702, 2021

      • ruaok
        uh oh. hi mglubb !
      • 2021-05-17 13749, 2021

      • ruaok
      • 2021-05-17 13702, 2021

      • ruaok
        _lucifer: I should be able to import a new dump into the spark cluster now, yes?
      • 2021-05-17 13705, 2021

      • yvanzo
        bitmap: that works for me, thanks!
      • 2021-05-17 13715, 2021

      • _lucifer
        ruaok: \o/, yes
      • 2021-05-17 13715, 2021

      • bitmap
        great
      • 2021-05-17 13739, 2021

      • ruaok
        bitmap: I had a look atthe migration doc, but I'm quite out of date on that... but looks... complicated.
      • 2021-05-17 13706, 2021

      • ruaok
        as you may have seen, we've had a lot of drama already today. we're your lightning rod for today. 🤞
      • 2021-05-17 13708, 2021

      • bitmap
        yea, MB has so many moving parts nowadays
      • 2021-05-17 13717, 2021

      • ruaok
        I feel yer pain.
      • 2021-05-17 13728, 2021

      • bitmap
        haha
      • 2021-05-17 13733, 2021

      • ruaok
        I remember the day when we had a web server and a DB.
      • 2021-05-17 13744, 2021

      • mglubb has quit
      • 2021-05-17 13756, 2021

      • mglubb joined the channel
      • 2021-05-17 13733, 2021

      • bitmap
        how quaint!
      • 2021-05-17 13748, 2021

      • mglubb
        :) hi ruaok. Don't panic! Just checking in on the ETA for the schema change
      • 2021-05-17 13757, 2021

      • ruaok
        yeah. also the days when there was no traffic.
      • 2021-05-17 13710, 2021

      • ruaok
        mglubb: too late. today's been a shit day already. :)
      • 2021-05-17 13712, 2021

      • bitmap
        mglubb: we're on schedule to start for 17 UTC at the moment
      • 2021-05-17 13729, 2021

      • bitmap
        2 hours from now :)
      • 2021-05-17 13741, 2021

      • mglubb
        :) Sorry to hear that. Thanks bitmap!
      • 2021-05-17 13707, 2021

      • ruaok grumbles about the global chip shortage
      • 2021-05-17 13755, 2021

      • bitmap
        I'm not 100% sure what to say for an estimated downtime. I think the database backup alone will take ~30 minutes...
      • 2021-05-17 13719, 2021

      • bitmap
        plus this time we have to build some new materialized tables after the upgrade finishes
      • 2021-05-17 13706, 2021

      • bitmap
        maybe we can put MB in read-only mode during the DB backup rathern than 503
      • 2021-05-17 13718, 2021

      • ruaok
        +1 bitmap
      • 2021-05-17 13759, 2021

      • ruaok
        _lucifer: import requested.
      • 2021-05-17 13725, 2021

      • _lucifer
        uhhh, why is ssh newleader.listenbrainz.org not working
      • 2021-05-17 13738, 2021

      • ruaok
        sorry, I forgot to tell you.
      • 2021-05-17 13738, 2021

      • _lucifer
      • 2021-05-17 13748, 2021

      • ruaok
        I fixed it. newleader is now leader.
      • 2021-05-17 13757, 2021

      • _lucifer
        oh great! thanks!
      • 2021-05-17 13739, 2021

      • _lucifer
        but it didn't switch correctly i think or do i need to do something on my end as well? getting DNS conflict error
      • 2021-05-17 13750, 2021

      • ruaok
        an SSH warning?
      • 2021-05-17 13710, 2021

      • ruaok
        " WARNING: POSSIBLE DNS SPOOFING DETECTED!" ?
      • 2021-05-17 13712, 2021

      • _lucifer
        yes
      • 2021-05-17 13723, 2021

      • ruaok
        look for `Offending key for IP in /Users/robert/.ssh/known_hosts:161`
      • 2021-05-17 13732, 2021

      • ruaok
        then edit that file and remove the line.
      • 2021-05-17 13736, 2021

      • _lucifer
        👍
      • 2021-05-17 13737, 2021

      • ruaok
        line 161 in my case.
      • 2021-05-17 13751, 2021

      • _lucifer
        works now! thanks!
      • 2021-05-17 13753, 2021

      • _lucifer
        2021-05-17 15:08:16,302 listenbrainz_spark.ftp.download INFO Downloading listenbrainz-listens-dump-440-20210516-114150-spark-full.tar.xz from FTP...
      • 2021-05-17 13704, 2021

      • _lucifer
        download started, \o/
      • 2021-05-17 13714, 2021

      • ruaok is tailing spark.log as well
      • 2021-05-17 13743, 2021

      • _lucifer
        alastairp, the people in #docker suggested me to file a bug in buildkit repo regarding the cache from issues if its reproducible
      • 2021-05-17 13718, 2021

      • alastairp
        _lucifer: great idea, let's try and reproduce it!
      • 2021-05-17 13722, 2021

      • _lucifer
        currently i am unable to reproduce it locally so not filing currently
      • 2021-05-17 13728, 2021

      • alastairp
        yeah, fine
      • 2021-05-17 13715, 2021

      • _lucifer
        also i noticed one thing, the command being set in the errorneous image is the run command of listenbrainz-base
      • 2021-05-17 13737, 2021

      • alastairp
        right, that's what I saw. it's the last RUN
      • 2021-05-17 13747, 2021

      • alastairp
        however, the run command is CMD, not RUN
      • 2021-05-17 13736, 2021

      • ruaok
        I made that mistake in a private project the other day. that was cofusing.
      • 2021-05-17 13740, 2021

      • ruaok
        confusing too
      • 2021-05-17 13700, 2021

      • ruaok
        zas: I don't really buy hetzner's "suspected fan" comment: https://stats.metabrainz.org/d/mPeQpO7iz/alerts-c…
      • 2021-05-17 13725, 2021

      • zas
        we don't have temperature before they changed the fan (I fixed it, a kernel module was missing)
      • 2021-05-17 13746, 2021

      • ruaok
        oh.
      • 2021-05-17 13708, 2021

      • ruaok
        well, I'm going to keep it under heavy load for 1000s and if it passes, we'll resume moving services to it.
      • 2021-05-17 13730, 2021

      • zas
        the graph looks normal to me, it follows load, cpu isn't throttling (I just checked)
      • 2021-05-17 13755, 2021

      • ruaok
        yeah, me too.
      • 2021-05-17 13702, 2021

      • ruaok
        I'll let it finish.
      • 2021-05-17 13734, 2021

      • zas
        hetzner buys crappy cpu fans, likely they have a whole stock of those....
      • 2021-05-17 13705, 2021

      • ruaok
        I bet they replace them with better fans than the factory fans.. I just don't understand why they dont do it on install.
      • 2021-05-17 13727, 2021

      • ruaok
        it would save them a lot of money/customer aggrevation.
      • 2021-05-17 13712, 2021

      • ruaok
        and it does jive with what we were doing -- loads of heavy compression just before crash.
      • 2021-05-17 13758, 2021

      • ruaok
        alastairp: I'm going to move redis back to lemmy. our timestamps are all invalid on boingo and now will need to be recomputed. I'll do that on lemmy now that it looks like lemmy is happy again.
      • 2021-05-17 13712, 2021

      • alastairp
        👍
      • 2021-05-17 13742, 2021

      • alastairp
        I forgot about that data. not much we could have done about it in the moment
      • 2021-05-17 13746, 2021

      • ruaok
        1000s test complete. all good.
      • 2021-05-17 13711, 2021

      • alastairp
        possible advantage to having a dedicated redis cluster spread over a few machines to ensure that we don't lose all of the data at once in case of an outage
      • 2021-05-17 13719, 2021

      • ruaok
        yeah.
      • 2021-05-17 13734, 2021

      • ruaok
        all services moved back, _lucifer alastairp
      • 2021-05-17 13747, 2021

      • ruaok
        will clean up docker-server-configs
      • 2021-05-17 13731, 2021

      • _lucifer
        !m ruaok
      • 2021-05-17 13731, 2021

      • BrainzBot
        You're doing good work, ruaok!
      • 2021-05-17 13748, 2021

      • alastairp
        > Informe Económico/Fiscal 2020
      • 2021-05-17 13754, 2021

      • alastairp
        I'd rather be fixing melting servers
      • 2021-05-17 13722, 2021

      • ruaok
        should I make a server melt for you?
      • 2021-05-17 13725, 2021

      • BrainzGit
        [listenbrainz-server] mhor opened pull request #1466 (master…fix-parameters-types-declaration): Fix "/artist-map" parameters types declaration https://github.com/metabrainz/listenbrainz-server…
      • 2021-05-17 13737, 2021

      • alastairp
        if it means I can stop working on my tax, that'd be great
      • 2021-05-17 13703, 2021

      • ruaok goes to hug his accountant
      • 2021-05-17 13736, 2021

      • alastairp
        I just emailed my accountant saying "hey, this doesn't look right...". let's see what the response is
      • 2021-05-17 13710, 2021

      • ruaok
        and I just tried telling our landlord at the office to start pulling funds from our belgian account, not spanish.
      • 2021-05-17 13722, 2021

      • ruaok
        I think I can hear them laughing all the way across town.
      • 2021-05-17 13711, 2021

      • ruaok goes to do some less stressful coding
      • 2021-05-17 13717, 2021

      • ruaok
        _lucifer: michael ran out of disk. HTH?
      • 2021-05-17 13706, 2021

      • _lucifer
        ruaok: /dev/mapper/vg0-tmp 22G 22G 0 100% /tmp
      • 2021-05-17 13718, 2021

      • _lucifer
        temp ran out of space not sure why though...
      • 2021-05-17 13736, 2021

      • ruaok
        can we move spark's tmp use elsewhere?
      • 2021-05-17 13744, 2021

      • ruaok
        22G is clearly not enough.
      • 2021-05-17 13714, 2021

      • _lucifer
        i already have moved that. its in /data/tmp
      • 2021-05-17 13726, 2021

      • _lucifer
        i do not know what is using /tml
      • 2021-05-17 13731, 2021

      • _lucifer
        */tmp
      • 2021-05-17 13721, 2021

      • ruaok
        /tmp/tmpxpjctl_i is using all teh sapce.
      • 2021-05-17 13736, 2021

      • _lucifer
        yeah just saw that
      • 2021-05-17 13713, 2021

      • _lucifer
        umm that has the spark download tar.
      • 2021-05-17 13726, 2021

      • _lucifer
        maybe its python's temp dir?
      • 2021-05-17 13739, 2021

      • ruaok
        quite likely yes.
      • 2021-05-17 13751, 2021

      • _lucifer
        i'll change that.
      • 2021-05-17 13732, 2021

      • ruaok
        +1
      • 2021-05-17 13729, 2021

      • BrainzGit
        [listenbrainz-server] MonkeyDo opened pull request #1467 (master…monkey-STFU-brainzplayer): Stop BrainzPlayer in other tabs/windows https://github.com/metabrainz/listenbrainz-server…
      • 2021-05-17 13701, 2021

      • ruaok
        LB functionality fully restored.
      • 2021-05-17 13738, 2021

      • _lucifer
        ruaok, alastairp: what's the way to set an env variable so that its available always not just login shell?
      • 2021-05-17 13756, 2021

      • alastairp
        what do you mean "always"?
      • 2021-05-17 13703, 2021

      • ruaok
        I think you want .bashrc
      • 2021-05-17 13713, 2021

      • ruaok
        which gets executed when a new shell is created.
      • 2021-05-17 13727, 2021

      • _lucifer
        like if we set one using docker -e, any running process can see it.
      • 2021-05-17 13732, 2021

      • ruaok
        (as opposed to .login which gets executed when a user logs in)
      • 2021-05-17 13751, 2021

      • ruaok
        _lucifer: you can't do that. env vars are shell specific.
      • 2021-05-17 13753, 2021

      • alastairp
        _lucifer: but is this something that you're running with a process supervisor? or in a shell?
      • 2021-05-17 13708, 2021

      • alastairp
        the answer is normally "it depends on what tool you're using"
      • 2021-05-17 13740, 2021

      • _lucifer
        oh! i want to set `TMPDIR` env variable so that the python changes the tempdir.
      • 2021-05-17 13752, 2021

      • alastairp
        and what is running python?
      • 2021-05-17 13755, 2021

      • _lucifer
        spark
      • 2021-05-17 13700, 2021

      • alastairp
        and what is running spark?
      • 2021-05-17 13718, 2021

      • _lucifer
        we run spark-submit script and that handles everything
      • 2021-05-17 13731, 2021

      • _lucifer
        the script comes bundled with spark
      • 2021-05-17 13725, 2021

      • alastairp
      • 2021-05-17 13737, 2021

      • alastairp
        I don't know enough about how spark works to know which of these options might work
      • 2021-05-17 13702, 2021

      • alastairp
        in short, there's no generic way to add environment variables to a single file and have them appear in every process that starts
      • 2021-05-17 13725, 2021

      • _lucifer
        ah cool, i was wondering if there's something one works for all. as there isn't i look for spark specific solution.
      • 2021-05-17 13727, 2021

      • _lucifer
        thanks!
      • 2021-05-17 13726, 2021

      • sumedh has quit
      • 2021-05-17 13714, 2021

      • BrainzGit
        [sir] release v2.1.0 has been published by yvanzo: https://github.com/metabrainz/sir/releases/tag/v2…
      • 2021-05-17 13721, 2021

      • BrainzGit
        [listenbrainz-server] alastair merged pull request #1466 (master…fix-parameters-types-declaration): Fix "/artist-map" parameters types declaration https://github.com/metabrainz/listenbrainz-server…
      • 2021-05-17 13702, 2021

      • BrainzGit
        [mbsssss] release v-2021-05-14 has been published by yvanzo: https://github.com/metabrainz/mbsssss/releases/ta…
      • 2021-05-17 13753, 2021

      • sumedh joined the channel
      • 2021-05-17 13749, 2021

      • Freso
        Is there an environment variable or something that can be used to set Picard’s debug level (overriding config)? I know there’s the `-d` flag, but I’m looking for something that will lower the level, but also potentially customise it depending on other things.
      • 2021-05-17 13759, 2021

      • _lucifer
        ruaok, i have set the tempdir. pyspark picks it up correctly. i think request consumer will as well. should we import an incremental dump first to confirm?
      • 2021-05-17 13739, 2021

      • ruaok
        not sure it will do that, since we only have out-of-sequence incrementals.