#metabrainz

/

14:35 PM
_lucifer

ah not that one

2021-05-17 13719, 2021

14:35 PM
_lucifer

this one https://acousticbrainz.org/datasets/61265979-235e…

2021-05-17 13734, 2021

14:35 PM
_lucifer

i downloaded the metadata of this dataset and uploaded it to similarity.bono

2021-05-17 13731, 2021

14:36 PM
alastairp

right

2021-05-17 13748, 2021

14:37 PM
_lucifer

https://www.irccloud.com/pastebin/2JAkb2lp/

2021-05-17 13757, 2021

14:37 PM
_lucifer

reproduced the error just now

2021-05-17 13757, 2021

14:38 PM
alastairp

so just from looking at the error message it appears that it might be complaining about such a large number of parameters passed in?

2021-05-17 13734, 2021

14:39 PM
_lucifer

probably. here the parameters are recording ids i think?

2021-05-17 13715, 2021

14:40 PM
alastairp

yes

2021-05-17 13720, 2021

14:40 PM
alastairp

recording id, offset

2021-05-17 13715, 2021

14:42 PM
alastairp

so, I think I left a comment on the PR - maybe we need to load this in chunks

2021-05-17 13751, 2021

14:42 PM
alastairp

if there are more than x items (10,000?) then load them in chunks

2021-05-17 13737, 2021

14:47 PM
alastairp

however, I don't understand the error, because max_stack_depth appears to be related to recursive functions, and `load_many_low_level` doesn't seem to do anything unexpected

2021-05-17 13740, 2021

14:47 PM
bitmap

yvanzo: reosarevok: hey, we'll start at meeting time in a couple hours if that's okay. I'll set up a banner message

2021-05-17 13704, 2021

14:51 PM
reosarevok

Sure! You mean preparations or release?

2021-05-17 13723, 2021

14:51 PM
reosarevok

(as in, do we start the preparations nowish or in 2h)

2021-05-17 13757, 2021

14:51 PM
bitmap

we can start preparations nowish, but finish dinner first :)

2021-05-17 13717, 2021

14:53 PM
_lucifer

alastairp, yeah right. i haven't dived very deep in the code yet so not i am not sure either what's happening. i'll start looking into the PR and then see how we can chunk it.

2021-05-17 13736, 2021

14:54 PM
alastairp

I think chunking is probably going to be the easiest way, the question will be how big can we make the limit to minimise number of round trips to pg. there is a function called `chunks` copied in a few places in that source, we should move it to a utils module so that we can use it from other places

2021-05-17 13751, 2021

14:54 PM
mglubb joined the channel

2021-05-17 13754, 2021

14:54 PM
_lucifer

makes sense

2021-05-17 13702, 2021

14:55 PM
ruaok

uh oh. hi mglubb !

2021-05-17 13749, 2021

14:58 PM
ruaok

_lucifer: phew: dumps saved: http://data.metabrainz.org/pub/musicbrainz/listen…

2021-05-17 13702, 2021

14:59 PM
ruaok

_lucifer: I should be able to import a new dump into the spark cluster now, yes?

2021-05-17 13705, 2021

14:59 PM
yvanzo

bitmap: that works for me, thanks!

2021-05-17 13715, 2021

14:59 PM
_lucifer

ruaok: \o/, yes

2021-05-17 13715, 2021

14:59 PM
bitmap

great

2021-05-17 13739, 2021

14:59 PM
ruaok

bitmap: I had a look atthe migration doc, but I'm quite out of date on that... but looks... complicated.

2021-05-17 13706, 2021

15:00 PM
ruaok

as you may have seen, we've had a lot of drama already today. we're your lightning rod for today. 🤞

2021-05-17 13708, 2021

15:00 PM
bitmap

yea, MB has so many moving parts nowadays

2021-05-17 13717, 2021

15:00 PM
ruaok

I feel yer pain.

2021-05-17 13728, 2021

15:00 PM
bitmap

haha

2021-05-17 13733, 2021

15:00 PM
ruaok

I remember the day when we had a web server and a DB.

2021-05-17 13744, 2021

15:01 PM
mglubb has quit

2021-05-17 13756, 2021

15:01 PM
mglubb joined the channel

2021-05-17 13733, 2021

15:02 PM
bitmap

how quaint!

2021-05-17 13748, 2021

15:02 PM
mglubb

:) hi ruaok. Don't panic! Just checking in on the ETA for the schema change

2021-05-17 13757, 2021

15:02 PM
ruaok

yeah. also the days when there was no traffic.

2021-05-17 13710, 2021

15:03 PM
ruaok

mglubb: too late. today's been a shit day already. :)

2021-05-17 13712, 2021

15:04 PM
bitmap

mglubb: we're on schedule to start for 17 UTC at the moment

2021-05-17 13729, 2021

15:04 PM
bitmap

2 hours from now :)

2021-05-17 13741, 2021

15:04 PM
mglubb

:) Sorry to hear that. Thanks bitmap!

2021-05-17 13707, 2021

15:05 PM
ruaok grumbles about the global chip shortage

2021-05-17 13755, 2021

15:05 PM
bitmap

I'm not 100% sure what to say for an estimated downtime. I think the database backup alone will take ~30 minutes...

2021-05-17 13719, 2021

15:06 PM
bitmap

plus this time we have to build some new materialized tables after the upgrade finishes

2021-05-17 13706, 2021

15:07 PM
bitmap

maybe we can put MB in read-only mode during the DB backup rathern than 503

2021-05-17 13718, 2021

15:09 PM
ruaok

+1 bitmap

2021-05-17 13759, 2021

15:09 PM
ruaok

_lucifer: import requested.

2021-05-17 13725, 2021

15:12 PM
_lucifer

uhhh, why is ssh newleader.listenbrainz.org not working

2021-05-17 13738, 2021

15:12 PM
ruaok

sorry, I forgot to tell you.

2021-05-17 13738, 2021

15:12 PM
_lucifer

but ssh michael.metabrainz.org is

2021-05-17 13748, 2021

15:12 PM
ruaok

I fixed it. newleader is now leader.

2021-05-17 13757, 2021

15:12 PM
_lucifer

oh great! thanks!

2021-05-17 13739, 2021

15:13 PM
_lucifer

but it didn't switch correctly i think or do i need to do something on my end as well? getting DNS conflict error

2021-05-17 13750, 2021

15:13 PM
ruaok

an SSH warning?

2021-05-17 13710, 2021

15:14 PM
ruaok

" WARNING: POSSIBLE DNS SPOOFING DETECTED!" ?

2021-05-17 13712, 2021

15:14 PM
_lucifer

yes

2021-05-17 13723, 2021

15:14 PM
ruaok

look for `Offending key for IP in /Users/robert/.ssh/known_hosts:161`

2021-05-17 13732, 2021

15:14 PM
ruaok

then edit that file and remove the line.

2021-05-17 13736, 2021

15:14 PM
_lucifer

👍

2021-05-17 13737, 2021

15:14 PM
ruaok

line 161 in my case.

2021-05-17 13751, 2021

15:15 PM
_lucifer

works now! thanks!

2021-05-17 13753, 2021

15:15 PM
_lucifer

2021-05-17 15:08:16,302 listenbrainz_spark.ftp.download INFO Downloading listenbrainz-listens-dump-440-20210516-114150-spark-full.tar.xz from FTP...

2021-05-17 13704, 2021

15:16 PM
_lucifer

download started, \o/

2021-05-17 13714, 2021

15:16 PM
ruaok is tailing spark.log as well

2021-05-17 13743, 2021

15:17 PM
_lucifer

alastairp, the people in #docker suggested me to file a bug in buildkit repo regarding the cache from issues if its reproducible

2021-05-17 13718, 2021

15:18 PM
alastairp

_lucifer: great idea, let's try and reproduce it!

2021-05-17 13722, 2021

15:18 PM
_lucifer

currently i am unable to reproduce it locally so not filing currently

2021-05-17 13728, 2021

15:18 PM
alastairp

yeah, fine

2021-05-17 13715, 2021

15:21 PM
_lucifer

also i noticed one thing, the command being set in the errorneous image is the run command of listenbrainz-base

2021-05-17 13737, 2021

15:24 PM
alastairp

right, that's what I saw. it's the last RUN

2021-05-17 13747, 2021

15:24 PM
alastairp

however, the run command is CMD, not RUN

2021-05-17 13736, 2021

15:25 PM
ruaok

I made that mistake in a private project the other day. that was cofusing.

2021-05-17 13740, 2021

15:25 PM
ruaok

confusing too

2021-05-17 13700, 2021

15:28 PM
ruaok

zas: I don't really buy hetzner's "suspected fan" comment: https://stats.metabrainz.org/d/mPeQpO7iz/alerts-c…

2021-05-17 13725, 2021

15:29 PM
zas

we don't have temperature before they changed the fan (I fixed it, a kernel module was missing)

2021-05-17 13746, 2021

15:29 PM
ruaok

oh.

2021-05-17 13708, 2021

15:30 PM
ruaok

well, I'm going to keep it under heavy load for 1000s and if it passes, we'll resume moving services to it.

2021-05-17 13730, 2021

15:31 PM
zas

the graph looks normal to me, it follows load, cpu isn't throttling (I just checked)

2021-05-17 13755, 2021

15:31 PM
ruaok

yeah, me too.

2021-05-17 13702, 2021

15:32 PM
ruaok

I'll let it finish.

2021-05-17 13734, 2021

15:32 PM
zas

hetzner buys crappy cpu fans, likely they have a whole stock of those....

2021-05-17 13705, 2021

15:33 PM
ruaok

I bet they replace them with better fans than the factory fans.. I just don't understand why they dont do it on install.

2021-05-17 13727, 2021

15:33 PM
ruaok

it would save them a lot of money/customer aggrevation.

2021-05-17 13712, 2021

15:35 PM
ruaok

and it does jive with what we were doing -- loads of heavy compression just before crash.

2021-05-17 13758, 2021

15:37 PM
ruaok

alastairp: I'm going to move redis back to lemmy. our timestamps are all invalid on boingo and now will need to be recomputed. I'll do that on lemmy now that it looks like lemmy is happy again.

2021-05-17 13712, 2021

15:38 PM
alastairp

👍

2021-05-17 13742, 2021

15:38 PM
alastairp

I forgot about that data. not much we could have done about it in the moment

2021-05-17 13746, 2021

15:38 PM
ruaok

1000s test complete. all good.

2021-05-17 13711, 2021

15:39 PM
alastairp

possible advantage to having a dedicated redis cluster spread over a few machines to ensure that we don't lose all of the data at once in case of an outage

2021-05-17 13719, 2021

15:39 PM
ruaok

yeah.

2021-05-17 13734, 2021

15:47 PM
ruaok

all services moved back, _lucifer alastairp

2021-05-17 13747, 2021

15:47 PM
ruaok

will clean up docker-server-configs

2021-05-17 13731, 2021

15:54 PM
_lucifer

!m ruaok

2021-05-17 13731, 2021

15:54 PM
BrainzBot

You're doing good work, ruaok!

2021-05-17 13748, 2021

15:54 PM
alastairp

> Informe Económico/Fiscal 2020

2021-05-17 13754, 2021

15:54 PM
alastairp

I'd rather be fixing melting servers

2021-05-17 13722, 2021

15:55 PM
ruaok

should I make a server melt for you?

2021-05-17 13725, 2021

15:55 PM
BrainzGit

[listenbrainz-server] mhor opened pull request #1466 (master…fix-parameters-types-declaration): Fix "/artist-map" parameters types declaration https://github.com/metabrainz/listenbrainz-server…

2021-05-17 13737, 2021

15:55 PM
alastairp

if it means I can stop working on my tax, that'd be great

2021-05-17 13703, 2021

15:56 PM
ruaok goes to hug his accountant

2021-05-17 13736, 2021

15:56 PM
alastairp

I just emailed my accountant saying "hey, this doesn't look right...". let's see what the response is

2021-05-17 13710, 2021

15:57 PM
ruaok

and I just tried telling our landlord at the office to start pulling funds from our belgian account, not spanish.

2021-05-17 13722, 2021

15:57 PM
ruaok

I think I can hear them laughing all the way across town.

2021-05-17 13711, 2021

15:59 PM
ruaok goes to do some less stressful coding

2021-05-17 13717, 2021

16:03 PM
ruaok

_lucifer: michael ran out of disk. HTH?

2021-05-17 13706, 2021

16:05 PM
_lucifer

ruaok: /dev/mapper/vg0-tmp 22G 22G 0 100% /tmp

2021-05-17 13718, 2021

16:05 PM
_lucifer

temp ran out of space not sure why though...

2021-05-17 13736, 2021

16:05 PM
ruaok

can we move spark's tmp use elsewhere?

2021-05-17 13744, 2021

16:05 PM
ruaok

22G is clearly not enough.

2021-05-17 13714, 2021

16:06 PM
_lucifer

i already have moved that. its in /data/tmp

2021-05-17 13726, 2021

16:06 PM
_lucifer

i do not know what is using /tml

2021-05-17 13731, 2021

16:06 PM
_lucifer

*/tmp

2021-05-17 13721, 2021

16:07 PM
ruaok

/tmp/tmpxpjctl_i is using all teh sapce.

2021-05-17 13736, 2021

16:07 PM
_lucifer

yeah just saw that

2021-05-17 13713, 2021

16:08 PM
_lucifer

umm that has the spark download tar.

2021-05-17 13726, 2021

16:08 PM
_lucifer

maybe its python's temp dir?

2021-05-17 13739, 2021

16:08 PM
ruaok

quite likely yes.

2021-05-17 13751, 2021

16:08 PM
_lucifer

i'll change that.

2021-05-17 13732, 2021

16:09 PM
ruaok

+1

2021-05-17 13729, 2021

16:13 PM
BrainzGit

[listenbrainz-server] MonkeyDo opened pull request #1467 (master…monkey-STFU-brainzplayer): Stop BrainzPlayer in other tabs/windows https://github.com/metabrainz/listenbrainz-server…

2021-05-17 13701, 2021

16:14 PM
ruaok

LB functionality fully restored.

2021-05-17 13738, 2021

16:14 PM
_lucifer

ruaok, alastairp: what's the way to set an env variable so that its available always not just login shell?

2021-05-17 13756, 2021

16:14 PM
alastairp

what do you mean "always"?

2021-05-17 13703, 2021

16:15 PM
ruaok

I think you want .bashrc

2021-05-17 13713, 2021

16:15 PM
ruaok

which gets executed when a new shell is created.

2021-05-17 13727, 2021

16:15 PM
_lucifer

like if we set one using docker -e, any running process can see it.

2021-05-17 13732, 2021

16:15 PM
ruaok

(as opposed to .login which gets executed when a user logs in)

2021-05-17 13751, 2021

16:15 PM
ruaok

_lucifer: you can't do that. env vars are shell specific.

2021-05-17 13753, 2021

16:15 PM
alastairp

_lucifer: but is this something that you're running with a process supervisor? or in a shell?

2021-05-17 13708, 2021

16:16 PM
alastairp

the answer is normally "it depends on what tool you're using"

2021-05-17 13740, 2021

16:16 PM
_lucifer

oh! i want to set `TMPDIR` env variable so that the python changes the tempdir.

2021-05-17 13752, 2021

16:16 PM
alastairp

and what is running python?

2021-05-17 13755, 2021

16:16 PM
_lucifer

spark

2021-05-17 13700, 2021

16:17 PM
alastairp

and what is running spark?

2021-05-17 13718, 2021

16:17 PM
_lucifer

we run spark-submit script and that handles everything

2021-05-17 13731, 2021

16:17 PM
_lucifer

the script comes bundled with spark

2021-05-17 13725, 2021

16:18 PM
alastairp

https://www.google.com/search?q=spark-submit+envi… this is where I'd start.

2021-05-17 13737, 2021

16:18 PM
alastairp

I don't know enough about how spark works to know which of these options might work

2021-05-17 13702, 2021

16:19 PM
alastairp

in short, there's no generic way to add environment variables to a single file and have them appear in every process that starts

2021-05-17 13725, 2021

16:19 PM
_lucifer

ah cool, i was wondering if there's something one works for all. as there isn't i look for spark specific solution.

2021-05-17 13727, 2021

16:19 PM
_lucifer

thanks!

2021-05-17 13726, 2021

16:20 PM
sumedh has quit

2021-05-17 13714, 2021

16:23 PM
BrainzGit

[sir] release v2.1.0 has been published by yvanzo: https://github.com/metabrainz/sir/releases/tag/v2…

2021-05-17 13721, 2021

16:24 PM
BrainzGit

[listenbrainz-server] alastair merged pull request #1466 (master…fix-parameters-types-declaration): Fix "/artist-map" parameters types declaration https://github.com/metabrainz/listenbrainz-server…

2021-05-17 13702, 2021

16:27 PM
BrainzGit

[mbsssss] release v-2021-05-14 has been published by yvanzo: https://github.com/metabrainz/mbsssss/releases/ta…

2021-05-17 13753, 2021

16:37 PM
sumedh joined the channel

2021-05-17 13749, 2021

16:39 PM
Freso

Is there an environment variable or something that can be used to set Picard’s debug level (overriding config)? I know there’s the `-d` flag, but I’m looking for something that will lower the level, but also potentially customise it depending on other things.

2021-05-17 13759, 2021

16:46 PM
_lucifer

ruaok, i have set the tempdir. pyspark picks it up correctly. i think request consumer will as well. should we import an incremental dump first to confirm?

2021-05-17 13739, 2021

16:47 PM
ruaok

not sure it will do that, since we only have out-of-sequence incrementals.