in #metabrainz

2:21 AM
yyoung

Hi yvanzo, the 'v-2021-04-19' link to the blog is wrong, please fix it :) https://github.com/metabrainz/musicbrainz-docke...
2:35 AM
flamingspinach has quit
2:36 AM
flamingspinach joined the channel
4:35 AM
sumedh joined the channel
4:47 AM
Lotheric_ joined the channel
4:48 AM
ZaphodBeeblebrox joined the channel
4:48 AM
ZaphodBeeblebrox has quit
4:48 AM
ZaphodBeeblebrox joined the channel
4:49 AM
MRiddickW_ joined the channel
4:50 AM
Lotheric has quit
4:51 AM
MRiddickW has quit
4:51 AM
CatQuest has quit
4:58 AM
sumedh has quit
5:15 AM
sumedh joined the channel
5:23 AM
yvanzo

Thanks, fixed.
6:25 AM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #2069 (master…MBS-11598): MBS-11598: Do not generate image link if we don't have a suffix https://github.com/metabrainz/musicbrainz-serve...
7:04 AM
Etua joined the channel
7:07 AM
Etua has quit
8:24 AM
sampsyo has quit
8:25 AM
sampsyo joined the channel
10:00 AM
MRiddickW_ has quit
10:23 AM
Darkloke joined the channel
10:42 AM
_lucifer

https://www.irccloud.com/pastebin/njkQKxzi/
10:44 AM
alastairp: ruaok: does the above query look fine? it works in psql, i am wondering if sqlalchemy will correctly quote the `:error_message` param.
10:45 AM
ruaok

not sure about that one.
10:45 AM
_lucifer

for example, if i pass `error_message="Hello, World!"` i want that `:error_message` be substituted with `'"Hello, World!"'`
10:45 AM
alastairp

my feeling is that :error_message should get quoted properly
10:45 AM
but I couldn't tell you for sure, probably better to just load up a shell and try it
10:46 AM
_lucifer

yeah, i'll do that.
10:46 AM
alastairp

service_details = jsonb_set(coalesce(service_details, '{}'), '{error_message}', 'hello world')
10:46 AM
is this what you want?
10:46 AM
I don't think it'll double-quote it, not sure if that's what you're asking for
10:47 AM
_lucifer

yeah, the double quotes are needed in this case
10:47 AM
alastairp

I suspect you'll have to pass in {"error_message": '"' + message + '"'} when passing args to execute, if it lets you do it
10:48 AM
or maybe ":error_message" in coalesce, but again, unsure sorry
10:48 AM
try a few things and tell us which one works :)
10:48 AM
_lucifer

sure :D
10:59 AM
BrainzGit

[listenbrainz-server] MonkeyDo opened pull request #1395 (master…monkey-fix-spotify-player): Spotify player fixes and improvements https://github.com/metabrainz/listenbrainz-serv...
11:31 AM
Darkloke has quit
12:27 PM
Lotheric_ is now known as Lotheric
12:47 PM
Sophist_UK joined the channel
12:47 PM
ruaok

yvanzo: any idea what is up with the sir queues?
12:48 PM
Sophist-UK has quit
13:00 PM
chaban

The last few days I've been marking thousands of recordings as video. Could it be related?
13:00 PM
Making good progress :) https://musicbrainz.org/statistics/timeline/cou...
13:01 PM
MRiddickW joined the channel
13:25 PM
yvanzo

chaban: that seems unlikely to be related, unless you have peaks of high activity.
13:32 PM
ruaok: not yet, there is no error from sir itself at least.
14:03 PM
ruaok

Mr_Monkey: remind me please, which two time_range tickets are fixed by my work?
14:03 PM
Mr_Monkey

Let me fish those out
14:04 PM
LB-701 and LB-862
14:04 PM
BrainzBot

LB-701: My listen page is almost empty https://tickets.metabrainz.org/browse/LB-701
14:04 PM
LB-862: Listens view broken after deleting last.fm imported tracks https://tickets.metabrainz.org/browse/LB-862
14:05 PM
ruaok

thx
14:09 PM
_lucifer

alastairp: ruaok: json.dumps works.
14:09 PM
ruaok

what exactly?
14:10 PM
_lucifer

https://www.irccloud.com/pastebin/izu8tCKG/
14:11 PM
ruaok

ah, great.
14:13 PM
_lucifer

ah, just doing '"{}"'.format(error_message) also works. the previous time i tried that i had made a typo due to which it hadn't worked.
14:14 PM
ruaok

_lucifer: alastairp: https://github.com/metabrainz/listenbrainz-serv... is ready for review.
14:14 PM
alastairp

ok, thanks
14:15 PM
_lucifer

👍
14:15 PM
alastairp

_lucifer: is error_message a string, or a dictionary?
14:15 PM
_lucifer

string
14:15 PM
alastairp

so in this case you're only using json.dumps to add quotes to it?
14:15 PM
_lucifer

yes, i was. but format thing worked as well so i'll go ahead with it,
14:16 PM
alastairp

I've done this before where I use json.JSONEncoder().encode(val)
14:17 PM
because, consider:
14:17 PM
s = 'foo"bar'; json.dumps(s)
14:17 PM
'"foo\\"bar"'
14:18 PM
_lucifer

ah! makes sense.
14:18 PM
ruaok

alastairp: got a sec for a sanity check?
14:19 PM
alastairp

give me 5, just fighting with haproxy
14:19 PM
ruaok

perfect. it will take me 5 minutes to explain. ;D
14:19 PM
_lucifer

is there any difference between using dumps or the encoder?
14:20 PM
ruaok

the listen count are very expensive to calculate and when we first push this branch into production, the site will appear effectively broken because nothing is loading.
14:20 PM
to mitigate that, I plan to write a script that:
14:20 PM
1) Picks the last inserted timestamp from the listen table.
14:21 PM
2) For each user, set a zero listen count, zero timestamp.
14:21 PM
3) iterate over every listen from begin of time to this timestamp. tabulate listen counts and time.
14:23 PM
_lucifer

how much expensive are we talking about?
14:23 PM
ruaok

4) At the end of this script INCREMENT each of the users counts by the calculated total. Update timestamps to ensure that calculated timestamps won't overwrite any timestamps that may have been calculated since this process started.
14:24 PM
This process allows for the timescale_writer to keep writing new listens and for the update script and the timescale writer to coexist peacefully.
14:24 PM
In theory we should catch all the listens as they come in.
14:24 PM
thoughts?
14:24 PM
_lucifer: I saw one take 85s.
14:25 PM
_lucifer

😵
14:25 PM
alastairp

here
14:26 PM
ruaok

exactly, which is why I really want to stop computing them.
14:26 PM
alastairp

yeah, doing a script to pre-compute these things makes a lot of sense. does this mean we'll have to make a deploy with the infrastructure available but disabled? or will you be able to do it before the big deploy?
14:27 PM
tabulate listen counts and time - this will just find min/max time and num listens for each user?
14:27 PM
ruaok

> tabulate listen counts and time - this will just find min/max time and num listens for each user?
14:27 PM
yes
14:27 PM
> or will you be able to do it before the big deploy?
14:28 PM
I think that if we deploy the timescale_writer first, then we can run the tabulate script, then push the web container, then I think the right thing will happen.
14:28 PM
More correctly:
14:28 PM
1. tabulate.
14:29 PM
2. deploy timescale writer
14:29 PM
3. tabulate again
14:29 PM
4. deploy web
14:29 PM
then the release should be seamless.
14:29 PM
alastairp

what service updates max and num listens when necessary? timescale writer?
14:29 PM
ruaok

was timescale writer, but a lot of that logic has moved to timescale listenstore.
14:30 PM
where it belongs.
14:30 PM
alastairp

but the writer container, not the web container?
14:30 PM
ruaok

there is a lot of really needed cleanup in this PR.
14:30 PM
alastairp

do you need any special code running in a container in order to do 1 ?
14:30 PM
ruaok

yes,its runs in the writer container with no special code needed.
14:31 PM
_lucifer

beta, and prod share the cache container so we could run the script in beta if needed?
14:31 PM
ruaok

just python module speaking that logic moved to the listenstore.
14:31 PM
_lucifer: yeah, sure. good idea.
14:31 PM
alastairp

ruaok: yeah, that's what I was trying to get at. running in beta should be fine
14:31 PM
ruaok

the key being, the writer needs to be the first thing to move as part of the deployment.
14:32 PM
you can test all this one test.lb right now.
14:32 PM
load your feed page to find out what I mean.
14:32 PM
it will appear broken, I assure you.
14:32 PM
alastairp

what's the purpose of 3? because of stuff that might come in between tabulate and when we shut down the old timescale writer?
14:33 PM
ruaok

#1 is needed for #2 to work right. #3 is the part that does the actual work.
14:33 PM
I suppose #1 could be reduced to "set all redis keys to 0."
14:33 PM
but that is more code to write, lol
14:34 PM
alastairp

ah, I follow now
14:46 PM
_lucifer

another thing, how about getting the list of all users and then querying the api in beta container for all users.
14:48 PM
instead of writing another script for the same task.
14:48 PM
ruaok

that could work, but it would tie up the DB for hours. and it would make N passes over the data, as opposed to 1.
14:49 PM
and this other script I am talking about, all its pieces already exist.
14:49 PM
its just a matter of conjuring them into one script.
14:49 PM
_lucifer

ah ok! a special script makes sense then
14:53 PM
ruaok signs the 🦄's butt
14:53 PM
ruaok

er, contract.
14:53 PM
_lucifer

lol
14:54 PM
shivam-kapila

Lol. I feel ya. Fiddling with docs too
14:56 PM
ruaok

lol. :)
14:56 PM
no, the contract hasn't been countersigned yet.
15:02 PM
alastairp

ok hey
15:02 PM
we got part of https://freesound.org/ running in a kubernetes cluster
15:03 PM
there's definitely something to be said for just sshing into a computer and running your webserver in a screen
15:03 PM
_lucifer

nice but why the move to kubes?
15:04 PM
alastairp

because our IT department supports it
15:05 PM
_lucifer

ah! :D
15:06 PM
yvanzo

I would love to read more your experience about it :)
15:07 PM
alastairp

🤮
15:07 PM
lol
15:07 PM
nah, it's not too bad. we're lucky that IT gave us a set of templates to copy and fill out.
15:07 PM
yvanzo

:D
15:08 PM
alastairp

there are a lot of moving parts, and coming into it with no knowledge about how everything fits together there was a lot of guessing. I'm sure that as we migrate more services to it we'll come to understand better how everything works
15:08 PM
some things like auto-scaling are really neat
15:25 PM
_lucifer

this is some next level shit https://git.kernel.org/pub/scm/linux/kernel/git...
15:29 PM
ruaok

someone has an ego problem, for sure.
15:44 PM
iliekcomputers

big tech and personal project things -- name a worse pair
15:44 PM
ruaok

crocs and high heels?