i thought that we could store and retrieve the values as integers, but i just tested with redis-cli and that is not the case, it returns integers as strings.
so just ignore that comment.
alastairp
the only gain perhaps is that logically the values are all joined together in a single key
instead of having 2 keys per user, there are just 2 count keys in redis
_lucifer
ah, alastairp's motivation was different.
alastairp
_lucifer: a reminder, all counts in redis are always strings
even if you use `INCR foo`, it'll be stored as a string
_lucifer
oh! i didn't know that.
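For context, a minimal sketch (using the redis-py client; the key and field names here are hypothetical, not the actual ListenBrainz keys) of the two points above: keeping all per-user counts under a single hash key instead of one key per user, and the fact that Redis hands every value back as a string even after INCR/HINCRBY:

```python
# Sketch only: key/field names are hypothetical.
import redis

r = redis.Redis(decode_responses=True)  # assumes a local Redis instance

# One hash key per count type, with one field per user,
# instead of a separate key for every user:
r.hincrby("lb:listen_count", "user_a", 5)
r.hincrby("lb:listen_count", "user_b", 3)

# Plain INCR behaves the same way: the value is kept as a string.
r.incr("lb:pending_count")

print(r.hget("lb:listen_count", "user_a"))  # '5'  (str, not int)
print(r.get("lb:pending_count"))            # '1'  (str, not int)
# Callers need int(...) if they want to do arithmetic on the result.
```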
ruaok
good enough for now
353850 listens processed: exact 78138 high 800 med 2426 low 2697 no 23198 prev 246591 err 0 93.4%
some really nice stats coming from the MBID writer working on live incoming listens.
alastairp
ruaok: listenstore review done
-> lunch. back to answer questions soon
piti has quit
ruaok
all comments addressed, most fixed.
alastairp
I tried the select max(listened_at) query without a group by and it gave me a result
not sure what's different in your one
ruaok
I took it out and it barfed on me.
different timescale versions?
alastairp
you took out just group by, or group by and order by?
ruaok
just group by
alastairp
I can reproduce the error if I take out just group by
but you don't need order by, because you're already using max/min?
ruaok
LIMIT 1
🤦‍♂️
alastairp
but max(listened_at) only returns 1 row
ruaok
I wonder if this was confusing the query planner and making things slow.
but now tests are failing. huh
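For reference, a sketch (psycopg2; the table and column names are assumed from the conversation, not taken from the real schema) of the query shape being discussed: a bare aggregate already returns exactly one row, so ORDER BY / LIMIT 1 is redundant on top of max()/min():

```python
# Sketch only: table/column names and the DSN are assumptions.
import psycopg2

conn = psycopg2.connect("dbname=listenbrainz_ts")  # hypothetical DSN

with conn, conn.cursor() as cur:
    # An aggregate with no GROUP BY collapses the filtered rows into a single
    # row, so ORDER BY and LIMIT 1 add nothing here (and, as suspected above,
    # may only give the planner something extra to trip over).
    cur.execute(
        """
        SELECT min(listened_at), max(listened_at)
          FROM listen
         WHERE user_name = %s
        """,
        ("some_user",),
    )
    min_ts, max_ts = cur.fetchone()
```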
BenOckmore has quit
BrainzGit
[bookbrainz-site] MonkeyDo merged pull request #623 (master…dependabot/npm_and_yarn/underscore-1.13.1): chore(deps): bump underscore from 1.10.2 to 1.13.1 https://github.com/bookbrainz/bookbrainz-site/p...
alastairp
_lucifer: looks like this job may not have used any docker cache. any thoughts as to why? Either because it's the first build since we fixed the ARGs, or because it failed to store the cache the last time it ran?
_lucifer
the latter, or because the cache got evicted.
the tests run often but releases are seldom, so it's probable that the release cache gets evicted.
alastairp
right. I'm not sure - 10 minutes for a build is quite a long time. I know that if I do a small incremental release locally (e.g. a .0 then a .1) then the build time is going to be much shorter
_lucifer
yeah, right.
ruaok
gah. refreshing the 30-day continuous aggregate also takes up gobs of disk space. :(
_lucifer
the cache got stored this time, alastairp.
ruaok, could this be related to the WITH NO DATA option?
ruaok
it is.
but then you would normally refresh the agg and it would run into the same issues.
I need to see if I can have it refresh the agg in chunks (5 years at a time)
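A sketch of what refreshing in chunks could look like (psycopg2 again; the aggregate name, date range, and timestamp types are hypothetical). An aggregate created WITH NO DATA starts empty and has to be refreshed afterwards; `refresh_continuous_aggregate` is a TimescaleDB procedure that must be CALLed outside a transaction block, and each call only materializes the window it is given, which is what keeps the per-refresh disk usage bounded:

```python
# Sketch only: aggregate name, DSN, and date range are hypothetical.
from datetime import datetime

import psycopg2

conn = psycopg2.connect("dbname=listenbrainz_ts")  # hypothetical DSN
conn.autocommit = True  # the procedure refuses to run inside a transaction

WINDOW_YEARS = 5
start_year, end_year = 2002, 2022

with conn.cursor() as cur:
    for year in range(start_year, end_year, WINDOW_YEARS):
        window_start = datetime(year, 1, 1)
        window_end = datetime(min(year + WINDOW_YEARS, end_year), 1, 1)
        # Only this 5-year slice is materialized per call.
        cur.execute(
            "CALL refresh_continuous_aggregate('listens_30day', %s, %s)",
            (window_start, window_end),
        )
```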
yvanzo
nelgin: There is a more general issue: These logs don’t provide enough details to identify the edit(s) that caused the failure.
SEARCH-577: Index limit exceeded on medium formats update
yvanzo
It’s doable to increase your 'index_limit' value, but at the risk of PostgreSQL being less available for other tasks during such events.
As a workaround... ^
I will resume addressing the indexer’s issues (and the prerequisites for porting to Python 3) after the next schema change release.
ruaok
wow, gaga. 1210M/s write speed. zoooooom!
still, hurry up, eh gaga?
alastairp
what's going on? You can generate the new aggregate before we deploy, then when we deploy it'll start using it, and then we can delete the old one?
ruaok
that is what I am doing.
it's stopped consuming disk, so now I just need to wait.
_lucifer
alastairp: if you are idle currently, can you take a look at the youtube PR?
alastairp
I approved them
we're going to have to have a talk about this workflow, I can see that I'm a bottleneck for reviews and I'm not sure how we can set things up to get them processed quicker
_lucifer
thanks!
one issue regarding the time ranges and youtube PRs has been that these are huge PRs. if we could have done small incremental PRs, merges and deployments, it would have been easier.
but i don't think it was possible in these 2 cases.
although we could probably set up a weekly meeting sort of thing like the MB team have.
alastairp
yeah, iliekcomputers has commented about the value of small PRs before, and I agree there
but sometimes a change has to be big (time ranges, this one, etc)
_lucifer
yeah, sometimes there's no option but otherwise we should aim for small PRs.
alastairp
a few times for SoC I've tried to do these incremental PRs like you did here, but in my experience that's always gone badly too
_lucifer
yeah, rebasing and so on becomes a mess.
ruaok
i don't rebase anymore -- it always turns into a painful mess. merging master in works much better for me.
ok, aggregate built, finally.
starting deploy.
taking web-test for the time being.
_lucifer
yeah, i gave up on rebasing last week too.
alastairp, ruaok, Mr_Monkey: so how about we try to aim for a weekly release? we can set a day in advance that works for all of us. on that day we try to review and merge as many PRs as possible and release at the end of the day. just like the PR smashing days we had some time ago, but every week.
so the 'prod' image on docker hub is the one we want
_lucifer
yes right
alastairp
_lucifer: it's worth a try setting a release day, though I don't want the day to turn into a panic to try and merge as many PRs as possible before we release
I think our periodic "fix prs" day that we do at the office is working well, though the last time we did that we definitely picked up the easy ones
_lucifer
alastairp: right, i agree. we should aim for a reasonable amount of changes.
alastairp
_lucifer: the job failed to pull cache layers again
ruaok
though if .1 is done and ready, I'll use that.
alastairp
nah, it'll be another 10 minutes, because for some reason the build job isn't using a cache
ruaok
k
these build times are getting redonkulous
alastairp
yeah, this 10 minutes is for a full LB prod image. we added caching in the github actions precisely to avoid this, and when building locally on your machine it should reuse caches
but even then, it's a pretty long process from 0
ruaok
yeah
alastairp
growing up in a country at the far end of a single cable to the rest of the internet still makes me cringe at downloading the same packages again and again every time you want to make a new deployment. it feels like such a waste
ruaok
huh? still not found???
is push.sh busted?
do I need to merge another branch?
_lucifer
the image is not pushed yet, a couple of minutes more.
alastairp
how did you try and push it yourself?
ruaok
MY image didn't appear.
docker/push.sh prod v-2021-05-07.0
alastairp
sorry - I thought you pulled :prod and renamed it to v-2021-05-07.0
_lucifer
you do not need prod now.
alastairp
ok. we made this change and didn't inform you of it - sorry about that
_lucifer
docker/push.sh v-2021-05-07.0
alastairp
the syntax is now ^
_lucifer
the action has also pushed now, so you can use that for now.
ruaok
finally found it.
_lucifer
alastairp, i'll enable DEBUG logging on the actions and see if I can find the issue.
alastairp
_lucifer:
> Stored root cache, key: docker-layer-caching-Build image and publish to Docker Hub-61ad5188f09e6d2abe065a8e450c87c8238c1c1a0f06684921281fbebe0e9d70-root, id: 50510