alastairp: what do you think is a good way to handle the above issue in the test.sh script? The test.sh script currently exits with a 0 exit code even if the tests fail. I was thinking of using `set -e` but that doesn't work because we use functions that return non-zero values, e.g. `is_unit_db_running`. The other option is to store the exit code of the command that ran the tests in test.sh and then return it at the end of the script.
This looks fine to me. Do you think there's a better solution?
alastairp
_lucifer: hi, I saw that message last week
does test.sh run multiple tests in the same job? (e.g. frontend runs 3 things in a row, right?)
so if the first one fails, will the other two run?
_lucifer
No, we call it thrice to do three different things.
and do we try and do cleanup at the end of the job?
_lucifer
test.sh handles that
alastairp
even if you `set -e`?
_lucifer
actually, `set -e` doesn't work here, as I mentioned above. my intent was that with `set -e`, even if cleanup doesn't happen in CI it's fine because the container will just go away.
alastairp
yeah, right
_lucifer
when is_unit_db_running or any other function we define in the script returns a non-zero value, it exits.
so we'll have to handle it manually, I think.
alastairp
I was going to suggest one option would be to use exec, which makes docker-compose take over the script, and its exit code will become the script's exit code
but that means you won't be able to do a cleanup after
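The downside of the exec approach can be demonstrated with a small Python analogue: `os.execvp` replaces the current process image, so nothing after it, including cleanup, ever runs. The exit-5 command below is just a stand-in for the docker-compose invocation.

```python
import subprocess
import sys

# A small child script that exec's into a "test runner" (here: a Python
# one-liner that exits with code 5). The cleanup print below the exec can
# never run, which is exactly the trade-off described above.
CHILD = r"""
import os, sys
os.execvp(sys.executable, [sys.executable, "-c", "raise SystemExit(5)"])
print("cleanup")  # unreachable: the process image was replaced
"""

def run_exec_style():
    # Run the child; its exit code is the exec'd command's exit code.
    return subprocess.run([sys.executable, "-c", CHILD],
                          capture_output=True, text=True)
```

Running `run_exec_style()` yields the exec'd command's return code directly, and the "cleanup" line never appears in the output.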
_lucifer
that's an issue when we run locally.
alastairp
yeah
so, you were thinking of catching the return value of docker-compose, and then returning that at the very end of the script?
_lucifer
yes
one issue is that we lose the return value of the cleanup but i see we are not using it anyway, we just exit 0.
alastairp
I don't see a problem with not having the return value of the cleanup
_lucifer
cool, let's do that then: catch the return value and return it at the end.
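The agreed pattern (remember the test command's exit code, always run cleanup, ignore cleanup's status, then exit with the stored code) might look like this sketch; since they mention a possible test.py below, it is written in Python, with placeholder commands standing in for the real docker-compose invocations.

```python
import subprocess
import sys

def run_with_cleanup(test_cmd, cleanup_cmd):
    # Store the exit code of the command that ran the tests instead of
    # letting set -e style behaviour abort before cleanup runs.
    exit_code = subprocess.run(test_cmd).returncode

    # Cleanup always runs; its own exit status is deliberately ignored,
    # matching the decision above.
    subprocess.run(cleanup_cmd)

    # Propagate the tests' result as the script's exit code.
    return exit_code
```

The caller would end with something like `sys.exit(run_with_cleanup(tests, cleanup))`, so CI sees a non-zero code whenever the tests failed.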
alastairp
even though it's a bit annoying I still think it's the better way to go, using test.sh in the action
_lucifer
yeah, also we'll probably replace it with a test.py soon which should be easier to work with IMO.
outsidecontext
Nena should have sung about "100 Luftballons", then it wouldn't fool Picard's guess-track-number-from-filename feature :(
in get_or_create() I would like to set default min_ts and max_ts timestamps for a user in redis, but only if the user is new. but the db.user.py module is the wrong place to do it (logically, and also importing the timescale listenstore yields a circular import dependency). so it should be done in listenbrainz/webserver/login/provider.py
_lucifer
ruaok, python packages set up on the new cluster. dataframes generated successfully. there is an incompatible change w.r.t. the schema in spark 3.* which needs to be fixed before recommendations start working. currently working on that.
ruaok
but there I can't tell if the user was created or not -- we could return a tuple of (user, created), but our tests call this function 94 times, which would make the PR much larger and add more noise to our codebase.
thoughts on how best to do this, iliekcomputers _lucifer alastairp ?
_lucifer: thanks. this is all perfect timing, I hope to get recommendations working again this week.
alastairp
ruaok: sorry, was in a call and didn't see your first question
ruaok
no worries.
alastairp
in django, their version of get_or_create returns a tuple (object, created), where you can check this
but as you point out, the method is already used in a bunch of places
ruaok
94 more changes to the PR would suck.
alastairp
oh, you already said that. missed that part of the line
ruaok
I could just add another get_by_mb_row_id call to check to see if the user exists. if it is fetched, the next call should effectively be cached in PG and return quickly.
_lucifer
if the only issue is that, we could do it in a separate PR and merge that first.
alastairp
just loading code up now
ruaok
_lucifer: yeah, I'd honestly prefer to do it in a different way and not clutter up our tests more than needed.
alastairp
(there are also 34 instances of `create` in tests which for some reason don't use get_or_create :|)
ruaok: hmmm
ruaok
indeed.
alastairp
if you did it in provider.get_user, you could do a get; if it's None, set a sentinel, then do get_or_create, and if it was created, set the redis stuff
that's what you suggested?
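That flow could be sketched roughly like this. All names here are illustrative stand-ins, not the actual ListenBrainz helpers, and plain dicts play the role of Postgres and redis:

```python
def get_user(mb_row_id, users_db, redis_cache):
    """Sketch of the provider-level flow: check existence first (the
    "sentinel"), get-or-create, and seed redis only for new users."""
    existed_before = mb_row_id in users_db

    # get_or_create: reuse the row if present, otherwise create it.
    user = users_db.setdefault(mb_row_id, {"musicbrainz_row_id": mb_row_id})

    if not existed_before:
        # Seed (0, 0) so the listenstore knows a brand-new user has no
        # listens and can skip querying entirely.
        redis_cache[f"listen_min_ts.{mb_row_id}"] = 0
        redis_cache[f"listen_max_ts.{mb_row_id}"] = 0
    return user
```

Existing users pass straight through without their cached timestamps being touched, so the 94 existing call sites need no changes.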
ruaok
yeah
alastairp
yea, fine by me
ruaok
k, less mess.
alastairp
one potential problem there is that this is only used from the web login flow (right?). if we need this data in redis for an integration test, we might have to create it manually, which then means we might need 2 ways of making the same data
ruaok
alastairp: I don't think we need an integration test -- I'll add a regular one and should be fine.
meh, we have no tests at all for testing the login process.
_lucifer
a good time to do that when we do account migration
ruaok
agreed.
alastairp
ruaok: right, I didn't specifically mean that we need an integration test for this. I mean, if you have an integration test that requires reading these values from redis, how are you going to set them?
ruaok
by calling the cache.put -- after all, I am only inserting (0,0)
the idea is this: if a user creates an account and then goes to their listens page, that would cause us to do an INDEX scan on the whole DB, with the guarantee of not finding anything.
but inserting (0,0) for the timestamps means that the timescale listenstore knows there are no listens and does not query.
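On the read side, the short-circuit ruaok describes amounts to something like this (again an illustrative sketch, not the real listenstore code; `query_timescale` stands in for the actual DB query):

```python
def fetch_listens(user_id, redis_cache, query_timescale):
    """Return a user's listens, skipping the DB entirely when the cached
    timestamps say there is nothing to find."""
    min_ts = redis_cache.get(f"listen_min_ts.{user_id}")
    max_ts = redis_cache.get(f"listen_max_ts.{user_id}")

    if (min_ts, max_ts) == (0, 0):
        # (0, 0) was seeded at account creation: the user has no listens,
        # so don't pay for an index scan over the whole DB.
        return []
    return query_timescale(user_id, min_ts, max_ts)
```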
alastairp
ok, good
_lucifer
these keys will not expire automatically?
ruaok
no, they are now intended to be in redis at all times without expiry. we now update the keys when the DB values change.
in fact this is the only reason why listen counts were ever accurate in the first place, lol. ;)
_lucifer
cool, you might want to pass `time=0` manually then
we'll be updating BU sometime soon so that the caller always has to specify a timeout; `time=0` if you do not want expiry.
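The stricter interface _lucifer describes, where every caller must pass a timeout and `time=0` means "never expire", could look like this toy stand-in (not the actual brainzutils API):

```python
class Cache:
    """Toy cache wrapper where `time` is a required keyword argument;
    time=0 disables expiry. A sketch, not the real BU interface."""

    def __init__(self):
        self._store = {}

    def set(self, key, value, *, time):
        # `time` has no default: every caller must decide explicitly.
        if time < 0:
            raise ValueError("time must be >= 0 (0 means no expiry)")
        self._store[key] = (value, time)

    def get(self, key):
        entry = self._store.get(key)
        return None if entry is None else entry[0]
```

Making `time` keyword-only means any existing call site that omits it fails loudly with a TypeError instead of silently getting a default expiry.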
ruaok
oh, that's important.
_lucifer
New recommendations generated.
ruaok
:D
alastairp
this is a good place to remind us that we had an open ticket about separating the redis interface in BU into cache (ephemeral, may disappear) and data (needs to be kept around permanently), and whether we should have 2 different interfaces for that [not suggesting we do it now, let's just use cache, but something to keep in mind for future improvements]