9:00 AM
iliekcomputers says hi from Bangkok 😊
9:01 AM
ruaok waves at iliekcomputers
9:01 AM
ruaok
LB is down, but likely not our fault. the DB is freaking out. but we have a poor error message up there.
9:02 AM
iliekcomputers
Yikes.
9:02 AM
ruaok: open a ticket and I will take a look today?
9:03 AM
ruaok
ok.
9:03 AM
first I gotta try and keep the sites up. everything is tipping over.
9:04 AM
iliekcomputers can't wait for Sentry to start working with LB tbh
9:10 AM
vishalchoudhary[ joined the channel
9:17 AM
djwhitey joined the channel
9:18 AM
djwhitey_ has quit
9:23 AM
xps2 has quit
9:27 AM
xps2 joined the channel
9:30 AM
samj1912
ruaok: sorry, just woke up
9:31 AM
ruaok
k. postgres is freaking out again.
9:31 AM
I stopped sir & solr, no change (not surprising)
9:32 AM
running a vacuum analyze now, no improvement so far.
9:32 AM
do you remember what else helped in the past?
9:32 AM
samj1912
ruaok, vacuum analyze is what you want
9:32 AM
It should work once it's done
9:33 AM
ruaok
do we not have the auto-vacuum thingy running?
9:35 AM
samj1912
PG does. I think the problem is once sir does a reindex with recordings, it somehow messes up PG
9:35 AM
In the past we have had such problems only when sir did a complete reindex
9:35 AM
ruaok
and we just did, no?
9:36 AM
samj1912
Yup
9:36 AM
ruaok
from now on, when you re-index, run a vacuum analyze!
9:36 AM
ruaok gives samj1912 a light spanking
9:36 AM
samj1912
But this time I did it against Master (bowie)
9:36 AM
Last time it was against queen
9:36 AM
ruaok
clearly it matters not.
9:37 AM
samj1912
We thought it was hot standby feedback that was causing the problem
9:37 AM
From today, it seems not
9:37 AM
Because PG should technically be doing this on its own.
9:37 AM
Weird that it's not
9:37 AM
I'll look into it with bitmap
9:39 AM
yvanzo
Is it possible to read docker logs from another place than bowie?
9:41 AM
docker logs --since takes forever
9:43 AM
samj1912
yvanzo: redirect to stdout and then do a head?
9:43 AM
9:44 AM
bitmap should be better informed about the value though
9:45 AM
yvanzo
samj1912: I used --tail
9:46 AM
(from the beginning)
9:47 AM
samj1912
Worked?
9:48 AM
yvanzo
Yes and no, it took several minutes and returned logs with unmatched timestamps.
9:49 AM
samj1912
I generally do docker logs --since xyz 2>&1 | head -n 50
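A minimal Python sketch of the same idea as the command above: read only the first lines of `docker logs --since ...` with stderr merged into stdout, then stop the process instead of waiting for the full log. The helper names `docker_logs_cmd` and `first_lines` are hypothetical, and the container name and `--since` value in the usage note are placeholders.

```python
import itertools
import subprocess


def docker_logs_cmd(container, since):
    """Build the argv for `docker logs --since <since> <container>`."""
    return ["docker", "logs", "--since", since, container]


def first_lines(cmd, limit=50):
    """Run cmd with stderr merged into stdout and return its first `limit` lines."""
    with subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # same effect as 2>&1 in the shell
        text=True,
    ) as proc:
        # islice stops reading after `limit` lines, like `head -n`.
        lines = [line.rstrip("\n") for line in itertools.islice(proc.stdout, limit)]
        # Stop the command early rather than letting it stream the whole log.
        proc.kill()
    return lines
```

Usage would look something like `first_lines(docker_logs_cmd("some-container", "10m"))`, where `some-container` stands in for a real container name.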
9:49 AM
Anirudh joined the channel
9:49 AM
xps2 has quit
9:49 AM
Anirudh
Okay, so hey!
9:50 AM
yvanzo
Ok, sorry, it is tail, not head :/
9:50 AM
Anirudh
I'm Anirudh, and I'm participating in GCI
9:50 AM
And I don't know what's going on, would someone just update me??
9:50 AM
yvanzo
Welcome! Do you have any question?
9:51 AM
Anirudh
No, I just was messing around when I landed here.... So....
9:51 AM
I was actually thinking which task I should take up next
9:53 AM
samj1912
ruaok: `1 S lxd 20934 20929 0 80 0 - 8642073 - 2017 ? 00:01:28 postgres: autovacuum launcher process ` autovacuum is on and running since 2017
9:53 AM
we will have to investigate why it's not doing its job :\
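One possible starting point for that investigation, sketched in Python: PostgreSQL's `pg_stat_user_tables` view reports dead-tuple counts and the last autovacuum time per table. The query string below uses only standard columns of that view; the helper `bloated_tables` and its thresholds are hypothetical and purely illustrative, not PostgreSQL defaults, and nothing here is what was actually run on bowie.

```python
# Query one could run (e.g. via psql) against the database in question.
DEAD_TUPLE_QUERY = """
SELECT relname, n_live_tup, n_dead_tup, last_autovacuum
FROM pg_stat_user_tables
ORDER BY n_dead_tup DESC
LIMIT 20;
"""


def bloated_tables(rows, min_dead=10000, min_ratio=0.2):
    """Return (relname, dead_ratio) pairs for tables with a high dead-tuple share.

    rows: iterable of (relname, n_live_tup, n_dead_tup, last_autovacuum) tuples,
    in the column order of DEAD_TUPLE_QUERY. Thresholds are arbitrary examples.
    """
    flagged = []
    for relname, live, dead, _last_autovacuum in rows:
        total = live + dead
        if dead >= min_dead and total > 0 and dead / total >= min_ratio:
            flagged.append((relname, round(dead / total, 2)))
    # Worst offenders first.
    return sorted(flagged, key=lambda item: item[1], reverse=True)
```

Tables that show up here with a large dead-tuple share and an old or empty `last_autovacuum` would be the ones to look at first.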
9:53 AM
Anirudh has left the channel
9:56 AM
ugh dammit, `hot_standby_feedback` is on on bowie :\
9:56 AM
Vini joined the channel
9:57 AM
I thought bitmap turned it off
9:57 AM
Major_Lurker
Hi guys I take it the 502 errors are related to the above?
9:57 AM
samj1912
Major_Lurker: yes
9:57 AM
Major_Lurker
thought so thanks.... good luck :)
9:58 AM
rahul_india joined the channel
9:58 AM
samj1912
it's on bowie so it shouldn't matter but still
9:58 AM
I'll ask bitmap to remove it
9:59 AM
rahul_india
10:00 AM
server*
10:00 AM
Major_Lurker
rahul_india, just wait they are working on it
10:00 AM
rahul_india
okay
10:01 AM
rahul_india has quit
10:06 AM
samj1912
ruaok: is the vaccum still running?
10:06 AM
*vacuum
10:17 AM
ruaok
it finished.
10:17 AM
sigh
10:17 AM
samj1912
just now?
10:18 AM
ruaok
dunno. I went back to napping, since I've got the flu
10:18 AM
samj1912
I don't think it helped
10:19 AM
ruaok
sure doesn't look that way.
10:19 AM
but, I have no ideas on what else it could be.
10:19 AM
I can't think straight.
10:20 AM
samj1912
should we restart postgres?
10:20 AM
Vini has quit
10:21 AM
ruaok
10:21 AM
do you know how to do that?
10:21 AM
samj1912
nope
10:24 AM
just docker restart won't do?
10:27 AM
ruaok
I don't know. nor do we want to corrupt the DB
10:28 AM
samj1912
ruaok: did you run vacuum as postgres or musicbrainz?
10:28 AM
ruaok
musicbrainz
10:28 AM
samj1912
running it again seems to timeout
10:29 AM
10:29 AM
should I redo it with `SET statement_timeout = 0`?
10:29 AM
yeah
10:29 AM
ruaok
it won't help to do it again.
10:30 AM
if it didn't fix anything, running again won't improve it.
10:31 AM
samj1912
postgres doesn't have access to musicbrainz_db right?
10:31 AM
ruaok
it should, why?
10:35 AM
we could try re-starting all the -web and -ws containers
10:35 AM
see if that improves things.
10:35 AM
samj1912
okay
10:38 AM
yvanzo
Can we stop sentry btw?
10:39 AM
It is ~10% of queries.
10:40 AM
ruaok
sure, we've done that in the past.
10:40 AM
rvedotrc has quit
10:43 AM
Vini joined the channel
10:45 AM
xps2 joined the channel
10:45 AM
rvedotrc joined the channel
10:45 AM
yvanzo
done
10:49 AM
samj1912
10:49 AM
still running
10:53 AM
last time I think it took 2 hrs when bitmap ran it
10:56 AM
Dragonzeron has quit
10:57 AM
Vini has quit
11:02 AM
seems to be getting better?
11:03 AM
ruaok
no
11:03 AM
maybe.
11:03 AM
samj1912
ouch, load back to 40 :\
11:03 AM
ruaok
the CPU on bowie is the key to look for.
11:03 AM
it dropped for a brief moment, but back above 90%
11:04 AM
we need to think about other things. vacuum analyze didn't work, nor will it work the second time.
11:04 AM
samj1912
it's not from the second time? given the age it's from the first time?
11:05 AM
ruaok
I ran one on bowie.
11:06 AM
samj1912
yeah, it seems to be that one?
11:07 AM
I wasn't up at the time it ran
11:08 AM
ruaok
your last two statements make no sense to me
11:08 AM
oh
11:08 AM
nm.
11:08 AM
you weren't up... got it.
11:10 AM
samj1912
meanwhile, what should we do, I restarted a couple of containers
11:10 AM
didn't seem to help
11:11 AM
ruaok
restarting postgres is probably the right thing to do, but without knowing how...
11:12 AM
samj1912
yvanzo: do you have any idea?
11:20 AM
yvanzo
samj1912: turning hot_standby_feedback off again?
11:21 AM
samj1912
yvanzo: we will have to restart pg for that
11:21 AM
and it's just `on` on the master
11:21 AM
pg docs say it shouldn't matter there
11:22 AM
I am just waiting for the vacuum analyze to complete, it's been running for almost 2 hrs now, should be near completion
11:23 AM
yvanzo
samj1912: hot_standby is 'on' on bowie/master, and hot_standby_feedback is 'on' on queen/slave, that would just require restarting pg slave. But I'm not sure it would matter really.
11:24 AM
samj1912
hot_standby_feedback should be off on queen? O.o
11:24 AM
I'm pretty sure bitmap turned it off
11:24 AM
yup, it's off
11:25 AM
hot_standby is `on` on queen
11:25 AM
yvanzo
Oops indeed, it is the opposite
11:25 AM
:(
11:26 AM
samj1912
yvanzo: do you know how to restart pg master?
11:29 AM
Vini joined the channel
11:30 AM
yvanzo
nope
11:39 AM
naiveai
Leo_Verto: do I base my branch off of pipenv-docker?
11:42 AM
samj1912
yvanzo: do you have any idea what musicbrainz_ro does?