ocharles-, warp, ianmcorvidae, getting 502s for just about everything.
andreypopp joined the channel
reosarevok joined the channel
reosarevok
Anyone knows why the hell we're having 50x on every single page?
Leftmost
No, and no one with access seems to be around to figure it out.
reosarevok
Well
Seems to be back for now
Leftmost
It's been in and out for me for a while.
reosarevok
Yeah, ok
Gone again now
*grumbles*
andreypopp joined the channel
petesake joined the channel
DremoraLV joined the channel
warp
hello!
bandtrace joined the channel
bandtrace joined the channel
Leftmost joined the channel
nikki_ joined the channel
ruaok joined the channel
ruaok
warp: PING
warp
ack
zas joined the channel
nikki_ wakes up to a pile of ISEs about problems reading from the redis server
nikki_: yep, we're aware of it. and it's even worse now apparantly.
nikki_
I thought the redis stuff was supposed to stop it from ISEing like that :/
warp
site is back.
nikki_: this is a different ISE
nikki_
well, there's still a bunch of these "Can't use an undefined value as a HASH reference" ones
warp
nikki_: the theory was that either: 1. memcached would lose sessions (it's a cache, not a datastore). 2. if connection to memcached was lost a new session was created, so still losing the session.
nikki_: which is why we switched to redis, because in redis is a datastore, and the connection handling is better
but redis ran out of filehandles (memcached has stuff to deal with this, and redis as well, but our super old version of redis doesn't)
nikki_: and ofcourse there are many other things broken in the release editor which can make it ISE.
nikki_
so I've noticed
the majority of the ISEs are the release editor crashing or people submitting cd stubs without a tracklist :(
oh, or the random search ones
Leftmost
Is it okay if I feel a great sense of satisfaction when I add a disc ID that kills a CD stub?
warp
Leftmost: yes.
nikki_
we can't exactly stop you :P
Leftmost
Just because you can't stop me doesn't mean it's okay. :-P
zas
Hmmm, i cannot log to acoustid.org with my usual credentials, is this related to the issue MB just had ?
luks
zas: might be
the MB auth requests are timing out
zas
ohoho, 502 again on MB
reosarevok joined the channel
ocharles- joined the channel
ocharles wakes up
ocharles
warp: ping
warp
ocharles: hello!
ocharles
still file handle troubles?
warp
ocharles: we've got redis at 100% cpu for no apparent reason
I don't think it's file handles.
Mineo joined the channel
ocharles
what server?
warp
roobarb
ocharles
lets have a looksie
warp
max open files is set correctly when I check /proc/19501/limits
connect clients hovers between 200 and 300 when I can connect. (redis-cli info)
connected
ocharles
we have a 16 core machine so a load of 2 doesn't seem the end of the world, I guess?
warp
yeah, that should be fine.
ocharles
can I have sudo on that machine?
warp
sure
ocharles: done.
ocharles
thanks
warp
hrm. now there's two redis-server's running.
ocharles
redis seems to be writing 30MB/s
so I imagine that cpu usage is almost entirely io dominated
warp
ok
so it needs to flush/save less.
ocharles
but according to /var/log/redis it isn't flushing that ofte
warp
at most it should flush once every minute. but only if 10000 keys have changed since the last save.
ocharles
hum, mabye it's not that, atop isn't showing much activity for the disk actually
warp
:(
andreypopp joined the channel
ocharles: what are you currently doing?
(there's still two redis-servers running, which cannot be good, I'd like to kill one
19501, which also seems to have killed the other one. so perhaps it's normal.
ocharles
the logs say:
30 Mar 12:04:55 * Opening TCP port: bind: Address already in use
warp
but if one is an intentional/internal fork of the other, I would have expected their start time to be the same.
ocharles
so I don't think it's normal
also
30 Mar 12:44:47 - The server is now ready to accept connections on port 6379
30 Mar 11:20:14 * WARNING overcommit_memory is set to 0! Background save may fail under low condition memory. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect.
but that's fail, not spin
warp
the server has 4GB free memory, so that message doesn't seem relevant to our current trouble.
ocharles
this connection is too weak to do anything useful :/
warp
ocharles: anyway, so I don't understand why it's being this finnicky. it should have enough file handles, enough CPU, enough memory.
ocharles
acid2 [at roobarb]:~$ ps aux | grep strc
a^C^[[A^C^C^C^C^Z
for example
:)
warp
ecuador cable internet ftw!
ocharles: I'm inclined to install redis 2.x either on roobarb or a new hoser vm. 1.2 seems old. though upgrading and hoping that magically fixes a problem is in general not the best strategy.
ocharles
i'm ok with that
warp
ok
ocharles
i really can't do anything other than advise i'm afraid
i'm seeing java take 400% of the cpu though atm
warp
ocharles: yep, roobarb and dora are the search servers.
ocharles
i know
but i mean the load doesn't seem to be coming from redis right now
warp nods.
warp
and redis is working fine when it's not spiking at 100%
ocharles
it seems to have only broken today though
and ruaok did an upgrade on the search servers yesterday, so i wonder if the events are correlated?