-
chhavi_
ruaok: Yes, we can do that :)
-
Leo__Verto
yvanzo, have you gotten around to running the script I sent you yet?
-
zas
samj1912, yvanzo: i re-use musicbrainz_server_session cookie, with cookie prefix haproxy option
-
samj1912
cool
-
zas
it needs more tests though, not sure yet if the config is correct, still digging docs
-
samj1912
but that wont help with sir posting docs right?
-
zas: I am shifting sir from solr1 to floating ip?
-
zas
for post i have another config i'm working on, we can target one server
-
samj1912
or should I wait?
-
zas: always 1 server wont be nice
-
zas
wait, i finish the config, then we can test
-
samj1912
we need to target the leader
-
that will result in least latency
-
otherwise solr forwards the request to leader
-
and then leader sends replication packets back to other replicas
-
zas
but how can we get the leader IP in haproxy?
-
samj1912
hmm, I dont think we can
-
not sure
-
zas
well, it embeds lua
-
can we query solr to get the leader?
-
samj1912
we might
-
its too complicated though
-
lets leave it to later
-
zas
ok
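-
Editor's sketch: one way to answer "can we query solr to get the leader?" is the Collections API `CLUSTERSTATUS` action, which marks the leader replica of each shard. The Python below is a rough illustration, not what was deployed. The host names and the `recording` collection are made up, and the field names (`leader`, `state`, `base_url`) should be checked against the Solr version in use.

```python
import json
from urllib.request import urlopen


def fetch_cluster_status(solr_base_url):
    """Fetch CLUSTERSTATUS from any live node (requires a running SolrCloud)."""
    url = f"{solr_base_url}/admin/collections?action=CLUSTERSTATUS&wt=json"
    with urlopen(url, timeout=5) as resp:
        return json.load(resp)


def find_leaders(status, collection):
    """Map each shard name to the base URL of its active leader replica."""
    shards = status["cluster"]["collections"][collection]["shards"]
    leaders = {}
    for shard_name, shard in shards.items():
        for replica in shard["replicas"].values():
            is_leader = str(replica.get("leader", "false")).lower() == "true"
            if is_leader and replica.get("state") == "active":
                leaders[shard_name] = replica["base_url"]
    return leaders


# Hypothetical, trimmed response used only to exercise find_leaders().
SAMPLE_STATUS = {
    "cluster": {
        "collections": {
            "recording": {
                "shards": {
                    "shard1": {
                        "replicas": {
                            "core_node1": {
                                "base_url": "http://solr1.example:8983/solr",
                                "state": "active",
                            },
                            "core_node2": {
                                "base_url": "http://solr2.example:8983/solr",
                                "state": "active",
                                "leader": "true",
                            },
                        }
                    }
                }
            }
        }
    }
}

if __name__ == "__main__":
    print(find_leaders(SAMPLE_STATUS, "recording"))
```

A Lua hook or an external check script could feed the result to the load balancer, but that wiring is left open here, as it was in the discussion.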
-
can you restart the load testing script?
-
samj1912
okay, live or stress?
-
zas
stress it !
-
samj1912
okay
-
that will be directly on the lb though
-
not via mbs
-
zas
that's ok
-
i just want to be sure everything is still ok, and test with 3 xc51
-
hmm, it doesnt load balance
-
zas
due to the sticky thing
-
samj1912: please stop the script, i'll change the config
-
samj1912
okay
-
stats if you want
-
zas
ok, start it again
-
samj1912
okay
-
ruaok
play it again, sam!
-
ruaok runs away
-
samj1912
running
-
ruaok
lol. meta memes.
-
samj1912
meta meta-memes? :P
-
Leo__Verto
meta is what we do best
-
ruaok
it is part of our name, after all.
-
zas
samj1912: let it run few mins (the script, not ruaok)
-
samj1912
XD
-
zas
requests are perfectly balanced, but not the timings... weird it is
-
Leo__Verto
perfectly balanced, as all things should be
-
Leo__Verto just couldn't resist
-
samj1912
zas: something to do with stat display?
-
zas
:)
-
samj1912: nope, it is confirmed by logs
-
that's more related to caches imho
-
or the leader thing
-
samj1912
let me know when to stop
-
zas
never stop !
-
ruaok feels like he stumbled into an inspirational hippie IRC channel.
-
let's stress til it dies! no no ... no no limit !
-
samj1912: looks at cpu usage
-
ruaok
samj1912: give the replay script a percentage that is higher than 100%.
-
(will need code changes, but still)
-
zas
solr1 62% and others are at 100%
-
ruaok
where does it break?
-
samj1912
ruaok: we are doing a stress test currently
-
zas
still 100% 200 responses
-
samj1912
wait let me increase concurrency then
-
3x concurrency now
-
zas
ok
-
115 req/s zero loss on this test
-
samj1912
zas, I am stopping sir for a while
-
ruaok
is solr-1 running some other task that slows things down?
-
samj1912
ruaok: yup
-
zas
sir prolly
-
samj1912
sir
-
ruaok
ah.
-
samj1912
and replication packets to other nodes
-
zas
from which host do you run the test samj1912 ?
-
samj1912
stopped sir
-
zas
herb?
-
ruaok
someone please write a tool called m'aam.
-
samj1912
yup
-
zas
ok
-
samj1912
sir is down
-
zas
ok stress again !
-
samj1912
its on
-
didnt stop the stress test
-
zas: how is it?
-
oh wait
-
let me reshuffle the reqs
-
otherwise caches
-
zas
ok
-
zas
peaking at 212 ops
-
samj1912
restarted with shuffle
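-
Editor's sketch: the point of reshuffling is that replaying logged queries in their original order lets the Solr caches answer repeats, which flatters the numbers. A minimal Python illustration of a shuffled, concurrent replay follows. It is not the actual test script: `replay` and the `send` callback are hypothetical names, and `send` is expected to issue one query and return its HTTP status code.

```python
import random
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def replay(queries, send, concurrency=100, seed=None):
    """Replay queries in random order with a bounded number of workers.

    Returns a Counter of the status codes reported by send().
    """
    shuffled = list(queries)  # copy so the caller's list keeps its order
    random.Random(seed).shuffle(shuffled)
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        # Counter consumes the lazy map() results before the pool shuts down.
        return Counter(pool.map(send, shuffled))


if __name__ == "__main__":
    # Stand-in for a real HTTP call: every query "succeeds" with 200.
    def fake_send(query):
        return 200

    print(replay(["q=a", "q=b", "q=c"], fake_send, concurrency=2, seed=1))
```

Raising `concurrency` corresponds to the 100, 300 and 1000 steps tried in the conversation. A count of non-200 statuses above zero would be the "loss" zas is watching for.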
-
zas
can you increase concurrency a bit? i'd like it to throw some "enough! enough! i'll tell you all!"
-
samj1912
its at 100 :P
-
ruaok
"She can't handle any more captain!!!"
-
samj1912: make it go to 11!
-
er, 110!
-
samj1912
I am not sure herb can :P
-
zas
can you run the same test script from the db server on hetzner cloud ?
-
samj1912
thats destroyed
-
ruaok
playing mean now. :)
-
zas
herb is very good (usually)
-
ruaok
it needs the logs, zas.
-
samj1912
okay, 300 concurrency
-
what gives :
-
* :P
-
last test
-
at 300 concurrency
-
zas
solr1 has timings twice better than others...
-
samj1912
if it doesn't work, I will run it from another node
-
zas
still unexplained
-
samj1912
zas: its not been down since forever
-
zas
for now, we lose no query at 220 queries/s
-
samj1912
its got to have a lot of stuff on memory
-
not just caches
-
apart from the jvm, solr indexes are kept in memory
-
zas
can you loop the test so it runs 10 times ? and i will power down solr2 in the middle
-
samj1912
10 times?
-
I will have to figure out a doc size
-
going almost 300 ops now?
-
zas
yes
-
samj1912
nice
-
zas
btw, set this concurrency to 1000
-
herb doesn't care, <1 load
-
samj1912
lol okay
-
zas
imho we'll need to work on reducing latency
-
(btw, the bandwidth isn't counted between hetzner machines)
-
solr machines reached 50 load
-
samj1912
evident
-
ruaok
latency in what terms?
-
little we can do between the regular and cloud machines.
-
samj1912 just wants to put it in prod now
-
zas
the time needed to answer ONE query, because even without load i have the feeling it could be better than >100ms
-
samj1912: step by step ;)
-
samj1912
ah, I am not sure how I can improve solr anymore
-
zas
still running ?
-
ruaok
where can I see the response time graphs?
-
zas
load >70
-
samj1912
zas: stopped
-
zas