in #metabrainz

14:22 PM
monkey

I think that could improve matching figures a fair bit, at the detriment of precision
14:23 PM
alastairp

this is a similar issue to my JJ Johnson & Kai Winding example from before
14:23 PM
especially if MB doesn't have all of the credits set up
14:23 PM
ruaok

I *think* my system is setup carefully enough for that to not end in disaster.
14:24 PM
alastairp

not sure what album monkey is listening to, but there definitely isn't an AC here: https://musicbrainz.org/release/0754aafd-ee25-4...
14:24 PM
ruaok

which is why it matched.
14:24 PM
(with the detuning)
14:24 PM
monkey

That listen comes from the album Bacchanal. Recording here: https://musicbrainz.org/recording/cdbb76d9-549b...
14:24 PM
alastairp

however, the *artist rels* of most of those recordings do include those other artists
14:26 PM
monkey

Oh, and an interesting thing I just noticed :
14:26 PM
"Gabor Szabo Jim Stewart Hal Gordon Jimmy Keltner Louis Kabok" with or without commas doesn't match.
14:26 PM
If I cheat and write "Gabor Szabo feat. Jim Stewart Hal Gordon Jimmy Keltner Louis Kabok" instead (note the "feat.") I get a match
14:26 PM
https://labs.api.listenbrainz.org/explain-mbid-...
14:26 PM
ruaok

yep, https://labs.api.listenbrainz.org/explain-mbid-...
14:26 PM
alastairp

I guess that's because ruaok deletes 'feat.' and anything following it?
14:26 PM
but doesn't do that for ,
14:27 PM
ruaok

yes, but I propose to try exactly that.
14:27 PM
all the cases I've tried seem to work ok.
14:27 PM
see link above
14:33 PM
two tracks remaining on that Sound of Contact album.
14:36 PM
alastairp

nice
14:36 PM
spotify: "I am dimensionaut", MB: "I Am (Dimensionaut)"
14:37 PM
ruaok

that should end up matching, no?
14:37 PM
alastairp

yeah, https://labs.api.listenbrainz.org/explain-mbid-... shows a match
14:37 PM
but not on LB
14:38 PM
ruaok

still not finished with Nov 17
14:38 PM
alastairp

great
14:38 PM
ruaok

I keep restarting it.
14:40 PM
ruaok finds another lovely album on huesound
14:41 PM
BrainzGit

[listenbrainz-server] 14MonkeyDo merged pull request #1739 (03master…monkey-window-title-trackname): LB-747: Update browser window title with currently playing track's name https://github.com/metabrainz/listenbrainz-serv...
15:00 PM
lucifer

do we have gateways meeting now or in an hour?
15:00 PM
alastairp

in an hour
15:01 PM
lucifer

👍
15:01 PM
ruaok

monkey: https://listenbrainz.org/user/alastairp/?min_ts...
15:01 PM
I have this page open and new items keep getting added to the TOP of the page.
15:02 PM
monkey

Yeah, there's already a ticket for that.
15:02 PM
alastairp

that's LB-834
15:02 PM
BrainzBot

LB-834: New listens are added to top of wrong Recent Listens page https://tickets.metabrainz.org/browse/LB-834
15:02 PM
ruaok

k, thanks!
15:02 PM
alastairp

twice today I was going to tell monkey
15:02 PM
and then I decided that he has too much on his plate already :)
15:02 PM
monkey

LB-834
15:02 PM
Ah, soz, didn't see the past above
15:02 PM
alastairp

ruaok: the solution is to tell _me_ to stop listening to music :)
15:03 PM
ruaok

au contraire, I just found out you ARE!
15:03 PM
alastairp disconnects spotify
15:04 PM
monkey

We're soon going to remove BrainzPlayer from the right hand side, maybe that'll be a good place for "sicne you opened this page…" sort of list
15:04 PM
alastairp

sure
15:04 PM
I remember we had this long discussion about how we should deal with paging if you're using the app as new listens are coming in
15:04 PM
and I don't think I agreed with all of your suggestions
15:04 PM
monkey

Yeah, none of them seem great
15:05 PM
We have no concept of "page 1" so it makes it pretty tricky
15:07 PM
alastairp

monkey: just listened to Bacchanal
15:07 PM
it was good
15:07 PM
monkey

Great album
15:15 PM
CatQuest

no.
15:16 PM
I *want* brainzplayer to (optionally) be on the right
15:18 PM
monkey

CatQuest: I can isolate the CSS the we currently have. If you have a browser script that can load CSS for a specific page that'll work
15:19 PM
lucifer: Thoughts on #1586 (CB review modal) ? Maybe something we want to hide while we finish the rest of the feature, or is it ready?
15:20 PM
In any case, I think that's all the PRs on my side for today.
15:22 PM
And ruaok were you able to have a look at #1718 yesterday?
15:22 PM
ruaok

lemme look now
15:27 PM
lucifer

monkey: i havent been able to look at the other PRs yet so not sure how we should proceed. i'll get back to you on this once i have looked at those.
15:27 PM
monkey

OK, couldn't remember either where we are at
15:40 PM
BrainzGit

[musicbrainz-server] 14kawanoseiya opened pull request #2344 (03master…patch-1): Update INSTALL.md https://github.com/metabrainz/musicbrainz-serve...
15:43 PM
reosarevok

Is there another "send random PRs and win prizes" competition? ^
16:00 PM
lucifer

meeting time! :D
16:00 PM
zas

hey
16:00 PM
lucifer

hi!
16:00 PM
ruaok waves
16:01 PM
zas

ok, should I start? alastairp ?
16:02 PM
alastairp

hi
16:02 PM
zas

so, I did a list about what's working for us regarding gateways, and what's not
16:02 PM
first what's working:
16:03 PM
autossl (auto-generation of letsencrypt certs)
16:03 PM
ruaok

zas: you are leading this meeting, yes?
16:03 PM
zas

well, I prefer to start, not really "leading" ;)
16:04 PM
autoconfiguration of frontends & backends from consul (docker-server-configs + gitzconsul + serviceregistrator + consul + consul-template)
16:04 PM
load balancing with weights
16:05 PM
lucifer has quit
16:05 PM
redundancy: being able to switch between gateways, even though it's manual, that's something we want to keep somehow for maintenance tasks
16:05 PM
monkey has quit
16:06 PM
what doesn't work well:
16:06 PM
alastairp has quit
16:06 PM
ssl puts a serious pressure on cpu, limiting number of requests we can handle (aka the CAA redirect mess)
16:06 PM
lucifer joined the channel
16:06 PM
alastairp joined the channel
16:07 PM
alastairp

sorry, irccloud is sad. here now
16:07 PM
zas

too much "pre-processing" on gateways: some stuff we do there should be done on backends instead, but that's like this for multiple reasons
16:07 PM
monkey joined the channel
16:07 PM
lucifer

i missed some messages too. looking at chatlogs now
16:07 PM
alastairp_ joined the channel
16:07 PM
zas

grrrr
16:08 PM
monkey

Bad timing for a split
16:08 PM
zas

yep :(
16:08 PM
lack of redundancy: no auto switch in case of failure
16:08 PM
lack of horizontal scalability
16:09 PM
only level 7 load balancing
16:09 PM
so, I started to think about a redesign
16:10 PM
I did a lot of research, about tools we could use, that are open source, and free (as in beer)
16:10 PM
alastairp

currently requests for all MeB services go to 1 gateway? and it's routed by nginx/openresty?
16:10 PM
ruaok

I never got a response from the haproxy people about non profit licensing.
16:10 PM
zas

yes
16:11 PM
alastairp: in fact, everything is going through gateways, but few services
16:12 PM
btw, I think it would be preferable to not have "holes" (aka services not going through gateways), it makes few things harder to control (like IP blocking etc)
16:12 PM
so, to scale, we have to upgrade the hardware, that's not great
16:13 PM
alastairp

so the goal is to get traffic off of the gateways sooner? (not sure if we're talking about goals yet)
16:14 PM
zas

nope, the goal is to have better gateways, we still have to load balance over backends, do caching, do filtering , handle ssl, etc
16:14 PM
but we need a solution that is more robust & scalable
16:15 PM
any questions til now?
16:15 PM
ok, I continue: hetzner failover IPs
16:16 PM
those are great, basically we can switch one or more IPs between machines (kiki & herb in this case)
16:16 PM
ruaok

with the caveat that the automatic detecting that something needs failing over is VERY SLOW.
16:17 PM
zas

but this is a slow process, because we have to use hetzner API, which is slow to answer
16:17 PM
lucifer

how slow? 1m, 5m, 10m?
16:17 PM
kepstin

(still faster than updating dns tho, probably)
16:17 PM
ruaok

but if we detect the failure ourselves and tell hetzer to switch over it is fast, yes?
16:17 PM
zas

the switch itself takes 1 second or less, but the API takes 1min or more
16:17 PM
alastairp

is hetzner API the only way of doing the failover? (I ask because I have done the same with keepalived, and you just bring up the IP on the host that needs it)
16:18 PM
zas

it is never fast: detect the failure at T, connect to API at T+1 second, wait til T+60s, switch at T+61 seconds
16:18 PM
ruaok

alastairp: yes, it must be done via hetzner otherwise their routers can't route traffice.
16:18 PM
alastairp

right
16:18 PM
ruaok

oh, it is NEVER fast?
16:18 PM
that's pretty crap.
16:19 PM
zas

yes :( it was a bit faster in the past, but now never under a minute
16:19 PM
ruaok

oy
16:19 PM
alastairp

devil's advocate: 60 seconds may be faster than us getting a telegram, working out what's wrong, sshing in to gateway and triggering swtichover
16:19 PM
lucifer

how do we do the failover currently?
16:19 PM
zas

alastairp: yes
16:19 PM
lucifer: manually
16:20 PM
lucifer

right, i mean what's the process once we decide to do it? still need to go through the api?
16:20 PM
zas

why? because I never manage to have a stable setup with failures detection
16:20 PM
we run a script, which asks API to switch IP
16:20 PM
alastairp

what is the usecase for switching over? We currently do it when zas upgrades gateways. We did it a few weeks ago when the server was overloaded. what other cases are there? server disappears unexpectedly?
16:21 PM
kepstin

unless you can convince them to reconfigure their switches to allow using something like ARP triggered from your machines to do the failover, there probably isn't a faster option.
16:21 PM
zas

kepstin: we can't convince them to do that, this is why we have to use the API
16:22 PM
lucifer

so the only difference in the automatic failover and the script we have now is the time to detect the failure after that both processes are same, right?
16:22 PM
zas

aka we can't keepalived those IPs as we would on our own network
16:22 PM
alastairp

lucifer: and as zas points out, getting a stable setup that only switches over on real errors
16:23 PM
lucifer

i see, makes sense
16:23 PM
ruaok

https://docs.hetzner.com/robot/dedicated-server...
16:24 PM
kepstin

interesting, so it probably involves a routing reconfiguration, since they support this across the datacenter, not just for machines on the same switch. would explain why it's slow to set up :/
16:24 PM
zas

kepstin: yes
16:25 PM
reosarevok has quit
16:25 PM
lucifer

uh irccloud acting up again
16:25 PM
ruaok

> Switching a failover IP/subnet takes between 40 and 60 seconds.
16:26 PM
lucifer has quit
16:26 PM
zas

yes, in practice, 60 seconds rather than 40s
16:26 PM
lucifer joined the channel
16:27 PM
but, it is important, the actual switch is fast: < 1 second, we lose connections though when it happens