Which data is stored, in example in server logs? IP addresses?
Anything else I need to mention in the app? Something like "this is not a part of MetaBrainz"?
prout has quit
prout joined the channel
yvanzo
O’Moin
Hi relaxoMob: Good questions. As a developer, make sure to follow https://musicbrainz.org/doc/MusicBrainz_API and more particularly “What can I do with”, “Who can use”, “Application rate limiting and identification”
Our privacy policy should be updated to include the API, so yes link to it but no it doesn’t cover it at the moment.
And yes, you can specify that the MusicBrainz API is provided by a third-party, the MetaBrainz Foundation.
bitmap, zas: available now, just reading the back logs.
zas
yvanzo: moooin
BrainzGit
[musicbrainz-server] 14reosarevok opened pull request #3094 (03master…MBS-13368): MBS-13368: Properly compare series relationships if series is target https://github.com/metabrainz/musicbrainz-serve...
zas
yvanzo: what are the exact dependencies for this stuff? I was thinking about quest, but this one isn't on 10.2.2.0/24 (but it is on 10.10.10.0/24) and not sure if we can run everything there, but for sure it has disk space and free to use right now;
also, we have many services only listening on 10.2.2.0/24, it can be an issue if there are to be accessed from 10.10.10.0/24 on machines not having 10.2.2.x IP.
yvanzo
zas: It needs to have access to pg (jimmy/hendrix) and possibly redis (aphex) through consul, and to sshd (aretha) through local config.
s/possibly// (for sure)
(pg through consul too)
zas
well, consul will report IP/port as provided by serviceregistrator, and that's usually 10.2.2.x IP, which is an issue for quest
so we need to find a way to expose needed services on 10.10.10.x too
that's tricky
atj: around?
the experiment I did yesterday shows we can get gateways using backends on 10.10.10.x (because they are parts of this network, and backend service run on it too, plus serviceregistrator provides correct IP/port)
would it be possible to use a mirror db (instead of directly accessing to jimmy/hendrix) ? Why does it need redis (which one?) ?
yvanzo
Those were the dependencies for mb production cron, I will check those for meb.o website too
zas: no, because it is also making changes to the DB.
zas
hmmm, then we need to get db accessible from 10.10.10.x, what needs to be done for it to happen? and it will not suffice... ok. Can't we run this stuff on hendrix and/or jimmy ? they have everything needed
yvanzo
It needs access to musicbrainz-redis-store-4 to expire some sessions, and access to musicbrainz-redis-cache-4 for caching.
hendrix is running the secondary DB iiuc, it could be the best choice.
zas
yes, and it's much simpler to set up
yvanzo
zas: where are the primary/secondary roles defined atm? (somewhere in ansible iirc)
atj
i'll be around in 30 mins or so
why does the dump fail on trille?
zas
disk space
trille doesn't have much of it, it wasn't meant to host huge dumps
Weird Indian pizzas. Lunchtime break at the Indian summit.
yvanzo
monkey: I need my breakfast now!
oops, mayhem ^
weird but at least no pineapple ;)
atj
how much disk space is required?
yvanzo
More than 100G
(disk space currently available on trille)
atj
is that compressed size?
yvanzo
+ the current volumes, let me check
more than 250G, total
it contains a lot of already compressed archives.
(250G is what trille provides for those volumes and free disk space left)
atj
that's not too bad then
yvanzo
(I mean 250G is currently not enough but not too far.)
If we can have 350G, we should be fine until we split those into separate containers that can run on different servers.
mayhem
yvanzo: I was informed that these are not the weird pizzas we wanted. These are standard fare. Super tasty.
monkey
They did look pretty normal
fletchto99 has quit
fletchto99 joined the channel
yvanzo
zas: just noticed that there is currently no temperature data for jimmy and hendrix
(I was trying to compare their respective activity as hendrix is also running the search indexer)
but otherwise they use about the same amount of memory, so hendrix definitely seems to be a good choice, I will prepare PRs
zas
yvanzo: yes, it is because the hardware is very new, and it seems there's no support yet for temp sensors on those. I guess we'll be able to get it work after some time passed.
yvanzo
👍
atj
yvanzo: I will create a dedicated ZFS dataset for the dumps that will need to be bind mounted in the containers
Maxr1998_ joined the channel
Maxr1998 has quit
bitmap
yvanzo: the primary/secondary status is mainly stored in the database cluster (and you can query PG to determine whether a server is primary). ideally it isn't hard-coded anywhere, but there are a couple places still: the barman and repmgr configuration files
the PRIMARY var in docker-postgres-cluster/Makefile actually isn't used, I'll remove it
yvanzo
bitmap: Ok thanks!
bitmap
btw, I nearly have a patch ready that will cause MB to direct a lot more read-only traffic to the standby...but if the current load average on jimmy is any indication, that hopefully shouldn't be any problem for running the dumps and whatnot
yvanzo
bitmap: Is there any reason for defining `RSYNC_FULLEXPORT_PORT` in `admin/config.incremental-json-dump.sh`instead of `script/services.sh`? Is it okay if I change that?
bitmap
yvanzo: yes, I think it makes senes to move them
it makes it easier to change the configuration without restarting the container (if you are setting the env variable on container start)
but restarting a cron container isn't an issue
yvanzo
It’s just that this has to be changed for moving stuff from trille to hendrix
bitmap
right, having it all in one repo makes things easier too.
bitmap: Also why do we have production cron on trille and the sshd for fullexport on aretha, while we have json dumper on aretha and the sshd for it on trille?
bitmap
heh
well, json dumps are on aretha because it had the most space available for it
and ftp is on aretha, which is where the full exports are synced to
relaxoMob has quit
the incremental json dumps are synced back to trille because that's where metabrainz.org is hosted
the full json dumps are sent to ftp
yvanzo
Ok, I didn’t notice that meb.o had a direct dependency on this volume.
I can see it from `services.sh` now.
Thanks!
relaxoMob joined the channel
bitmap
reosarevok: just remembered that for https://github.com/metabrainz/musicbrainz-serve... since prod and beta share a cache, we will have to deploy the {Data,Entity}::LinkAttributeType types to both prod and beta before we can put the rest of the patch on beta
reosarevok
Oh, ok. Do you see any reason why that'd be problematic?
(other than the annoyance of putting them on prod early)
bitmap
nope, I mainly just have to put it in a separate commit/PR that targets the production branch