in #metabrainz

10:21 AM
lucifer

`listenbrainz-timescale-pg13-data` has been in use since.
10:21 AM
mayhem[m]

is that mounted anywhere? if not, nuke it.
10:21 AM
lucifer

i can delete the volume to free up 700g
10:22 AM
mayhem[m]

how about typesense-data and typesense-new-data ?
10:22 AM
that sounds like there is something duplicated.
10:23 AM
lucifer

yes that can also be removed
10:24 AM
`/dev/md2 3.6T 2.1T 1.4T 61% /`
10:24 AM
:D
10:24 AM
mayhem[m]

I'll ask hetzner if we can get a AC102 with only two 7.68TB drives.
10:25 AM
lucifer

sounds great but might be an overkill for now. (7.68 drives seem to be expensive and we won't hit the limit for quite a while)
10:27 AM
mayhem[m]

ok, then lets leave this issue be for a little while. I don't feel that there are good options for what we currently need.
10:27 AM
lucifer

sure sounds good.
10:28 AM
mayhem[m]

but lets keep our eyes peeled on this.
10:28 AM
lucifer

yup makes sense, thanks!
10:29 AM
to be clear, so also postponing the LB backup server thing for now?
10:29 AM
mayhem[m]

it doesn't quite feel that the options are right for us now. I understand the need, but the path looks kinda meh.
10:30 AM
for all the reasons mentioned above.
10:30 AM
lucifer

yup makes sense
10:30 AM
mayhem[m]

we can't easily get another gaga class machine without it costing close to 300/month.
10:30 AM
lucifer

true
10:31 AM
i guess buddy has enough backup so that we can setup barman at least?
10:31 AM
rimskii[m]

<lucifer> "try this instead" <- still not working :(
10:31 AM
lucifer

what error do you get rimskii[m] ?
10:32 AM
rimskii[m]

the same
10:32 AM
onnection to server at "127.0.0.1", port 5433 failed: Connection refused
10:32 AM
Is the server running on that host and accepting TCP/IP connections?
10:32 AM
lucifer

it will be different somehow because i changed the ports, share the new error.
10:32 AM
mayhem[m]

buddy: /dev/md1 7.3T 959G 6.3T 14% /data2
10:32 AM
lucifer

great,
10:32 AM
mayhem[m]

looks that way.
10:32 AM
lucifer

rimskii[m]: can you share the ssh command output?
10:33 AM
to check if the port forwarding worked.
10:34 AM
rimskii[m] uploaded an image: (1643KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/LVXQvXnSMgENHyfmcCtrEcgV/Screenshot%202024-06-19%20at%2015.34.13.png >
10:34 AM
rimskii[m] uploaded an image: (1766KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/RBFOPVUeACRtVFhDUAJDSfiB/Screenshot%202024-06-19%20at%2015.33.37.png >
10:35 AM
rimskii[m]

mybe I shoud have run first "ssh rimskii@wolf.metabrainz.org"; then "ssh -L 5432:localhost:5432 rimskii@wolf.metabrainz.org"?
10:38 AM
lucifer

rimskii[m]: the command is outdated.
10:38 AM
you have to do `ssh -L 5433:localhost:5432 rimskii@wolf.metabrainz.org`
10:38 AM
note the 5433:
10:40 AM
monkey[m]

That was the command rimskii ran, from what I can tell from the logs.
10:41 AM
rimskii[m]

lucifer: yep i tried it too
10:41 AM
lucifer

oh i see, i didn't look at the logs but just the command.
10:41 AM
hmm, that should work.
10:41 AM
according to logs, the port forwarding worked.
10:41 AM
monkey[m]

Could it be something regarding "127.0.0.1:5432" vs. "localhost:5432" ?
10:42 AM
I mean, I assume not, but ...
10:42 AM
lucifer

shuoldn't be. but its a mac
10:45 AM
monkey[m]

I tried to connect in the same way, works for me (on linux)
10:45 AM
One thing I had to adjust to be able to connect to the DB was username and password
10:45 AM
lucifer

the latest error points to a port forwarding error
10:48 AM
atj[m]

you can see from the logs that it is listening on 127.0.0.1 and ::1
10:48 AM
so localhost vs. 127.0.0.1 is irrelevant
10:50 AM
I will test on my MacBook but never had any issues on Linux vs. MacOS in this area
10:51 AM
monkey[m]

Same here, used to do port forwarding fine on macos
10:53 AM
Maybe something related to the macos firewall?
10:55 AM
mayhem[m]

or a university/home router configuration?
10:56 AM
atj[m]

not sure how that could make any difference, it's a local port forward
10:56 AM
so the router isn't involved
10:57 AM
rimskii: what terminal are you using?
10:59 AM
monkey[m]

Sorry if I'm way out, but where is the connection error coming from? Inside Docker? if so, does the local port 5433 need to be added to the docker container ports?
10:59 AM
rimskii[m]

atj[m]: the default one? mac?
10:59 AM
lucifer

mayhem[m]: atj[m]: i just hopped on a call with rimskii port forwarding is working. but the service that needs to access the port is inside docker.
10:59 AM
monkey[m]: yeah that
11:00 AM
monkey[m]

Might be good to try connecting with psql to remove any potential docker-related issue
11:00 AM
atj[m]

rimskii[m]: you mean terminal
11:00 AM
mayhem[m]

ah
11:00 AM
BrainzGit

[mb-solr] 14yvanzo opened pull request #58 (03master…solr-9.6.1): Upgrade Solr version to 9.6.1 and other dependencies https://github.com/metabrainz/mb-solr/pull/58
11:00 AM
rimskii[m]

atj[m]: yeah
11:00 AM
atj[m]

iTerm is way better FWIW :)
11:00 AM
monkey[m]

psql --U musicbrainz --host localhost --port 5433
11:00 AM
atj[m]

lucifer: ah, so that's an extra layer of fun
11:01 AM
monkey[m]

s//`/, s/--/-/, s//`/
11:01 AM
atj[m]: I second that
11:01 AM
rimskii[m]

atj[m]: oh i will try it then
11:02 AM
atj[m]

not really relevant at this point but just a suggestion
11:02 AM
monkey[m] backs out of everyone's business
11:02 AM
lucifer

rimskii[m]: can you replace `localhost` in MB_DATABASE_URI with `host.docker.internal` ?
11:03 AM
mayhem[m]

wow, that's shitty hetzner: "It´s possible to remove the default drives but there will be no price reduction for it."
11:04 AM
rimskii[m]

lucifer: omg it works!
11:05 AM
now i got a new error haha
11:05 AM
rimskii[m] uploaded an image: (145KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/runZuVjtOYXpNkFuknRoSuNM/Screenshot%202024-06-19%20at%2016.04.39.png >
11:05 AM
mayhem[m]

one step forward....
11:05 AM
lucifer

rimskii[m]: i see, we don't have the mapping tables on wolf but i can run a script to create them there.
11:05 AM
will take a few hours for it complete.
11:05 AM
yvanzo[m]

atj: Is a new snapshot needed for upgrading Solr version in the cluster? Or should we test it separately?
11:06 AM
rimskii[m]

lucifer: mybe i can do that
11:06 AM
atj[m]

yvanzo[m]: I don't think so, if it works in 9.6.0 it will work in 9.6.1
11:08 AM
If we do it separately I can write docs on that too :)
11:10 AM
lucifer

rimskii[m]: sure. so there are two ways to do it, 1 is running the command locally with the databases connected over ssh but that means you need a stable network connection for a few hours at least.
11:10 AM
yvanzo[m]

OK, let’s try upgrading Solr 9.6.1 at first then.
11:10 AM
lucifer

alternative is cloning the listenbrainz-server on wolf. and running the commands from there.
11:10 AM
yvanzo[m]

I’m building the snapshot in parallel.
11:10 AM
lucifer

i think this way will work better for you
11:11 AM
mayhem[m] fires off another salvo of chatgpt drivel to hetzner.
11:11 AM
rimskii[m]

lucifer: ok
11:16 AM
atj[m]

yvanzo: just FYI, IME Solr can sometimes refuse to stop in an orderly fashion and the stop script waits 180 seconds before killing it
11:16 AM
so if you see the restart task hanging, don't worry it's normal (ish)
11:16 AM
just wait
11:24 AM
yvanzo[m]

atj: I pushed a commit to the `solr` branch and update the node 8 as suggested.
11:26 AM
lucifer

rimskii[m]: let me know when have cloned the repo and i can help update the script and let you know the commands to build the cache.
11:26 AM
atj[m]

yvanzo: LGTM, want to upgrade the entire cluster?
11:27 AM
yvanzo[m] uploaded an image: (83KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/chatbrainz.org/WGSgqrySvDbPSeUWVXmkoNhQ/errors-when-updating-solr.png >
11:27 AM
yvanzo[m]

Those are transient errors that occurred when updating the node 8.
11:27 AM
rimskii[m]

lucifer: okay
11:27 AM
i have to build the docket too right?
11:27 AM
atj[m]

yvanzo[m]: that's normal AFAIU
11:28 AM
lucifer

rimskii[m]: yes but different containers.
11:28 AM
yvanzo[m]

atj: Should we pause between updating each node to preserve service availability?
11:30 AM
-f1 is certainly useful
11:32 AM
rimskii[m]

lucifer: which containers?
11:33 AM
cloned the repo & updated config
11:33 AM
yvanzo[m]

atj: Ideally we should have a `wait_for` the node is available again before moving to another task.
11:33 AM
atj: That would be for production though, I’ll proceed as documented for now.
11:34 AM
atj[m]

yvanzo: https://github.com/metabrainz/ansible-role-solr...
11:34 AM
yvanzo: do you want to wait a second and we can try using the serial strategy: https://docs.ansible.com/ansible/latest/playboo...
11:35 AM
yvanzo[m]

OK.
11:35 AM
atj[m]

it should only need a minor update to the playbook
11:39 AM
yvanzo: can I squash your commit into mine?
11:39 AM
I need to force push to the solr branch really otherwise it gets messy and I just want to merge one big commit
11:43 AM
yvanzo[m]

atj: No problem.
11:44 AM
lucifer

rimskii[m]: `cd mbid_mapping` then `cp config.py.sample config.py`.
11:44 AM
edit `config.py`, set `MBID_MAPPING_DATABASE_URI` and `MB_DATABASE_MASTER_URI` to `dbname=musicbrainz_db user=musicbrainz host=db port=5432 password=musicbrainz`
11:48 AM
kellnerd[m]

monkey: My MetaBrainz Tabler icon [pull request](https://github.com/tabler/tabler-icons/pull/1171) is up, FYI. Let's see if it is really that simple to get an icon added.
11:51 AM
atj[m]

yvanzo: i've added support for setting the batch size using `serial` and updated the documentation: https://github.com/metabrainz/metabrainz-ansibl...
11:51 AM
https://github.com/metabrainz/metabrainz-ansibl...
11:52 AM
this is better than -f1 because it means the entire play will be run on each node serially rather than each task
11:53 AM
so the "restart" and the "wait for start" handlers will run before moving on to the next host
11:53 AM
kellnerd[m]

<kellnerd[m]> "monkey, mayhem: Since my GSoC..." <- ^ Reminder just in case you have missed my message mayhem, last year an org admin was required to do that change on the SoC page IIRC (CC monkey).
11:56 AM
rimskii[m]

<lucifer> "edit `config.py`, set `MBID_MAPP..." <- done !
11:57 AM
lucifer

rimskii[m]: run `./build.sh`
11:58 AM
yvanzo[m]

atj: Now testing!
11:59 AM
rimskii[m]

lucifer: done !
12:00 PM
now "python mapper/manage.py canonical-data"?
12:00 PM
lucifer

run `tmux`
12:01 PM
then `docker run --rm -it --network musicbrainz-docker_default metabrainz/mbid-mapping python3 manage.py create-all`
12:03 PM
rimskii[m] uploaded an image: (515KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/qygdBSoPyHjbKNBmzZarLYMH/Screenshot%202024-06-19%20at%2017.03.04.png >
12:04 PM
rimskii[m]

lucifer: should I change sqlaclhemy one too?
12:08 PM
lucifer

rimskii[m]: hmm i see, okay try this instead. `docker run --rm -it --network musicbrainz-docker_default metabrainz/mbid-mapping python3 manage.py canonical-data --use-mb-conn`
12:11 PM
yvanzo[m]

atj: Oops, I cannot edit solr.yml at the same time.
12:12 PM
(?)
12:12 PM
atj[m]

what do you mean?
12:13 PM
yvanzo[m]

I was editing solr.yml in the meantime and realized that ansible-playbook was still running, so interrupted it, restored the file, and started it again.
12:14 PM
Not sure if it is watching changes to solr.yml though.
12:14 PM
atj[m]

oh right, yes it's read when the process starts
12:14 PM
iconoclasthero has quit
12:16 PM
yvanzo[m]

It is actually downloading the JAR again when updating Solr.
12:17 PM
rimskii[m]

<lucifer> "rimskii: hmm i see, okay try..." <- done ! it works
12:18 PM
lucifer

rimskii[m]: did it finish already?
12:18 PM
yvanzo[m]

atj: Would there be a way to prevent routing requests to a node during the update?
12:18 PM
rimskii[m] uploaded an image: (1887KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/UcpsbUTxTdaTvElKqSBvakNw/Screenshot%202024-06-19%20at%2017.18.40.png >
12:18 PM
rimskii[m]

lucifer: well seems so