in #metabrainz

11:27 AM
Freso

zas: Whenever you have a moment: https://community.metabrainz.org/t/discourse-do... :)
11:29 AM
lucifer

https://www.irccloud.com/pastebin/CINnmeJY/
11:29 AM
but otherwise also lot of differences in conf.
11:30 AM
some of those are commented so don't matter but i guess best to copy over the .conf file from old to new cluster?
11:32 AM
yeah this also does it https://github.com/metabrainz/docker-postgres-c...
11:32 AM
alastairp

lucifer: yeah, I'm doing this process as well, and it looks like I'm at about the same stage as you. let me look at this diff. I almost agree that we should just copy it, though we should also verify that there are no new options that arrived in 11 or 12
11:33 AM
cool, I also see the update_extensions.sql message. this just has `ALTER EXTENSION "timescaledb" UPDATE;` in it, which is great - because that's what we need to do anyway, right?
11:33 AM
lucifer

yes
11:34 AM
alastairp

and I see the message about re-running vaccum/analyze, so that seems good
11:34 AM
were you able to start up the new cluster?
11:35 AM
lucifer

not yet. trying to do that currently.
11:36 AM
alastairp

https://www.irccloud.com/pastebin/bAjZmq5M/
11:38 AM
skelly37 joined the channel
11:40 AM
skelly37 has quit
11:40 AM
skelly37 joined the channel
11:42 AM
lucifer

alastairp, my container has closed connection so i had to start another timescale one. yup works for me as well.
11:42 AM
i ran alter extension command but got error that it already exists and which probably makes sense.
11:42 AM
running vacuum, errors huh.
11:42 AM
ERROR: could not resize shared memory segment "/PostgreSQL.175590062" to 67128672 bytes: No space left on device
11:43 AM
alastairp

well, that error is understandable :)
11:43 AM
ah, but I see that bono has plenty of space
11:43 AM
are you using the pre-generated vaccum script?
11:43 AM
lucifer

/dev/md2 4.0T 1.8T 2.0T 49% /
11:44 AM
yeah. no not pre-generated script.
11:44 AM
just vacuum in session while connected to db
11:44 AM
it worked when i ran as postgres user.
11:55 AM
Lorenzo[m] joined the channel
12:00 PM
Lorenzo[m]

Hi folks, in the last few days I experienced some issues related to scrobbling on LB. What I send to LB is simply not listed on my profile (I'm pretty sure it's not an error on my side)
12:01 PM
I've checked the Bug Tracker and there is noting related to this issue (at least not in the last few weeks)
12:01 PM
Is it a known problem or should I open a ticket?
12:06 PM
alastairp

Lorenzo[m]: oh hi!
12:07 PM
Lorenzo[m]: we did in fact have another report from another person here, and today we're going to do an upgrade of our database to help us fix this issue, so fingers crossed this will fix your problem too
12:09 PM
Lorenzo[m]

Oh nice, I'll try to scrobble some music tomorow and I'll check if everything is fixed
12:10 PM
Thank you for your time folks, I really appreciate the project and your efforts
12:10 PM
atj

Lorenzo[m]: <3
12:12 PM
alastairp

lucifer: oh, I just had a thought. if we don't use --link, then pg_upgrade is going to copy all data files anyway. it's going to take longer (need to copy all files), but it's not going to touch the old ones, so maybe we can get by without doing a backup?
12:13 PM
alternatively, we make a network backup to another server, we could start with rsync now, and then re-run it to catch up modified files once we take the cluster down
12:14 PM
lucifer

yes right.
12:14 PM
skelly37 has quit
12:14 PM
i think MB did rsync last time.
12:14 PM
skelly37 joined the channel
12:15 PM
https://github.com/metabrainz/docker-postgres-c...
12:17 PM
alastairp

right, but is that just copying the data directory to the replica for quicker startup?
12:18 PM
800gb is going to take 2 hours to copy somewhere else over gigabit at least
12:18 PM
lucifer

oh ok, makes sense.
12:21 PM
alastairp: i am unsure which is better. your call.
12:22 PM
alastairp

zas: do we have a server with 900gb free space that we can rsync to?
12:23 PM
v6lur joined the channel
12:25 PM
atj

postgres data file compress really well btw
12:25 PM
*data files
12:26 PM
alastairp

atj: ah, interesting, might try that
12:26 PM
atj

although it depends on the contents
12:27 PM
alastairp

atj: let me get you up to date - we're doing a pg 11 to 13 upgrade. because we run pg in a docker container, we have the data in a volume. however, if we mount 2 volumes (1 for 11-data and 1 for 13-data) we can't hardlink between them with pg_upgrade, because they're different logical disks :(
12:27 PM
atj

makes sense
12:28 PM
zas

alastairp: kiss has 1.07Tb free on /dev/md2
12:28 PM
alastairp

so now we're wondering about doing the copy version of the upgrade, but we'd like enough disk space (on gaga) to have 1) the v11 data, 2) a backup of it in case anything goes wrong, 3) the v13 data - but gaga doesnt have enough disk for this (db is about 770gb)
12:28 PM
zas: yes, I was just looking through servers and found that, thanks
12:29 PM
atj: so - if you have any postgres upgrade and/or docker/volume/hardlink experience, it'd be interesting to hear your thoughts
12:29 PM
lucifer: did you try the upgrade with --link with both data directories in the same volume?
12:30 PM
atj

well, hardlinks definitely aren't going to work unless you're in the same volume
12:30 PM
lucifer

alastairp: no haven't done yet.
12:31 PM
Freso

reosarevok: Not sure if you missed this: https://community.metabrainz.org/t/announcement...
12:31 PM
alastairp

atj: I had hoped that because these volumes were on the same partition on the host, it'd just work :)
12:32 PM
reosarevok

Freso: iirc I transcluded but I guess I didn't answer?
12:32 PM
atj

alastairp: have you tried creating manual hardlinks?
12:33 PM
it would work from the host, but I don't think the container can see that they're the same filesystem
12:33 PM
alastairp

lucifer: oh, one other thing - we should decide if we want to keep running on pg/timescale on debian, and if so decide which base image, how to install it, and how to start it (because the debian version splits the config/data directories, but our alpine data dir has the combined directory)
12:34 PM
zas

atj: should we disable IPv6 on shorewall for now?
12:34 PM
alastairp

atj: on the host or in the container? this is done by pg_upgrade (in the container), so I don't think I can do it myself on the host and have it work
12:34 PM
atj

zas: I think so yes :/
12:35 PM
the role is nearly done, just working on it now
12:36 PM
alastairp

lucifer: actually, https://github.com/metabrainz/docker-postgres-c... is basically good for us, I think it'd be a good idea to base it off of this
12:37 PM
atj

alastairp: does the container have a shell? can you exec in and touch a file on one volume then try to hardlink it to the other volume?
12:38 PM
eg. docker exec -ti <container> bash
12:38 PM
alastairp

https://www.irccloud.com/pastebin/1fMNYoYE/
12:39 PM
atj

worth a try but as expected
12:39 PM
alastairp

https://bbs.archlinux.org/viewtopic.php?id=241866 seems to indicate that overlay2 causes it, and there are workarounds like using devicemapper as the storage driver
12:39 PM
but not something I want to get into now
12:40 PM
atj

my suggestion would be to create a compressed backup using tar and send it to another machine via ssh
12:41 PM
best to have a backup regardless
12:41 PM
alastairp

any suggestions based on time tradeoff between just copying 700gb, or waiting for it to compress first and then copying it?
12:41 PM
atj

compression will be worth it IMO
12:41 PM
alastairp

bzip or gzip?
12:41 PM
atj

I'd recommend using lzip for better speed
12:41 PM
alastairp

or lzip!
12:42 PM
atj

needs installing
12:42 PM
alastairp

installed on gaga
12:42 PM
what's the tar flag for lzip?
12:42 PM
atj

alastairp: also, use pv so you can see the speed of the backup
12:43 PM
--lzip
12:43 PM
alastairp

tar | pv | lzip, to be able to measure against the real size of the data directory?
12:43 PM
flags to tar to preserve users and permissions?
12:44 PM
(or is that an extraction parameter)
12:45 PM
atj

tar --lzip -cf - /path |pv |ssh foo "cat > file.tar.lzip"
12:46 PM
alastairp

thanks!
12:46 PM
atj

I'd run the command from the root of the volume you are backing up
12:46 PM
alastairp

ah, so you directly stream the compressed file, rather than write to file then copy
12:46 PM
atj

just so the archive has relative paths - easier to unpack
12:46 PM
exactly
12:47 PM
alastairp

but pv is going to be reporting the compressed size, right?
12:47 PM
so we'll expect it to be smaller than the size of the data dir, but we don't know how much smaller
12:47 PM
atj

you can't know until you do it unfortunately
12:47 PM
you could try compressing a few files to see
12:48 PM
alastairp

yeah, which is why I was thinking of tar -cf - /path | pv | lzip | ssh
12:48 PM
atj

that should work
12:48 PM
you just won't know the transfer speed over ssh
12:48 PM
alastairp

let me try it on another machine I have
12:49 PM
yeah, hopefully close to gige less overhead, but unclear
12:49 PM
yvanzo

reosarevok: Has the old docker volume staticbrainz-data been replaced with musicbrainz-static-build-prod and musicbrainz-static-build-beta?
12:51 PM
atj

alastairp: you might want to try lzma vs. lzip, I always get confused between these various lz compression algos
12:54 PM
v6lur has quit
12:55 PM
v6lur joined the channel
12:58 PM
alastairp

atj: hmm, adding lzma in the middle makes it 100x _slower_
12:59 PM
atj

CPU limited?
12:59 PM
alastairp

lzma: [1.27MiB/s], no compression: [ 110MiB/s], gzip: [21.5MiB/s]
12:59 PM
checking now
13:00 PM
yes, 100% cpu
13:01 PM
atj

lzip?
13:01 PM
alastairp

other option is tar -> file, pbzip2, copy file, but there's no parallelism there
13:02 PM
atj

zstd (yes another algo) supports multiple threads
13:02 PM
alastairp

even with input from stdin?
13:02 PM
atj

this is what I'm wondering
13:03 PM
I think these more advanced algos have large window sizes which requires more buffering in RAM
13:04 PM
alastairp

lzip -0 says that it's about as fast as gzip, and I'm seeing speeds about the same, but it's still CPU-bound rather than network-bound
13:04 PM
atj

not sure how much difference letting tar do the compression would make
13:04 PM
alastairp

2 minutes to copy a 2gb file, compared to 20sec using just rsync
13:05 PM
does hetzner offer 10ge yet? :)
13:05 PM
trying that now
13:08 PM
no, that's just as slow
13:08 PM
atj

alastairp: on my system "tar --zstd -cf -"
13:08 PM
1.46GiB 0:00:17 [83.3MiB/s]
13:09 PM
lzma and lzip were very slow
13:09 PM
alastairp

yes, zstd is 3x faster than lzip or gzip immediately
13:11 PM
no compression: [ 110MiB/s], pv|zstd: [ 123MiB/s]
13:11 PM
so yes, zstd is giving a slight advantage
13:11 PM
atj

I just compressed a 4.7GB VDI to 1.9GB at pretty much line speed
13:13 PM
looks like that's your best bet, whether the compression is worth it I don't know.
13:13 PM
lucifer

alastairp: the MB docker setup is definitely useful for us to base on but not sure what all we need to test and prepare for that move so probably best to not do it now?
13:14 PM
alastairp

oh yeah - I'm just doing a test with the pg dir now, it's hovering around 300MiB/sec with zstd (measuring with pv before going into zstd), so we're about 3x faster than just a regular copy over the same network link
13:14 PM
thanks atj!
13:14 PM
atj

glad it worked :)
13:15 PM
alastairp

still 40 mins to do the backup
13:15 PM
atj

make sure to check the tar archive once it's completed - "tar --zstd -tf filename.tar.zstd"
13:16 PM
skelly37 has quit
13:16 PM
skelly37 joined the channel
13:18 PM
actually, tar does recognise the file extension so you can do "tar -vtf <filename>.tar.zstd"
13:20 PM
alastairp

lucifer: yeah, that's a good point. so maybe we drop the indexes in current image, bring it up in the migrate image, do the upgrade, bring it up in the new timescale image, recreate indexes ?
13:20 PM
lucifer: I had a look at the config file diff, there are some things I don't understand, but I think it'll be OK to copy directly in place
13:22 PM
lucifer

alastairp: are we going to do with --link or without? for without, we could keep the indexes and drop/recreate in the new image in case we need to bring up old cluster back up.
13:22 PM
alastairp

lucifer: I know you have a few items on the migration doc for preparing listenstore downtime, do you need time for that?