maybe the storage box was mounted as the incorrect user?
2021-05-19 13952, 2021
_lucifer
as it shows the UID and GID as 0 for the directory, but we specifically create an lbdumps user with UID and GID 900 to create the dump files.
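A minimal sketch of the ownership check described above, assuming the dump directory should be owned by UID and GID 900 as stated in the log. The function name `ownership_problems` and the example path are hypothetical and not taken from the ListenBrainz code.

```python
import os


def ownership_problems(path, expected_uid, expected_gid):
    """Return a list of human-readable mismatches between the actual
    owner of `path` and the expected UID/GID (empty list if all is well)."""
    st = os.stat(path)
    problems = []
    if st.st_uid != expected_uid:
        problems.append(f"uid is {st.st_uid}, expected {expected_uid}")
    if st.st_gid != expected_gid:
        problems.append(f"gid is {st.st_gid}, expected {expected_gid}")
    return problems


# Hypothetical usage: a mount owned by root would report uid/gid 0 here.
# print(ownership_problems("/mnt/dumps", 900, 900))
```

Running a check like this before the dump starts would surface a wrongly mounted storage box early instead of partway through a dump.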
2021-05-19 13932, 2021
_lucifer
alastairp, we are already adding sentry in the startup improvement PR. How about we add sentry to check cron status as well? maybe tee the output to it and check exit status or check the log files periodically for errors?
2021-05-19 13928, 2021
_lucifer
or we could make it a part of the dump script
2021-05-19 13913, 2021
_lucifer
Mr_Monkey: should the ReportUser button be a separate component or should I just put it in the UserPageHeading component?
2021-05-19 13932, 2021
rdswift has quit
2021-05-19 13920, 2021
rdswift joined the channel
2021-05-19 13931, 2021
ruaok
mooin!
2021-05-19 13941, 2021
_lucifer
morning!
2021-05-19 13957, 2021
ruaok
_lucifer: I guess it didn't get remounted correctly after reboot. when I mounted it by hand I made sure to use the right user.
2021-05-19 13907, 2021
ruaok
gid 0 is deffo wrong.
2021-05-19 13955, 2021
_lucifer
yeah, thought so. nice that we'll be able to get rid of the box after the upgrade.
2021-05-19 13912, 2021
ruaok
my thoughts exactly. we don't need another vector for failures
2021-05-19 13947, 2021
ruaok
the storage box failed to mount entirely, that is the problem.
should we trigger an incremental dump manually to verify?
2021-05-19 13942, 2021
ruaok
yes, but...
2021-05-19 13953, 2021
ruaok
let's check what incremental dump version we're on.
2021-05-19 13957, 2021
alastairp
hi _lucifer
2021-05-19 13900, 2021
alastairp
(and everyone else)
2021-05-19 13901, 2021
_lucifer
hi!
2021-05-19 13903, 2021
alastairp
great sleuth work _lucifer
2021-05-19 13944, 2021
ruaok
if we skipped one (or more) then we ought to roll back the dump ID. otherwise the incremental won't import and we'd need to do another full dump and wait another 24 hours.
2021-05-19 13927, 2021
ruaok
full and incremental dump 440 are on the ftp site.
2021-05-19 13937, 2021
alastairp
_lucifer: instead of cron, we could probably get away with using metrics
2021-05-19 13941, 2021
ruaok
if next dump ID is 441, then let 'er rip
2021-05-19 13943, 2021
alastairp
uh, instead of sentry
2021-05-19 13958, 2021
_lucifer
ah right! that works too :D
2021-05-19 13917, 2021
ruaok
> 440 | 2021-05-16 11:41:50+00
2021-05-19 13928, 2021
ruaok
is the last dump in the dump_table. good to go to create an incremental, _lucifer
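The check discussed above (published dump 440, last dump table entry 440, so the next incremental should be 441) can be sketched roughly as follows. This is an illustration only: the function name is hypothetical, and it assumes dump IDs increase by exactly one per dump, which the log implies but does not state.

```python
def next_incremental_id(last_published_id, last_dump_table_id):
    """Return the ID the next incremental dump should get, or raise if the
    dump table has moved past what was actually published (a skipped dump)."""
    if last_dump_table_id != last_published_id:
        raise ValueError(
            f"dump table is at {last_dump_table_id} but {last_published_id} "
            "is the last published dump; roll back the dump ID first"
        )
    # Assumption: IDs are consecutive integers.
    return last_dump_table_id + 1


# With the values from the log, the next dump would be 441.
print(next_incremental_id(440, 440))
```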
2021-05-19 13955, 2021
ruaok
if it fails, but the data is generated and the dump id is increased, we have to copy the data by hand, ok?
2021-05-19 13937, 2021
_lucifer
oh! didn't know that.
2021-05-19 13933, 2021
_lucifer
i'll trigger the dump. let's see how it goes.
2021-05-19 13923, 2021
alastairp
I have a load of washing on, I'll make my way to the office when it's finished
2021-05-19 13918, 2021
ruaok is waiting for cocktail robot parts from CA. will come in after they arrive
2021-05-19 13934, 2021
_lucifer
seems it failed again
2021-05-19 13947, 2021
_lucifer
mktemp: failed to create directory via template ‘/mnt/dumps/tmp/archives/incremental.XXXXXXXXXX’: No such file or directory
2021-05-19 13917, 2021
ruaok
ah yes, I have a bug open for the fact that it doesn't create the needed subdirs.
alastairp: _lucifer: ^^ the changed code fragments were tested in an out-of-body script, since it is hard to test in-script. but that ought to work.
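The `mktemp` error above occurs because the parent directory of the template does not exist. A minimal Python sketch of the fix (create the parent first, then the unique temporary directory) is shown here; it is not the actual dump script, and the function name `make_archive_tmpdir` is hypothetical.

```python
import os
import tempfile


def make_archive_tmpdir(parent, prefix="incremental."):
    """Create `parent` (and any missing ancestors) if needed, then create
    and return a uniquely named temporary directory inside it."""
    os.makedirs(parent, exist_ok=True)
    return tempfile.mkdtemp(prefix=prefix, dir=parent)


# Hypothetical usage with the path from the error message:
# make_archive_tmpdir("/mnt/dumps/tmp/archives")
```

In a shell script the same idea would be a `mkdir -p` on the parent directory before the `mktemp -d` call.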
2021-05-19 13925, 2021
Mr_Monkey
_lucifer: > should the ReportUser button be a separate component
2021-05-19 13925, 2021
Mr_Monkey
I'd say so, yes. That will save some time if (when?) we refactor that code
2021-05-19 13944, 2021
_lucifer
👍
2021-05-19 13909, 2021
BenOckmore joined the channel
2021-05-19 13910, 2021
ruaok
_lucifer: with the recent BU changes what happens when a redis key is set with no expiry time?
2021-05-19 13947, 2021
ruaok
I didn't set one and my metric values are written to redis, but they disappear immediately. I can't find any running metric-writers and I've stopped the one that is supposed to be running.
2021-05-19 13949, 2021
_lucifer
ruaok, you always have to set an expiry time.
2021-05-19 13909, 2021
ruaok
why did it used to work??
2021-05-19 13921, 2021
ruaok
`cache._r.rpush(REDIS_METRICS_KEY, metric)`
2021-05-19 13923, 2021
_lucifer
the `set` used to have a default for time to be 0. we removed the default, now the user has to pass a time explicitly. you can pass 0 if you want no expiry.
2021-05-19 13954, 2021
ruaok
I'm not using set. I'm using rpush directly.
2021-05-19 13911, 2021
_lucifer
ah you are using `rpush` so the redis library directly. the BU changes shouldn't affect you then
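One way to confirm that expiry is not what makes the values disappear is to inspect the key's TTL right after the `rpush`. The sketch below assumes a redis-py style client with a `ttl` method; Redis reports `-1` for a key with no expiry and `-2` for a key that does not exist. The helper name `describe_key_expiry` is hypothetical.

```python
def describe_key_expiry(client, key):
    """Summarise the expiry state of `key` using the client's TTL call.

    `client` is assumed to be a redis-py client (or anything exposing a
    compatible `ttl(key)` method).
    """
    ttl = client.ttl(key)
    if ttl == -2:
        return "missing"      # key does not exist (never written, or already consumed)
    if ttl == -1:
        return "no expiry"    # key exists and will not expire on its own
    return f"expires in {ttl}s"


# Hypothetical usage, e.g. with cache._r from the log:
# print(describe_key_expiry(cache._r, REDIS_METRICS_KEY))
```

If this reports "no expiry" immediately after a push and "missing" shortly afterwards, something else is removing or consuming the list, rather than a TTL.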