OK, I'll put it in /pub/musicbrainz/canonical_data. For now I'll make a new key, but I'll open an MBH ticket for streamlining this with atj's suggestion of a wrapper script
mayhem
good plan
atj
i can sort the wrapper script out
alastairp
atj: although see also my other comment about the start_sshd_musicbrainz_fullexport function, it's possible that we can use a single key per project just by moving the locations of volumes
mayhem: I was looking at other uses of execute_values in LB and they also seem to work with lists of tuples like you originally had, so not sure what's going on there
mayhem: oh, I don't think you showed me that exception
ArjunM joined the channel
lucifer
ok, that won't work i think. you need to use POST. [{"artist_mbid": uuid1, "algorithm": alg}, {"artist_mbid": uuid2, "algorithm": alg}]
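A minimal sketch of the request body shape lucifer describes, assuming the endpoint accepts a JSON list with one object per artist. The helper name `build_similar_artists_payload` and the algorithm string used in the usage note are hypothetical; the endpoint URL is deliberately left out because it is not given here.

```python
import json


def build_similar_artists_payload(artist_mbids, algorithm):
    """Build one {"artist_mbid", "algorithm"} object per MBID.

    Hypothetical helper: only the shape of the list comes from the
    discussion, the function itself is an illustration.
    """
    return [{"artist_mbid": mbid, "algorithm": algorithm} for mbid in artist_mbids]


def payload_as_json(artist_mbids, algorithm):
    # Serialise the list so it can be sent as the body of a POST request.
    return json.dumps(build_similar_artists_payload(artist_mbids, algorithm))
```

For example, `payload_as_json([uuid1, uuid2], "some_algorithm")` yields a JSON array with two objects that share the same algorithm value.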
mayhem
I swear, I can be on a tear with programming, but when it comes time to pass args to psycopg2, it's a given I'm going to lose a couple of hours of work due to this fuckery.
alastairp
looks like it's treating the artistid as a single string "mbid,mbid" rather than a list
mayhem
hmm.
lucifer
but it will always use the algorithm from the first item even if you pass different ones in items (improvements pending).
alastairp
or judicious use of `artist_mbids = mbid.split(",")`
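A minimal sketch of the comma-split workaround, assuming the incoming value is a single string such as "mbid,mbid". The function name `parse_artist_mbids` is hypothetical, and the UUID validation is an addition for illustration, not something described in the discussion.

```python
import uuid


def parse_artist_mbids(raw):
    """Split a comma separated string of MBIDs into a list of strings.

    Each part is validated with uuid.UUID, which raises ValueError on
    malformed input; empty parts (e.g. a trailing comma) are skipped.
    """
    mbids = []
    for part in raw.split(","):
        part = part.strip()
        if not part:
            continue
        mbids.append(str(uuid.UUID(part)))
    return mbids
```

The resulting list can then be passed on as a real list of values rather than one string that only looks like a list.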
mayhem
lucifer: that is a fine solution if you ask me.
alastairp
lucifer: (hi!), are you around for a bit?
mayhem
well, now that you're here, could you please make similar-artists correctly accept [artist_mbid] as an argument?
alastairp
Currently I've named the dump this ("Metabrainz metadata"), should we change it to "musicbrainz canonical" perhaps?
mayhem
lucifer: I get a 500 error if I pass in a comma separated list of artist_mbids or a list of mbids.
see the last 4 errors in sentry.
alastairp: that suggestion seems good.
lucifer
mayhem: yes, it doesn't work that way currently.
mayhem
I'd like it to work that way, please. :)
lucifer
alastairp's comma suggestion should work then. should be fine as a temp fix.
ArjunM
Hi team. I'm Arjun, a CS undergrad with some experience in JavaScript, Java and SQL. I'm new to this community and I look forward to contributing to MetaBrainz as part of GSoC'23 and otherwise.
jasje_
akshaaatt: should i clean up anything related to lookup and collections?
agatzk has quit
akshaaatt
Yes jasje_ , please do so
BrainzGit
[listenbrainz-server] alastair opened pull request #2348 (master…canonical-data-dump-csv): Return artist mbids as comma separated instead of postgres array format https://github.com/metabrainz/listenbrainz-serv...
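A rough illustration of the difference the PR title refers to, assuming the Postgres array text form looks like "{mbid1,mbid2}". This is not the change made in the PR, which is not shown here; `pg_array_text_to_csv` is a hypothetical helper that only handles simple one-dimensional arrays without embedded commas.

```python
def pg_array_text_to_csv(value):
    """Turn a Postgres array literal such as '{a,b}' into 'a,b'.

    Assumption: elements are simple values (like UUIDs) that contain
    no commas, braces or escaped quotes.
    """
    inner = value.strip()
    if inner.startswith("{") and inner.endswith("}"):
        inner = inner[1:-1]
    # Drop optional double quotes around elements and skip empty parts.
    parts = [p.strip().strip('"') for p in inner.split(",") if p.strip()]
    return ",".join(parts)
```

A consumer of the CSV dump can then recover the individual MBIDs with a plain split on commas.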
alastairp
lucifer: another quick one which I noticed when using the data
jasje_
can you just give me the "keywords" you remember for the stuff to clean up
so i can get an idea and compare with musicbrainz
what stuff to remove and what not
lucifer
alastairp: looks good
alastairp
nice
BrainzGit
[listenbrainz-server] alastair closed pull request #2348 (master…canonical-data-dump-csv): Return artist mbids as comma separated instead of postgres array format https://github.com/metabrainz/listenbrainz-serv...
[listenbrainz-server] alastair reopened pull request #2348 (master…canonical-data-dump-csv): Return artist mbids as comma separated instead of postgres array format https://github.com/metabrainz/listenbrainz-serv...
[listenbrainz-server] alastair merged pull request #2348 (master…canonical-data-dump-csv): Return artist mbids as comma separated instead of postgres array format https://github.com/metabrainz/listenbrainz-serv...
alastairp
ups
jasje_
akshaaatt: ill should keep repositories and data classes intact tho right?
agatzk joined the channel
i should**
BrainzGit
[listenbrainz-server] alastair merged pull request #2340 (master…canonical-dump-connection): Add option to select the database connection to dump canonical data from https://github.com/metabrainz/listenbrainz-serv...
alastairp
finally!
lucifer: I'll make a release with these changes if that sounds good
jasje_
ill keep the repos which i think could be used in future
most of the stuff ill just remove
alastairp
ok, at the moment I'll just use this release for cron and testing the dumps, let me coordinate with lucifer and we can do a .1 release for the website
mayhem
ah, ok.
alastairp
bitmap: (or yvanzo, reosarevok?): is there a way I can tell if sshd-musicbrainz-fullexport on aretha is currently receiving an rsync dump? `ps` shows nothing running in the container, so I suspect it's fine
lucifer: quick PR in docker-server-configs related to this too
yvanzo
good question
alastairp
I need to restart it to install a new key
yvanzo
let me check
alastairp
or I guess if we don't want to restart it I could copy the key contents in manually, but that doesn't let me verify that the PR that I opened works correctly
ArjunM
apologies, akshaaatt. i'll be contributing to bookbrainz
akshaaatt
Sounds great ArjunM ! You should definitely join the #bookbrainz channel and discuss things in detail there. The best way to start is by understanding the codebase and kicking things off with a PR or two :) There’s tons of things you could explore in the project and have a look at the bug tracker for issues to fix or discuss.
yvanzo
alastairp: don't have anything better to suggest than 'ps aux' atm
alastairp
thanks yvanzo, restarting the service now
ArjunM
will head right there, akshaaatt!
lucifer
mayhem: i think there's a bug in ds hoster code.
i'll put a temp fix in the patch and labs api endpoint for now.
mayhem
k.
DSH needs some love in any case.
akshaaatt
ArjunM: monkey should be you PoC. But go through the docs first and try to exhaust all the available information before making conclusions. That being said, genuine doubts are welcome 24*7 !
Your*
ArjunM
yes, got it
yvanzo
lucifer: CB security alerts will need a lighthouse keeper for the next months, would you agree to replace alastairp?
lucifer
sure yvanzo
alastairp
🙏 thank you
yvanzo
alastairp: should I reassign the current CB security alerts too?
alastairp
yes please
yvanzo
Done, thank you both!
lucifer
mayhem: dataset fetcher element code in troi has some special casing for artist_mbid which is creating troubles
bitmap
alastairp: if there's no rrsync process it should be fine
alastairp
yvanzo: yes, I think that's reasonable
thanks bitmap
bitmap: related, not sure if you saw earlier discussion about the proliferation of ssh keys in that container - it seems that it's because we use `command` in a pubkey in authorized_keys to run `rrsync` which limits the connecting client to a particular directory
we're already up to 4 keys for ListenBrainz, not sure if you know the original motivation behind that, but it could be something to clean up and improve
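A sketch of why the keys multiply, assuming each authorized_keys entry uses a forced `command` that runs `rrsync` against one directory. The helper `authorized_keys_entry`, the example key material and the choice of extra restriction options are assumptions for illustration; the real entries in that container are not shown here.

```python
def authorized_keys_entry(public_key, directory, read_only=False):
    """Build one authorized_keys line that pins a key to one rrsync directory.

    Assumptions: `rrsync` is on the PATH of the sshd container, and the
    listed no-* options are acceptable for the deployment.
    """
    flag = "-ro " if read_only else ""
    command = f"rrsync {flag}{directory}"
    options = (
        f'command="{command}",'
        "no-agent-forwarding,no-port-forwarding,no-pty,no-X11-forwarding"
    )
    return f"{options} {public_key.strip()}"


def entries_for_directories(keys_by_directory):
    # One key per directory: the forced command fixes the directory,
    # so N upload locations need N distinct keys.
    return [authorized_keys_entry(key, path) for path, key in keys_by_directory.items()]
```

Because the directory is baked into the forced command, a single key can only ever reach a single directory, which is why moving volume locations or adding a wrapper script are the two ways discussed to get down to one key per project.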
jasje_ has quit
lucifer
mayhem: it's the popular recordings element that is erroring! not the labs api one