put perhaps as an image or something, to prevent spambots uh, reading it
2022-06-16 16744, 2022
alastairp
CatQuest: I don't think that's an issue - the trick is to prevent a bot from finding the site, seeing that it's a mediawiki, and then trying to create an account automatically
2022-06-16 16721, 2022
alastairp
99.9% of spammers who encounter the slightest amount of resistance are going to move on to the next site and not bother. I bet no one will even try and load the create user page and see why it failed
2022-06-16 16739, 2022
CatQuest
hmmmm
2022-06-16 16741, 2022
Pratha-Fish
alastairp: i.e. 0.301% of ALL UNIQUE rec-MBIDs are unknown. (Don't belong to the recording table OR have a valid redirect.)
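A minimal sketch of how a figure like the one above could be computed. It assumes the scrobbled recording MBIDs, the recording table's MBIDs, and the redirect MBIDs are each available as plain collections of strings. The function and argument names are hypothetical, not taken from the actual notebook.

```python
def unknown_mbid_fraction(scrobbled_mbids, recording_mbids, redirect_mbids):
    """Return (unknown MBIDs, fraction of unique scrobbled MBIDs that are unknown).

    An MBID counts as unknown when it is neither in the recording table
    nor covered by a redirect. All three inputs are hypothetical stand-ins.
    """
    unique = set(scrobbled_mbids)
    known = set(recording_mbids) | set(redirect_mbids)
    unknown = unique - known
    fraction = len(unknown) / len(unique) if unique else 0.0
    return unknown, fraction


# Toy data only, not real MBIDs.
scrobbled = ["a", "a", "b", "c", "d"]
recordings = ["a", "b"]
redirects = ["c"]
unknown, fraction = unknown_mbid_fraction(scrobbled, recordings, redirects)
print(f"{fraction:.3%} of unique rec-MBIDs are unknown: {sorted(unknown)}")
```

Deduplicating first matters here, because the percentage is over unique MBIDs rather than over all scrobbles.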
2022-06-16 16710, 2022
CatQuest
huh i'd be interested in a report of those
2022-06-16 16714, 2022
alastairp
Pratha-Fish: perfect! (although 0.3% is higher than the 0.2 that I computed yesterday :(, ohwell)
2022-06-16 16736, 2022
alastairp
CatQuest: yeah, because at some point in time lastfm thought that they were valid recordings. so, were they deleted from MB?
2022-06-16 16757, 2022
Pratha-Fish
Still reasonable ig :))
2022-06-16 16701, 2022
alastairp
Pratha-Fish: please make a list of those mbids and put them in a pastebin for CatQuest
could you recompute this including the timestamp of the first scrobble with that id? (it'd be great to take the number and convert it to a full datetime representation too)
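A rough sketch of the recomputation asked for above: the earliest scrobble per unknown MBID, converted to a full datetime and sorted. It assumes the scrobble timestamp is a Unix epoch value in seconds and that scrobbles are available as (timestamp, mbid) pairs. Both are assumptions, and the names are hypothetical.

```python
from datetime import datetime, timezone


def first_scrobble_times(scrobbles, unknown_mbids):
    """Return [(mbid, first_scrobble_datetime)] sorted oldest first.

    scrobbles: iterable of (unix_timestamp_seconds, mbid) pairs (assumed layout).
    unknown_mbids: set of MBIDs to report on.
    """
    first = {}
    for ts, mbid in scrobbles:
        # Keep only the smallest timestamp seen for each unknown MBID.
        if mbid in unknown_mbids and (mbid not in first or ts < first[mbid]):
            first[mbid] = ts
    return sorted(
        ((mbid, datetime.fromtimestamp(ts, tz=timezone.utc)) for mbid, ts in first.items()),
        key=lambda item: item[1],
    )


# Toy data only.
scrobbles = [(1340000000, "a"), (1330000000, "a"), (1350000000, "b"), (1360000000, "c")]
for mbid, when in first_scrobble_times(scrobbles, {"a", "b"}):
    print(mbid, when.isoformat())
```

A plain dictionary pass like this is a single loop over the scrobbles, so it stays simple without any vectorised lookup.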
2022-06-16 16745, 2022
Pratha-Fish
Ah that first column is residual index. give me a sec
Pratha-Fish: this should be pretty fast, even without using a vectorised lookup
2022-06-16 16755, 2022
Pratha-Fish
Let's give it a shot
2022-06-16 16752, 2022
alastairp
I'm not sure if this will give interesting results - because I don't know if lastfm did the recording lookup when the scrobble was added, or if it was re-processed in bulk at some later time
2022-06-16 16709, 2022
CatQuest
... dang what is the wiki page where people put their names for the meeting
Pratha-Fish: btw, generally we don't put data files in git repositories because if they change often then the size of the repository gets larger and larger
2022-06-16 16740, 2022
alastairp
in this specific case since it's a testing/experiment repo this isn't as much of an issue
2022-06-16 16742, 2022
Pratha-Fish
Oh
2022-06-16 16753, 2022
alastairp
but we try very hard to not put this kind of stuff in the main code repos for our projects
2022-06-16 16757, 2022
Pratha-Fish
So where should it be hosted?
2022-06-16 16702, 2022
alastairp
great question
2022-06-16 16720, 2022
alastairp
for now it would be OK to just upload it to the irccloud pastebin
2022-06-16 16724, 2022
alastairp
or gist
2022-06-16 16735, 2022
Pratha-Fish
gist sounds good
2022-06-16 16751, 2022
alastairp
eventually we could set up some hosting space on wolf so that we can put files there and give them a public url
2022-06-16 16719, 2022
Pratha-Fish
alastairp: Please also give me a mini tutorial on how to do that if possible!
2022-06-16 16759, 2022
CatQuest
I mean irccloud paste works fine too
2022-06-16 16718, 2022
Pratha-Fish
Is IRCCloud paste persistent though? (Not that we need persistent paste anyway)
Apparently irccloud paste doesn't support files larger than 50 kB
2022-06-16 16711, 2022
Pratha-Fish
^ CatQuest alastairp
2022-06-16 16709, 2022
alastairp
yes, irccloud paste is permanent. If you click the "attach" button to the right of the input box and then "Text snippets" you see a list of all the ones you created, and you can delete them if you want
2022-06-16 16735, 2022
alastairp
I suspect that if you had a larger file you could just upload it, the limit there should be larger
[musicbrainz-docker] yvanzo merged pull request #230 (master…fix-sir-dev-deps): Replace virtualenv with user site-packages directory for SIR development https://github.com/metabrainz/musicbrainz-docker/…
2022-06-16 16717, 2022
alastairp
Pratha-Fish: see how easy it is to make that kind of statement ("none of these are younger than 9 years") when we sort the data?
2022-06-16 16742, 2022
alastairp
so all of the things I have been asking for are to make it easy to take a glance at it and make some sort of statement
2022-06-16 16713, 2022
alastairp
CatQuest: so now I'm tempted to import an old 2012-era dump (do those exist?!) and see how many of them are there
I'll try to bring in some similar stats from the next time too
2022-06-16 16721, 2022
CatQuest
alastairp: hmm...
2022-06-16 16733, 2022
CatQuest
would it even be possible to just check with archive
2022-06-16 16747, 2022
CatQuest
i'm sure it has 2012 era snapshots of mb pages...
2022-06-16 16755, 2022
lucifer
chinmay: iiuc, for instance: the listens page for any user shows feedback for current user if user is logged in but if user is not logged in then feedback for the user whose page is open is shown.
2022-06-16 16703, 2022
lucifer
alastairp: i see, at one place in LB/CB also we were using copy_expert. it has been around for a while.