those similarities are pretty good, but is there any way for me to check them in DSH?
lucifer[m]
mayhem: not at the moment.
yes MLHD similarity data.
mayhem[m]
ok, let me think up a few examples.
lucifer[m]
i am pretty sure there is some bug in the data because i got 42M similarity pairs which is more than the 14M we get from LB data but still not as much as I'd expect.
it may not be the pair count to judge this -- perhaps summing the counts to see if all tracks are accounted for?
can you use session_based_days_7500_session_300_contribution_5_threshold_10_limit_100_filter_True_skip_30 ?
because that is the dataset we have right now, so that would be a good comparison
lucifer[m]
sure
i need to make some changes to the code but we can run this entirely on spark cluster in ~10-12 hours.
so we can run experiments easily with this data.
possibly even ~6 hours but i'll have to run some tests.
pite joined the channel
taking it one chunk at a time, using zstd compression, changing mbids to ids, breaking the data generation into two stages.
helped ensure that we are able to process the data on the existing cluster. and its crazy because the final data (ids not mbids) is just a 180 MB parquet atm.
mayhem[m]
oh wow. you calculated this without the big new VM?
lucifer[m]
yes
mayhem[m]
amazing. this is so much better for being able to iterate this data
lucifer[m]
yup indeed
i had spent a week on figuring out to make this work lol but just after i asked for the vm yesterday, i figured out how to fix the issue..
BrainzGit
[listenbrainz-server] 14MonkeyDo merged pull request #3241 (03master…hide-apple-music-export): Disable "Export to Apple Music" option if user not signed in, or into Apple Music https://github.com/metabrainz/listenbrainz-serv...
vardhan joined the channel
outsidecontext[m has quit
vardhan has quit
krishnacosmic[m] has quit
GautamShorewala[ has quit
jasje[m] has quit
yvanzo[m] has quit
kellnerd[m] has quit
m1gr has quit
m1gr joined the channel
minimal joined the channel
BobSwift[m]
When do you expect the new country code from MBS-12170 to be available in the json output on musicbrainz.org (or beta.musicbrainz.org)? I'm looking into adding this into the variables available in Picard, and I want to make sure I'm extracting and testing properly. Thanks.
augh, co-opting the artist *country* because releases have useless information is s...
now adding it to the api I'm not against, but the real issue here is that we need to stop/revert/codify to prevent these "205 country" monster releases
sigh. it was literally what the [worldwide] thing was supposed ot be *for*
sigh
Sophist-UK has quit
Sophist-UK joined the channel
Kladky has quit
petitminion joined the channel
bitmap[m]
<BobSwift[m]> "When do you expect the new ..." <- I believe reosarevok is planning to update the beta server tomorrow
petitminion has quit
petitminion joined the channel
petitminion has quit
<BobSwift[m]> "When do you expect the new ..." <- I've deployed this on the test server in the meantime