no, because in this query filter for listens which have a matching recording mbid in recording artist table. that filters out listens which don't have a recording mbid.
holycow23[m]
yeah but shouldn't all the listens have a mbid?
i mean if not all atleast like the most of it
right now roughly 10% of the listens have a matching mbid
lucifer[m]
no, all listens have a msid but not all listens have a mbid.
ah that is probably because your spark wolf setup only has the sample dumps.
again, MB db on wolf is not the same as your spark setup on wolf.
MB db has the full dataset, your spark setup only sample.
holycow23[m]
so the discrepancy is fine right
lucifer[m]
yes
holycow23[m]
is there a possibility to get the entire db or like some more content cause the results are too small to test and verify