#metabrainz

      • Pratha-Fish
        return nan```
      • lucifer
        mayhem: not sure i understand what you mean. can you explain?
      • btw the normalized tables just finished, its in `spotify_cache` schema on gaga if you want to look
      • alastairp
        yvanzo: keys are still at the office
      • zas
        yvanzo: come to the office first
      • lucifer
        mayhem: there's 145 rows at least where spotify artist id is same but the artist name is different. i guess artist ids are not unique across artist credits but also weird that there are only 145 such cases in the data. https://www.irccloud.com/pastebin/58Td9YOk/
      • yvanzo
        zas, alastairp: Can anyone bring it to me at the station?
      • lucifer
        if its correct then the number is small enough to ignore i think. also many of these would go away with unidecode and lowercasing.
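The check described above (same Spotify artist id, different artist name) and the effect of lowercasing plus accent stripping can be sketched roughly as follows. This is a minimal illustration on made-up rows, not the query that was actually run: the function names and sample data are hypothetical, and `unicodedata` from the standard library stands in for `unidecode`, which does fuller transliteration.

```python
import unicodedata
from collections import defaultdict


def normalize_name(name: str) -> str:
    """Lowercase and strip accents (stdlib stand-in for unidecode; an assumption)."""
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold().strip()


def ids_with_conflicting_names(rows, normalize=lambda s: s):
    """Return {artist_id: set of names} for ids seen with more than one distinct name."""
    names_by_id = defaultdict(set)
    for artist_id, name in rows:
        names_by_id[artist_id].add(normalize(name))
    return {aid: names for aid, names in names_by_id.items() if len(names) > 1}


# Hypothetical (artist_id, artist_name) pairs, invented for illustration.
rows = [
    ("a1", "Beyoncé"),
    ("a1", "beyonce"),
    ("a2", "Sigur Rós"),
    ("a2", "Sigur Rós"),
    ("a3", "Prince"),
    ("a3", "The Artist"),
]

raw_conflicts = ids_with_conflicting_names(rows)
normalized_conflicts = ids_with_conflicting_names(rows, normalize_name)
print(sorted(raw_conflicts), sorted(normalized_conflicts))
```

On this toy data the case and accent variants of `a1` collapse after normalization, while `a3` remains a genuine conflict.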
      • alastairp
        yvanzo: one sec, we're organising it
      • mayhem
        lucifer: tracks in Spotify have a list of artists, right? I just want to make sure that we keep them separate and not conflated them
      • -d
      • lucifer
        mayhem: yes, i added a `rel_track_artist` table. which has a track_id and artist_id. so we can get the artists for each track separately.
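A rough sketch of how a `rel_track_artist` link table keeps each track's artists separate, using an in-memory SQLite database. Only the table name and its `track_id` and `artist_id` columns come from the discussion; the `track` and `artist` tables, their columns, and the sample rows are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- track and artist layouts are hypothetical; only rel_track_artist
    -- (track_id, artist_id) is described in the chat.
    CREATE TABLE artist (artist_id TEXT PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE track (track_id TEXT PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE rel_track_artist (
        track_id TEXT NOT NULL REFERENCES track (track_id),
        artist_id TEXT NOT NULL REFERENCES artist (artist_id),
        PRIMARY KEY (track_id, artist_id)
    );
    """
)
conn.executemany("INSERT INTO artist VALUES (?, ?)",
                 [("a1", "Artist A"), ("a2", "Artist B")])
conn.executemany("INSERT INTO track VALUES (?, ?)",
                 [("t1", "Duet"), ("t2", "Solo")])
conn.executemany("INSERT INTO rel_track_artist VALUES (?, ?)",
                 [("t1", "a1"), ("t1", "a2"), ("t2", "a1")])


def artists_for_track(conn, track_id):
    """Return the artist names linked to one track, sorted by name."""
    cur = conn.execute(
        """
        SELECT a.name
          FROM rel_track_artist rta
          JOIN artist a ON a.artist_id = rta.artist_id
         WHERE rta.track_id = ?
         ORDER BY a.name
        """,
        (track_id,),
    )
    return [row[0] for row in cur.fetchall()]


print(artists_for_track(conn, "t1"))
print(artists_for_track(conn, "t2"))
```

Because each (track, artist) pair is its own row, two tracks from the same album can carry different artist lists without being conflated.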
      • alastairp
        yvanzo: aerozol is going directly to the lodging
      • mayhem
        Ah, yes, right. All good then.
      • lucifer
        if you have an example at hand where tracks from same album have different artist lists, i can check the tables to confirm
      • yvanzo
        Thanks
      • agatzk has quit
      • agatzk joined the channel
      • lucifer
        mayhem: takes 13mins to execute.
      • i guess may take 30mins if cache is not warm but probably not more. we'll see in due time anyways.
      • mayhem
        Wow, nice
      • lucifer
        i'll update the branch with this query and we can try running the PR then
      • zas
        for dinner tonight (for those interested) we meet at the office around 19:30
      • mayhem
        so, we've just spent the last hour talking about OAuth and we have a bit of a plan for tomorrow.
      • are you available at 11am BCN time to join the "fun"?
      • lucifer
        yup i'll be around
      • mayhem
        the basic idea is that we'd update the OAuth lib in meb.org and hard code a couple of user accounts with a couple of scopes.
      • then setup a new VM with meb.org on and another VM with an lb sandbox and then test that things work.
      • then bring up an instance of MB and have it auth to the meb VM and slowly bring things into a prototype.
      • I think it might be best to include you via zoom in our discussions in the morning.
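The "hard code a couple of user accounts with a couple of scopes" step of the plan above might look something like this for a prototype. It is purely a sketch: the account names, scope strings, and helper function are all hypothetical and do not reflect the actual meb.org code or its OAuth library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PrototypeUser:
    username: str
    scopes: frozenset


# Hypothetical hard-coded accounts and scope names, for prototype testing only.
HARDCODED_USERS = {
    "test_listener": PrototypeUser("test_listener", frozenset({"profile", "listens:write"})),
    "test_reader": PrototypeUser("test_reader", frozenset({"profile"})),
}


def is_authorized(username, required_scopes):
    """True if the user exists and holds every required scope."""
    user = HARDCODED_USERS.get(username)
    if user is None:
        return False
    return set(required_scopes) <= user.scopes


print(is_authorized("test_listener", ["listens:write"]))
print(is_authorized("test_reader", ["listens:write"]))
```

A fixed table like this is enough to test that the sandbox services honour scopes before a real user table exists.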
      • lucifer
        there's some code here which might be useful in updating oauth in meb.org https://github.com/metabrainz/metabrainz.org/co...
      • iirc it should be close but i never got around to testing it.
      • mayhem
        oh great. I think the meb.org work is going to fall on you and I anyway, so that should be fine.
      • lucifer
        makes sense
      • mayhem
        once we get the VM stood up, we can start the actual work on creating a new user table in meb.org
      • lucifer
        there's a test.meb available which also runs a different database than prod so meb.org side setup should be easy.
      • for the VM, do we need it? we can just repurpose test.lb i think.
      • zas
        yvanzo: are you in?
      • yvanzo
        zas: yes
      • yvanzo is heading to the HOLY now.
      • s/HOLY/office/
      • bitmap
        yvanzo are you at the Airbnb ?
      • yvanzo
        bitmap: no, at the office
      • lucifer
        mayhem: huh there are apparently ~108,000,000 rows in track table.
      • bitmap
        Yeah I saw. Sorry I opened your bedroom door thinking it was a bathroom :)
      • yvanzo
        But I'm stuck in front of her door
      • mayhem
        lucifer: yeah, I knew the number had to be very big.
      • lucifer
        i wonder what's going on with that. that's 3-4x the number of MB tracks.
      • bitmap
        I’m on my way to the office again shortly
      • mayhem
        spotify and apple music are in a quantity arms race. apple just announced that they have 100m tracks.
      • a vast amount of that are duplicates or listenable shit.
      • lucifer
        yeah indeed. 😞
      • anyways 2-3 hours it'll be done
      • mayhem
        noice!
      • !m lucifer
      • BrainzBot
        You're doing good work, lucifer!
      • yvanzo
        bitmap: ok, waiting for you
      • elgranRoble joined the channel
      • BrainzGit
        [musicbrainz-server] reosarevok opened pull request #2675 (master…more-flow-strict): Make more files flow strict or strict-local https://github.com/metabrainz/musicbrainz-serve...
      • lucifer
        mayhem: another thing, i was thinking to rework LB#2167 as a troi patch. thoughts?
      • mayhem
        to make playlists from collections? seems a good use yes.
      • lucifer
        yup
      • 👍
      • reosarevok
        Also, everyone's tiny project is by now in Spotify and iTunes
      • lucifer
        yeah, makes sense
      • reosarevok
        4 times the amount of tracks in MusicBrainz is actually a lot less than I would expect
      • bitmap
        Yvanzo: the rest of the group is supposed to meet back there at 7:30 I think?
      • lucifer
        but just 145 artists having multiple artist credits in 108M tracks is surely mind boggling.
      • reosarevok
        Wait, what is that supposed to mean?
      • That only 145 appear in different combinations? then that cannot be right I don't think
      • reosarevok
        Oh, so like the same artist being mapped to two ways of writing it?
      • lucifer
        i mean same spotify id but different artist name on some tracks. iiuc i am trying to compare it to artist credits in MB
      • reosarevok
        That's still weird, but less weird than what I was thinking
      • I didn't know that was a *thing* even
      • Maybe it's the result of merging artists or something?
      • lucifer
        how an artist may be credited with different names on different tracks so we have artist credit and artist credit name tables.
      • assuming spotify_id is analogous to artist.gid in MB then i'd expect a lot more results here.
      • alastairp
        lucifer: 11am in Barcelona we'll start our mini oauth hack day
      • lucifer
        sounds good alastairp. i'll be around
      • alastairp
        we wrote a task list, we'll share it tomorrow
      • great, see you tomorrow!
      • maybe we can put you on a video call to be along side us "in person"? :)
      • lucifer
        yes that sounds good.
      • zas
        I'm at the office
      • darkstarx joined the channel
      • darkstardevx has quit
      • yvanzo
        zas: bitmap and I are on our way :-)
      • petitminion_ has quit