#musicbrainz

/

      • luks
        it doesn't take that long to crowd-source the database
      • hrglgrmpf
        luks: this is exactly what would have to be done to be 100% independent
      • but this would be a slap in the face... so I think it won't be done
      • if I get it right, echonest gets it's music from 7digital and computes fingerprints out of it?
      • luks
        right know they have a small database
      • yes, they are trying to integrate fingerprints from the 7digital music collection
      • but it's not done yet
      • I've heard they want to allow user submissions too
      • hrglgrmpf
        but they don't their database matched with MBIDs...
      • reosarevok
        Does anyone have an userscript to allow merging recordings / works from the search results?
      • luks
        no
      • that's what they want MB to do for them
      • hrglgrmpf
        how is that possible?
      • by name matching?
      • luks
        by people submitting data to MB
      • just like with PUIDs
      • but anyway, I don't see any of this happening in near future
      • hrglgrmpf
        ah, but then we still can only submit stuff they already scanned?
      • luks
        who knows, I don't think anybody have figured out how they want to do it
      • MB has echoprint support without even knowing know "echoprints" really are :)
      • xlotlu joined the channel
      • voiceinsideyou joined the channel
      • hawke_
        luks: Isn’t there very little difference between echoprints and PUIDs, from a MB perspective?
      • luks
        hawke_: echoprints are going to give you way more results
      • hawke_
        Right, but it’s still just a number associated with one or more recordings.
      • It may be “more” more frequently than with PUIDs, but still the same basic thing
      • luks
        depends on your look at it
      • you don't have a way to tell "this is a radio edit" or "this is the album version" with the Echoprint
      • hawke_
        Really? It’s that fuzzy?
      • So it will be “more” *much* more often than with PUIDs.
      • luks
        I believe it will be useless for automated tagging
      • reosarevok
        luks: so what will it be useful for?
      • luks
        identifying songs
      • reosarevok
        I mean in MB
      • luks
        nothing, IMO :P
      • hawke_
        Closer to identifying works, than recordings, no?
      • luks
        which is why I don
      • 't see it as a competition
      • reosarevok
        hawke_: not really, if remixes and so are not new works
      • luks
        yes
      • well, something between recordings and works
      • jacobbrett
        luks: So, can we use acoustid at some point to replace PUIDs, perhaps?
      • luks
        jacobbrett: that's my plan
      • hawke_
        reosarevok: Well, it all depends on how fuzzy it really is. Is there any real measurement/comparison?
      • reosarevok: It might depend on how different a remix is.
      • luks
        well, currenty it's so fuzzy that will more often give false positives than true positives :P
      • which will of course change as they build the database
      • hawke_
        luks: Why would it change significantly?
      • luks
        they will change the matching algorithms
      • voiceinsideyou1 joined the channel
      • and of course if they have the correct matches, they will score higher
      • voiceinsideyou joined the channel
      • hawke_
        luks: Why would it change? Can a given song get a different echoprint at different times somehow?
      • ijabz joined the channel
      • luks
        hawke_: I don't know what will they end up using
      • but currently it takes the closest match within some thresholds
      • which means the ID can change if there happens to be a better match in the database
      • but I don't know how will they build the database
      • I expect many things will change before it actually starts being useful
      • hawke_
        luks: I would think both echoprint and acoustid would be very similar in that regard, where it’s a matter of tuning to find the correct threshold of difference for assigning a new ID.
      • ijabz
        hawke, acousticids are 'designed' to match just one recording of a song, whilst still allowing different encoding of that recording, and for the most part works
      • not sure that is the aim of echonest ids
      • pbryan
        ijabz: Yes, that's the intent of the Echonest "TR" IDs.
      • "We use the TR identifiers to uniquely identify a single audio recording. Different rips, silence at the beginning and end, and analog/digital rips will all result in the same identifier.” —Alastair Porter
      • hawke_ scratches his head at a credit for “six-string guitar” as distinct from "acoustic guitar" and "12-string guitar"
      • hawke_
        Isn’t six-string kinda … standard?
      • derwin
        12-string guitar is a different instrument than the six string..
      • jacobbrett
        luks: Is it normal for TagLib to throw up warnings about idv2.4 and TDAT, and "MPEG::Header::parse() -- Invalid sample rate."?
      • derwin
        but "acoustic guitar" means 6 string
      • srotta
        But is the six-string acoustic or electric?
      • derwin
        LOL
      • nikki hates having instruments as a tree :P
      • hawke_
        derwin: 12-string is definitely different, yeah.
      • nikki
        and dammit. hrglgrmpf already left
      • hawke_
        But I would think that an acoustic guitar would be implicitly 6-string.
      • But why not just say “electric guitar” if it’s electric
      • And there are also credits for ”electric guitar”
      • Oh well, I guess just “guitar” it is then.
      • srotta
        Who would want to avoid ambiguity when there's a chance for it.
      • luks
        jacobbrett: it doesn't do that if it's compiled in release mode
      • jacobbrett: but otherwise yes, it's normal
      • jacobbrett
        luks: Ah, that was output after I ran it via terminal
      • Ubuntu PPA
      • luks
        pbryan: that only says that will get the same ID, not what will not
      • kepstin-laptop is curious what would happen if he took the musicbrainz server software, the GNU FM server software, and smashed them together violently.
      • nikki
        bits of perl code everywhere I imagine :P
      • luks
        and some php
      • kepstin-laptop
        yeah, looks like they use php :/
      • luks
        I really don't think libre.fm had a good start
      • nikki imagines kepstin-laptop building somehting under switzerland to smash code together
      • nikki
        I gave up on libre.fm, they didn't seem to be going anywhere
      • luks
        "just clone last.fm with all the problems" is not a good goal
      • nikki
        heh, yeah
      • hawke_
        libre.fm is also kind of a pain to use. :-/
      • kepstin-laptop
        nah, too far away. we got lots of space in canada for me to build things, I can put it in an abandoned mine or something
      • well, first things first, I should see about getting a development copy of the musicbrainz server and DB up and running on my computer.
      • pbryan
        luks: Not sure I get your point.
      • nikki
        if they'd solved some of the last.fm problems, I might have been interested, but it was just like last.fm with less data and less js
      • kepstin-laptop
        I wonder if they ever actually finished their api for getting data back out
      • luks
        pbryan: radio edit and album version of the same song will usually also get the same ID
      • nikki: and slower :)
      • kepstin-laptop
        i found it amusing, given their listed goals, that they didn't support that initially.
      • ianmcorvidae joined the channel
      • pbryan
        luks: Usually?
      • kepstin-laptop
        huh, memcached is required as well?
      • seems like a funny thing to install on my laptop :)
      • kepstin-laptop does so anyways
      • nikki doesn't have it installed
      • luks
        are you making the mistake of installing mb_server outside of a VM?
      • kepstin-laptop
        probably, yeah
      • well, all I really want is the db, I should probably just not bother with the server.
      • luks
        mbslave is usually a less painful way to get the db
      • kepstin-laptop
        virtual machine probably is a better idea, tho :)
      • nikki
        mbslave :D
      • kepstin-laptop leaves postgres installed anyways, he might think up something better to do with that.
      • kepstin-laptop
        huh, so that's just the import and replication scripts, eh? handy.
      • srotta
        There should only be one, standardized RDBMS.
      • luks
        you mean ORACLE? :)
      • srotta
        I have several Postgres versions and some Mysql crap all over my server.
      • nikki
        kepstin-laptop: and it doesn't require half of cpan to run ;)
      • and it requires far less disk space to import data
      • kepstin-laptop works in a company that makes big telecom software that runs on oracle :/
      • srotta
        Luckily no Oracle here.
      • luks
        I was working on a large migration from postgresql to oracle at my last job
      • srotta
        Haven't had to touch it for a while. 8)
      • luks
        I was crying inside
      • srotta
        And laughing outside?
      • ijabz
        kepstin-labtop I just run the database
      • kepstin-laptop
        just to get a standalone database set up on a linux box to test stuff nowadays requires something like 12gb storage for the installation.
      • luks
        yep
      • nikki
        kepstin-laptop: any idea who's been putting these (cv ...) thingies in the join phrase? I guess it's not you but maybe you've seen someone doing it
      • kepstin-laptop
        i've done some of it
      • nikki
        really?
      • I thought you were linking two artists
      • kepstin-laptop
        the "[character] (cv [artist])"? yeah.
      • nikki
        I mean "[character] (cv artist)"
      • kepstin-laptop
        not [character (cv artist)] - i don't do that.
      • huh
      • I dunno who does that
      • apparently some people aren't big fans of linking both artists, because they don't want the character song stuff in the artists's own discography or something :/
      • kepstin-laptop wants both artists linked, so it shows up in the real artist's own discography :)
      • reosarevok would think that is the most interesting part to show it…
      • hmm. we probably could actually use some sort of standard cv artist style to normalize the artist credits on recordings, we have variations of "character (CV artist)" "character (c.v. artist)" "character (artist)", etc.
      • reosarevok
        AAAAAAND we have a new RFC in the works!
      • Congratulations, kepstin-laptop
      • kepstin-laptop
        not counting the other different variants like "character starring artist" which should be left as-is, probably :/
      • hey, my last two rfcs have just kind of dies of old age :/
      • my 'official homepage' for release groups died when one person wanted to 'avoid making a link farm' and nobody else cared :/
      • reosarevok
        kepstin-laptop: was it vetoed or just discouraged?
      • kepstin-laptop
        never got a +1 to get out of RFC stage.