#metabrainz

      • zas
        change paths (/var/ftp to /home/ftp)
      • reosarevok
        Mr_Monkey: np. I'm sure I can think of more :p
      • Mr_Monkey
        reosarevok: Yeah, do my work for me, see if I care :p
      • zas
        70% of hits on the mb website yesterday were from (verified) googlebot (1277045 hits), and yet searching for a small unknown band that exists in the db and is indexed (a search with site:musicbrainz.org shows it) doesn't turn it up at all over 15 pages of results... discogs appears on the 5th page, qobuz on the 7th, official sites on the 1st/2nd pages, but musicbrainz absolutely nowhere over 15 pages. It sucks.
      • any SEO expert around ? ;)
      • reosarevok
        Mr_Monkey: if you do the programming, I'm happy to do the asking :p
      • LordSputnik
        Mr_Monkey: we should do a more general database cleanup, as well as fixing the invalid relationships
      • Things like deduplicating alias/identifier tables
      • until yesterday, the site code re-inserted all the aliases for every entity every time it was edited due to a code bug
      • Mr_Monkey
        LordSputnik: Yes, that sounds sane :) Are there already scripts for cleanup operations?
      • Oof.
      • LordSputnik
        Nope, not currently
      • We should write some and put them in bb-sql/scripts
      • Mr_Monkey
        Yep
      • LordSputnik
        Yeah, it was due to the set handling code, which was fixed and moved into bb-data this summer. It's now unit tested to make sure nothing gets duplicated
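A rough illustration of the kind of cleanup discussed above: dropping alias rows that were re-inserted on every edit. The `Alias` fields and the `dedupe_aliases` helper are assumptions made for this sketch; they are not the actual BookBrainz schema or the bb-data set handling code.

```python
from typing import Iterable, List, NamedTuple, Set


class Alias(NamedTuple):
    # Hypothetical row shape; the real alias table may have other columns.
    entity_id: str
    name: str
    sort_name: str
    language: str


def dedupe_aliases(rows: Iterable[Alias]) -> List[Alias]:
    """Drop exact duplicate rows, keeping the first occurrence of each in order."""
    seen: Set[Alias] = set()
    unique: List[Alias] = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique
```

A real cleanup script would presumably do the equivalent in SQL against the alias tables; this only shows the intended result, one surviving row per distinct alias.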
      • Leo_Verto
        KassOtsimine, sorry, I didn't realize that
      • KassOtsimine
        no prob
      • i was feeling grumpy :)
      • zas
        reosarevok: i just noticed we don't ask for a disambiguation when creating a new RG with the same name as an existing one, as we do for artists. Asking for one would reduce duplicates a lot, is there any suggestion/ticket you're aware of for this ?
      • reosarevok
        We mostly don't enter rgs by hand, but automatically
      • KassOtsimine
        zas: if on the same artist maybe. but else it could get pretty hairy quickly. other than that good idea!
      • zas
        i mean we can ask for disambiguation as well as confirmation when a new RG of the same name exists for the same artist
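The check zas proposes could be sketched roughly as follows: before creating a release group, look for existing ones with the same name under the same artist and, if any exist, ask for a disambiguation and a confirmation. The `ReleaseGroup` shape, the function names, and the case-insensitive comparison are all assumptions for illustration, not how the MusicBrainz release editor works.

```python
from typing import Iterable, List, NamedTuple


class ReleaseGroup(NamedTuple):
    # Hypothetical minimal shape for this sketch.
    artist_id: str
    name: str


def same_name_release_groups(
    artist_id: str, new_name: str, existing: Iterable[ReleaseGroup]
) -> List[ReleaseGroup]:
    """Return existing release groups by the same artist whose name matches new_name."""
    wanted = new_name.strip().casefold()
    return [
        rg
        for rg in existing
        if rg.artist_id == artist_id and rg.name.strip().casefold() == wanted
    ]


def needs_disambiguation(
    artist_id: str, new_name: str, existing: Iterable[ReleaseGroup]
) -> bool:
    """True when the editor should be asked to disambiguate and confirm."""
    return bool(same_name_release_groups(artist_id, new_name, existing))
```

Restricting the match to the same artist follows KassOtsimine's point that matching on name alone across all artists would get hairy quickly.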
      • Mr_Monkey
        LordSputnik: I'll create a ticket for it so it doesn't disappear. I still suck at SQL and I feel like I would fuck it up without realising. Would you have the bandwidth to take care of those cleanup operations (not the creator deduping or anything like that)?
      • KassOtsimine
        it might even help against duplicate releases
      • zas
        just created those (artist called releases "EP", perhaps it should be self-titled, but that's the same)
      • in general, i think we should have better safety checks to prevent common mistakes from happening (like creating a new RG when not needed, which happens VERY often)
      • or check packaging set to None on a physical release, or to anything but None on digital releases
      • chhavi_: ^^ included in your plan ?
      • KassOtsimine
        zas: well, not setting packaging on a physical release... often i don't know what packaging my dumb library has wrecked so i know not which it is
      • zas
        that's not what i'm saying
      • KassOtsimine
        oh? i misunderstood
      • zas
        CD packaging = None is VERY rare (we should warn about it, not forbid it)
      • KassOtsimine
        oh, not explicitly *not set* but explicitly set to *none*, ah
      • zas
        and digital release with packaging digipak is also uber-rare
      • KassOtsimine
        now i get
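The packaging sanity check described above could look something like this sketch. It warns rather than forbids, and it treats an unset packaging as fine, which is the distinction KassOtsimine and zas settle on. The `DIGITAL_FORMATS` set, the function name, and the use of the string `"None"` for the explicit "no packaging" value are assumptions for illustration.

```python
from typing import List, Optional

# Assumption: which medium formats count as digital for this sketch.
DIGITAL_FORMATS = {"Digital Media"}


def packaging_warnings(medium_format: str, packaging: Optional[str]) -> List[str]:
    """Return warnings (not errors) for unlikely format/packaging combinations.

    packaging=None means the field was left unset; the string "None" means the
    editor explicitly chose "no packaging".
    """
    warnings: List[str] = []
    if packaging is None:
        # Leaving packaging unset is always acceptable.
        return warnings
    is_digital = medium_format in DIGITAL_FORMATS
    if not is_digital and packaging == "None":
        warnings.append("Physical release with packaging 'None' is very rare.")
    if is_digital and packaging != "None":
        warnings.append(f"Digital release with packaging '{packaging}' is very rare.")
    return warnings
```

For example, `packaging_warnings("CD", "None")` yields one warning, while `packaging_warnings("CD", None)` yields none.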
      • zas
        same for capitalization rules, they have to be linked directly from the release editor (depending on selected languages), a new editor hardly knows about them.
      • LordSputnik
        Mr_Monkey: Yeah, I can do that at some point
      • Happy to have that assigned to me
      • zas
        Having better info from the start would reduce edits a lot
      • KassOtsimine
        :o I thought cap rules were! yes i agree
      • Mr_Monkey
        LordSputnik: Thanks, that would be very helpful. I don't think it's a huge rush as it doesn't currently keep anyone from using the site or database correctly, if I'm not mistaken.
      • reosarevok
        Merging! Merging! There are two Lovecrafts! ;)
      • I guess there is a ticket for that though
      • KassOtsimine
        oh no two lovecrafts?! that's like dividing by zero!
      • zas
        rather than merging entities, i'd like to prevent them from the start (whenever it makes sense), it reduces the amount of work for all, and the size of the db....
      • KassOtsimine rolls
      • reosarevok
        Both are needed ;)
      • zas
        of course
      • :))
      • KassOtsimine
        i remember *distinctly* saying something like this about recordings when NGS was happening
      • reosarevok
        But yeah, that too
      • KassOtsimine
        ;)
      • reosarevok
        KassOtsimine: not much to be done with that though was there? Other than risking merging the wrong recordings automatically
      • That was a mess we probably couldn't solve properly in any way :/
      • KassOtsimine
        ¯\_(ツ)_/¯
      • reosarevok
        LordSputnik, Mr_Monkey: what should happen with introductory notes by editors? New work too?
      • KassOtsimine
        oh shit that's a thing that's separate? D:
      • Mr_Monkey
        In the near future, yes, I think so. There isn't currently the correct work type, though.
      • The other option is to only have the relationship between a Creator and an Edition. I'm open to opinions.
      • ruaok
        huh. our "temporary" FreeDB gateway has been in operation for 11 years and is still getting 2 requests per second.
      • well, enough of that. :)
      • Mr_Monkey
        I will very soon open a big discussion about the BookBrainz schema and how everything fits. I know the discussion has been had before, but I want more :)
      • ruaok
        we kill the gateway and the next day brexit happens. should be a fun week.
      • Rotab
      • reosarevok
        :D
      • !m ruaok
      • BrainzBot
        You're doing good work, ruaok!
      • reosarevok
        Mr_Monkey: yeah, I'd like that too
      • Mr_Monkey
        reosarevok: I'm sure I've seen it suggested somewhere : I wish there was a way to link works with BBIDs ! https://www.youtube.com/watch?v=GHyMCFLQ2EE
      • reosarevok
        Mr_Monkey: you mean MusicBrainz works or?
      • Mr_Monkey
        Yes, sorry, MBIDs
      • LordSputnik
        reosarevok: I'd say make a sub work and link it to the parent work
      • (for editor notes)
      • reosarevok
        What if there's no parent work :)
      • (this is an anthology of short stories)
      • LordSputnik
        Oh, actually, make a work then link it to the edition
      • We could have a special relationship for that
      • Eventually I'd like to have the concept of sections of editions, and the ability to link works to those
      • KassOtsimine
        buh, don't we already have a "link to MB entity" link type? or maybe not a link, you can put in the mbid in the section for various codes, isbn etc
      • oh halp. i also have an anthology of short stories to add :s
      • LordSputnik
        reosarevok: yeah KassOtsimine is right, you can link to MB works in the identifiers section
      • reosarevok
        Sure :) But that only helps if they're the exact same
      • LordSputnik
        Mr_Monkey: ^
      • reosarevok
        I think Mr_Monkey's example was more "is the basis for MBID" :D
      • LordSputnik
        Isn't that better stored in MB?
      • Mr_Monkey
        Yes, a relationship of type Work inspired Song/Album
      • LordSputnik
        Oh right
      • No, there's no way of saying that at the moment
      • Mr_Monkey puts it at the bottom of The List
      • Technically possible, since relationships are a pair of uuids and a type, but we'd need to do some weird stuff to make it work
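To make the "pair of uuids and a type" remark concrete, here is a minimal sketch. The `Relationship` shape, the field names, and the example IDs are hypothetical and not the real BookBrainz tables.

```python
import uuid
from typing import NamedTuple


class Relationship(NamedTuple):
    # Hypothetical shape: two entity IDs plus a relationship type name.
    source_id: uuid.UUID
    target_id: uuid.UUID
    rel_type: str


def make_relationship(source_id: str, target_id: str, rel_type: str) -> Relationship:
    """Build a relationship from two UUID strings and a type name."""
    return Relationship(uuid.UUID(source_id), uuid.UUID(target_id), rel_type)


# Made-up IDs: a BookBrainz work (BBID) that inspired a MusicBrainz work (MBID).
inspired = make_relationship(
    "11111111-1111-4111-8111-111111111111",
    "22222222-2222-4222-8222-222222222222",
    "inspired",
)
```

Since BBIDs and MBIDs are both plain UUIDs, nothing in this triple records which database the target lives in; that may be part of the "weird stuff" a cross-database relationship would need to handle.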
      • reosarevok
        LordSputnik: sure, in this order it would, possibly :) Although I'm sure there are some which are the other way around
      • LordSputnik
        A good first step is relationships with areas, since BB areas are replicated MB areas
      • chhavi_
        zas: we have not reached editing interfaces, so not in plan yet
      • KassOtsimine
        reosarevok:
      • rsh7 joined the channel
      • ruaok
        iliekcomputers: when you got a sec, I wanna compare notes on spark/hadoop/yarn, etc.
      • the spark universe has so many moving parts it is hard to figure out what we need and what we don't need.
      • Nyanko-sensei has quit
      • reosarevok
        The Spark Universe should be your new band, ruaok
      • Nyanko-sensei joined the channel
      • LordSputnik, Mr_Monkey: is there any diagram of the current BB schema?
      • Mr_Monkey
        reosarevok: Yes, let me find you the url
      • reosarevok
        Ok :)
      • Just to have some idea
      • Thanks!
      • gerst joined the channel
      • gerst has quit
      • demonimin has quit
      • Wizzup has quit
      • demonimin joined the channel
      • Wizzup joined the channel
      • Mineo
        outsidecontext, zas, samj1912: does one of you know if something has changed in the way picard handles exceptions? I'm pretty sure picard 1 basically caught every single exception, so you could at least save already correctly associated albums etc. this does not seem to be the case anymore with picard 2
      • Mr_Monkey
        "Congratulations! MetaBrainz Foundation has been selected as a Google Code-in 2018 mentoring organization." Woop wooop !
      • rsh7
        yayy!
      • bukwurm joined the channel
      • LordSputnik
        Damnit, I guess we need some tasks then!
      • SirLagz29 joined the channel
      • SirLagz29 has quit
      • CatQuest
        ok wtf. how am I supposed to select that Neil *wrote* https://bookbrainz.org/edition/ef52527a-f68b-41... ? i tried from both ends, from neil's end and from this one. i only get everything other than "wrote". what?
      • Mr_Monkey
        CatQuest: That's an Edition you're looking at. The 'wrote' relationship is between a Creator and a Work
      • zas
        Mineo: not AFAIK
      • Mr_Monkey
        I find it confusing as well.
      • CatQuest
        https://bookbrainz.org/creator/0fcd19b7-d57a-40... already has two links to editions though :D
      • so yea that is *SO* not clear
      • can we somehow make this different please
      • i mean i get that "wrote" is on a work
      • but in for example mb, we have the "artist credit on the release"
      • Mr_Monkey
        I think that's from a previous version with no constraints for relationships.
      • CatQuest
      • which is separate from the "various roles through relationships (on works)"
      • yes
      • but it's.. very unclear and frustrating because I didn't know that :)
      • Mr_Monkey
        If you edit the relationship by re-searching for "The Sleeper and the Spindle" for example, 'wrote' isn't in the available relationships anymore
      • CatQuest
        yea so i'm not going to do that ;)
      • Mr_Monkey
        :D
      • CatQuest
        but my point still stands. "publicist" is a filed in an edition, but "author" isn't. that is really really weird
      • field*
      • Mr_Monkey
        Agreed. The relationships between entities, and sometimes the entities themselves, aren't quite clear.
      • CatQuest
        anyway what this means is that in order to link a *book* with a *writer of said book* you have to create 3, three, things: a publication for the edition, and then a work to link the edition/book to the author. not.. ideal
      • Mr_Monkey
        Yes, I found it quite confusing when I first played with the website.
      • CatQuest
      • ideally I should only *have* to create the "edition", with the publication (edition group) being autocreated, or be able to link to an old one :)
      • i still don't really grok the difference between a "work" and a "publication" in the grand sense of things. books are often considered the work in and of themselves
      • Mr_Monkey
        I Think There's A Ticket For That™
      • CatQuest
        much like contemporary albums
      • (where i note, you have works for the individual recordings, but not a work for the release/album as a whole (unless we're talking concept album or classical work, but these are not the "album" as a work, but the album is the medium of the work yadda yadda))
      • reosarevok
        CatQuest: a short story collection has a lot of works, but isn't necessarily a work in itself - yet it's a publication / edition
      • Even most classic novels reissued nowadays have at least two works, arguably: the actual novel, and some extra introduction text by an expert telling you about it :)
      • CatQuest
        aha