I was hoping for a community effort to agree on tags for when synonymous tags exist
yvanzo
there is no guideline, it's open, community agreement is expressed with vote
Vexatos has quit
Vexatos joined the channel
BabyJesus666 has quit
xplt joined the channel
xplt is now known as xplt_
xplt_ has quit
reosarevok
dominikh: if you mean for genres, we have one "marked" genre, while the others currently aren't
But for non-genres, there's nothing specific, no
dominikh
reosarevok: right, for non-genre. I just noticed a lot of duplicate tags, like the aforementioned game,game music,vgm – which all express the same idea
so of course part of me wishes for that data to be normalized :P
reosarevok
An option would be to make a bot that adds the other two when one is there, then it's "normalized" ;p
dominikh
bonus points for nefariousness if it also downvotes the other two?
CatQuest
-1 to the nefariousness
but the other idea i propose hartily
dominikh
I'm torn on that idea. It does mean that if the original tag was wrong, and nobody has downvoted it yet, the bot would end up adding another incorrect tag
Cyna joined the channel
reosarevok
True. But tags are not right or wrong, you just agree with them or not :p
If I think Bach is video game music, then it is! :D (for me) - maybe I'm making a video game that uses Bach :p
dominikh
in that case I propose a plethora of new relationships so that we can record the absolute truth :P
reosarevok
Even Wikidata doesn't try to do that and they have allll the relationships :p
CallerNo6 has quit
CallerNo6 joined the channel
xplt joined the channel
xplt is now known as xplt_
Leo_Verto_ joined the channel
Leo_Verto has quit
Leo_Verto_ is now known as Leo_Verto
outsidecontext joined the channel
peaveyman has quit
peaveyman joined the channel
Leo_Verto_ joined the channel
Leo_Verto has quit
Leo_Verto_ is now known as Leo_Verto
outsidecontext has quit
Toast joined the channel
simukis joined the channel
Cyna has quit
D4RK-PH0ENiX has quit
D4RK-PH0ENiX joined the channel
CatQuest is now known as lutefix
c1e0 joined the channel
lutefix is now known as CatCat
zykotick9 joined the channel
CallerNo6 is now known as CallerOhNo6
c1e0_ joined the channel
c1e0 has quit
c1e0_ has quit
Toast has quit
Toast joined the channel
Jybz joined the channel
Toast has quit
dominikh
hm, question about the actual database: when entities get merged, redirects are being added to _gid_redirect tables. does that imply the original rows with the old gids get deleted?
reosarevok
Yeah
But relationships etc are transferred to the entity they got merged into first
dominikh
right. I'm playing around with some software/database designs that rely heavily on MB data, using a local version of the database. if entities can be deleted, I'll have to keep that in mind when designing my foreign keys :)
CallerOhNo6
^ entities can *also* be outright deleted.
dominikh
does that ever happen to entities that aren't spam?
CallerOhNo6
ideally "hardly ever", but it can happen.
reosarevok
dominikh: it also happens if someone just adds, say, an artist that does exist, but doesn't use it
(adding at least one relationship, even if it's just a link, avoids that)
CallerOhNo6
oh, good point. I forgot about that.
dominikh
right. those cases are probably the easiest to handle with my use case. if nothing uses it, I won't, either. merges prove far more difficult. imagine an experimental music database/player that pretends to live in a world where everything is tagged with musicbrainz IDs.
files are stored as their track IDs
a bunch of postgresql triggers can probably take care of it, though
CallerOhNo6
Cool. I'd like to see how that comes together.
dominikh
unlikely to turn into anything user-friendly, on account of all the databases that need setting up and maintaining, but it may be a nice toy for the nerds among us
CallerOhNo6
I guess one other (hopefully rare) case you'd need to consider is when something is merged in error, and then needs to be recreated (with a new MBID).
dominikh
that pretty much requires manual intervention and retagging, won't it
CallerOhNo6
not always, but yeah
e.g. recordings might be incorrectly merged, but the release ID remain unchanged
dominikh
yeah, I'll probably only store track IDs and figure out the rest via joins. reduce the risk of IDs disappearing
at any rate, early days. will see how many issues I run into