#metabrainz

      • BilleeKhoj
        I mean you made the ticket
      • 2020-01-08 00823, 2020

      • reosarevok
        Yes, does the ticket fit what you wanted
      • 2020-01-08 00828, 2020

      • BilleeKhoj
        hm..
      • 2020-01-08 00831, 2020

      • reosarevok
        Just want to make sure I got the point
      • 2020-01-08 00838, 2020

      • BilleeKhoj
        i mean, yes I guess?
      • 2020-01-08 00838, 2020

      • reosarevok
        If not, edit it :)
      • 2020-01-08 00808, 2020

      • BilleeKhoj
        bleh. how do you do indent on our wiki (putting a " " before a sentence just makes it like in a grey box with monospace font)
      • 2020-01-08 00820, 2020

      • BilleeKhoj
        anyone?
      • 2020-01-08 00858, 2020

      • SothoTalKer
        use :
      • 2020-01-08 00845, 2020

      • BilleeKhoj
        cheers
      • 2020-01-08 00810, 2020

      • SothoTalKer
        :-)
      • 2020-01-08 00845, 2020

      • c1e0 has quit
      • 2020-01-08 00825, 2020

      • cleo joined the channel
      • 2020-01-08 00849, 2020

      • cleo is now known as Guest9128
      • 2020-01-08 00806, 2020

      • sbvkrishna joined the channel
      • 2020-01-08 00819, 2020

      • Guest9128 is now known as c1e0
      • 2020-01-08 00855, 2020

      • reosarevok
        yvanzo, bitmap: back to Wed meetings, right?
      • 2020-01-08 00808, 2020

      • yvanzo
        alright!
      • 2020-01-08 00824, 2020

      • SothoTalKer
        mhhh wet meetings :D
      • 2020-01-08 00823, 2020

      • SothoTalKer
        is there no better way than to make a ticket for each new area? :P
      • 2020-01-08 00829, 2020

      • reosarevok
        Nope, not that I know of
      • 2020-01-08 00800, 2020

      • yvanzo
        automating areas management was a suggestion for gsoc btw
      • 2020-01-08 00833, 2020

      • prabal has quit
      • 2020-01-08 00834, 2020

      • Freso
        yvanzo: Do we have a reliable source for this? Neither Wikidata nor Geonames (which are the two mostly used now) are good enough.
      • 2020-01-08 00845, 2020

      • yvanzo
        How so? Aren’t these our current references to create areas?
      • 2020-01-08 00815, 2020

      • BilleeKhoj
        yea. doing it automatically is how we ended up with half a million 200pop US dupe-name villages ¬___¬
      • 2020-01-08 00821, 2020

      • BilleeKhoj
        (I still say we should remove any of those unless in use (artist, recording, place, event or user is attached) and then re-add as requested)
      • 2020-01-08 00801, 2020

      • BilleeKhoj
        I mean I got no issue with some dupenamed US village being re-added if a concert was held there or an artist came from there or whatever
      • 2020-01-08 00821, 2020

      • BilleeKhoj
        but let's be realistic? many of those will *never* have a use in mb
      • 2020-01-08 00814, 2020

      • BilleeKhoj
        it's also this dilemma: either we remove them, or we add so many villages from every other country. that is impossible and unmanageable
      • 2020-01-08 00836, 2020

      • BilleeKhoj
        unused villages should be removed and then re-added as needed
      • 2020-01-08 00811, 2020

      • BilleeKhoj
        that way our database doesn't look like some US village directory or whatever
      • 2020-01-08 00828, 2020

      • BilleeKhoj
        I mean. to a Russian this will look odd probably :D
      • 2020-01-08 00813, 2020

      • yvanzo
        the idea was not to dup the whole wikidata/geonames, just to automate using it to create missing areas on demand
      • 2020-01-08 00824, 2020

      • BilleeKhoj
        and.. that worked badly :D
      • 2020-01-08 00856, 2020

      • yvanzo
        it has not been implemented, or am I missing something?
      • 2020-01-08 00810, 2020

      • BilleeKhoj
        i thought we were talking about the old areabot or what
      • 2020-01-08 00814, 2020

      • BilleeKhoj
        i may be mistaken then
      • 2020-01-08 00804, 2020

      • yvanzo
        oh, alright, I don't know much about areabot, it has probably been dismissed a while ago
      • 2020-01-08 00811, 2020

      • BilleeKhoj
        yea it has
      • 2020-01-08 00822, 2020

      • BilleeKhoj
        also that is why drsaunders is doing it manually
      • 2020-01-08 00839, 2020

      • BilleeKhoj
        but there are still a lot of unused villages in the database :/
      • 2020-01-08 00808, 2020

      • yvanzo
        That many?
      • 2020-01-08 00814, 2020

      • BilleeKhoj
        honestly
      • 2020-01-08 00822, 2020

      • BilleeKhoj
        i don't know the numbers. but I think there were quite a few
      • 2020-01-08 00850, 2020

      • yvanzo
        It would be nice to have some feedback about the old areabot, for the record, to prevent having the same issues again.
      • 2020-01-08 00857, 2020

      • BilleeKhoj
        because, and this was the thing with areabot, for some reason US villages in Wikidata were labelled with.. I don't remember. some specific property which it was to interpret as "this should be added", a thing we hadn't known beforehand
      • 2020-01-08 00810, 2020

      • reosarevok
        I think there shouldn't be a problem with that tbh
      • 2020-01-08 00820, 2020

      • reosarevok
        Right now we show the most used areas on top
      • 2020-01-08 00820, 2020

      • yvanzo
        I cannot find anything about AreaBot on the wiki. It is just mentioned in a comment on MBS-8361
      • 2020-01-08 00821, 2020

      • BrainzBot
        MBS-8361: Use geonames data for mb-areas https://tickets.metabrainz.org/browse/MBS-8361
      • 2020-01-08 00844, 2020

      • reosarevok
        Villages were marked as villages, we were supposed to add those - the problem wasn't that, but that the rest of the world just didn't have proper data
      • 2020-01-08 00846, 2020

      • BilleeKhoj
        reosarevok: true, and that helps *a LOT* but there is the issue with all the wrongly added relationships before that was fixed :/
      • 2020-01-08 00855, 2020

      • reosarevok
        That's probably better nowadays, but still not great
      • 2020-01-08 00810, 2020

      • reosarevok
        Wikidata still has crappy coverage of many non-Western areas
      • 2020-01-08 00816, 2020

      • BilleeKhoj
        indeed
      • 2020-01-08 00847, 2020

      • reosarevok
        (it used to be AWFUL which is how we had every tiny village in the US but were lacking most big cities in, say, Romania, not to mention something like Indonesia)
      • 2020-01-08 00831, 2020

      • BilleeKhoj
        but iirc it was also that areas which outside of the US wouldn't be labelled as "village" because the pop was too low, *are* marked as village because it's some "official" designation in the US or something
      • 2020-01-08 00810, 2020

      • BilleeKhoj
        in addition to what you just said
      • 2020-01-08 00841, 2020

      • reosarevok
        I don't remember tbh
      • 2020-01-08 00843, 2020

      • BilleeKhoj
        I mean. as long as there is something musically related i'm all for adding it. but there are going to be places that have a population of 200 and will never
      • 2020-01-08 00803, 2020

      • BilleeKhoj
        (or when, eventually possibly. we can *then* add it)
      • 2020-01-08 00806, 2020

      • reosarevok
        I feel having everything is better than not having stuff unless we can find a neat way of auto-adding
      • 2020-01-08 00807, 2020

      • reosarevok
        But
      • 2020-01-08 00820, 2020

      • reosarevok
        iNaturalist has an interesting way of dealing with missing species
      • 2020-01-08 00829, 2020

      • BilleeKhoj
        ...
      • 2020-01-08 00850, 2020

      • reosarevok
        When the user requests them, it automatically searches a few databases, and adds them automatically based on that
      • 2020-01-08 00801, 2020

      • BilleeKhoj
        cool
      • 2020-01-08 00808, 2020

      • reosarevok
        Us doing the same based on Wikidata would probably work (I guess that's kinda what yvanzo suggested?)
      • 2020-01-08 00813, 2020

      • BilleeKhoj
        hmm
      • 2020-01-08 00823, 2020

      • BilleeKhoj
        at least having automatic ticket creation!
      • 2020-01-08 00833, 2020

      • reosarevok
        Where rather than needing a ticket, you'd have a way to say "search for this area in Wikidata" at the bottom of a MB search
      • 2020-01-08 00842, 2020

      • reosarevok
        Like we have "Add a work" or whatevs
      • 2020-01-08 00842, 2020

      • BilleeKhoj
        oohhh
      • 2020-01-08 00803, 2020

      • BilleeKhoj
        but what if it's not found?
      • 2020-01-08 00809, 2020

      • BilleeKhoj
        (or, user error)
      • 2020-01-08 00815, 2020

      • reosarevok
        Well, then we'd still need to add it in Wikidata I guess
      • 2020-01-08 00825, 2020

      • reosarevok
        We could still keep AREQ for that if needed
      • 2020-01-08 00828, 2020

      • reosarevok
        But it'd automate a lot of cases I suspect
      • 2020-01-08 00834, 2020

      • BilleeKhoj
        hm! yea
      • 2020-01-08 00849, 2020

      • BilleeKhoj
        and if things are well linked in WD the relationships can be propagated in mb as well
      • 2020-01-08 00853, 2020

      • reosarevok
        (iNaturalist does it so that if it's not in the automatic DBs, you can request it manually)
      • 2020-01-08 00803, 2020
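
A minimal sketch of the lookup order discussed above: check MB first, then an external source such as Wikidata, and only fall back to a manual request (AREQ-style) when neither finds anything. The function name `resolve_area`, the two lookup callables, and the toy data are all hypothetical; nothing here is an actual MusicBrainz or Wikidata API.

```python
from typing import Callable, Optional

# Hypothetical lookup type: takes an area name, returns a record or None.
Lookup = Callable[[str], Optional[dict]]


def resolve_area(name: str, search_mb: Lookup, search_wikidata: Lookup) -> dict:
    """Try MB, then Wikidata, then fall back to a manual request."""
    found = search_mb(name)
    if found is not None:
        return {"status": "exists", "area": found}
    candidate = search_wikidata(name)
    if candidate is not None:
        # The area could be imported automatically from this record.
        return {"status": "import", "area": candidate}
    # Not found anywhere (or user error): someone has to handle it by hand.
    return {"status": "manual_request", "area": None}


# Toy in-memory stand-ins for the two searches (made-up data).
mb = {"Tallinn": {"name": "Tallinn"}}
wd = {"Tartu": {"name": "Tartu", "source": "wikidata-placeholder"}}

for query in ("Tallinn", "Tartu", "Nowhereville"):
    print(query, resolve_area(query, mb.get, wd.get)["status"])
```

The "not found" branch is where a typo or a genuinely missing Wikidata entry would end up, which is the case raised just below.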

      • BilleeKhoj
        sounds like a plan tbh
      • 2020-01-08 00824, 2020

      • BilleeKhoj
        (but with the addition that we run a cronjob to remove every village with ### pop that is not in use)
      • 2020-01-08 00831, 2020

      • BilleeKhoj
        (maybe i'll be wrong and there will be like 10)
      • 2020-01-08 00835, 2020

      • reosarevok
        Yeah. iNat adds the parent taxons automatically based on that data, and that should also be doable in MB: "if we have the parent in MB, link it, if not, add the parent, until we get there"
      • 2020-01-08 00859, 2020

      • reosarevok
        (we'd need to decide what intermediate things to skip, etc, but the general idea should work)
      • 2020-01-08 00811, 2020
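
The "if we have the parent in MB, link it, if not, add the parent, until we get there" idea can be sketched roughly as below. This is an illustration only: areas are plain strings, `source_parent` stands in for whatever hierarchy the external source provides, `mb_parent` stands in for the areas MB already has, and `skip` is a hypothetical set of intermediate levels to leave out. None of these names come from MusicBrainz code.

```python
from typing import Dict, List, Optional


def add_with_parents(area: str,
                     source_parent: Dict[str, Optional[str]],
                     mb_parent: Dict[str, Optional[str]],
                     skip: frozenset = frozenset()) -> List[str]:
    """Add `area` to mb_parent, plus any missing ancestors; return what was added."""
    if area in mb_parent:
        return []  # already in MB, nothing to do
    mb_parent[area] = None
    added = [area]
    child = area
    parent = source_parent.get(child)
    while parent is not None:
        if parent in skip:
            # Intermediate level we chose not to import: jump to its parent.
            parent = source_parent.get(parent)
            continue
        already_known = parent in mb_parent
        if not already_known:
            mb_parent[parent] = None
            added.append(parent)
        mb_parent[child] = parent  # link child to its parent
        if already_known:
            break  # reached an area MB already has, so we "got there"
        child = parent
        parent = source_parent.get(child)
    return added


# Made-up hierarchy from the external source.
source = {"village": "county", "county": "state", "state": "country", "country": None}
# MB already knows the two top levels.
mb_areas = {"country": None, "state": "country"}

print(add_with_parents("village", source, mb_areas))
print(mb_areas)
```

Passing `skip=frozenset({"county"})` with a fresh `mb_areas` would link the village straight to the state instead.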

      • BilleeKhoj
        sure
      • 2020-01-08 00833, 2020

      • BilleeKhoj
        tbf, I don't actually agree with skipping intermediate things. I mean if we are "going to have all the things" and "feel having everything is better than not having stuff" then it should apply to larger areas, and some historical areas too?
      • 2020-01-08 00830, 2020

      • BilleeKhoj
        releases, people and recordings can easily have "intermediate linked area" credit without having a smaller area
      • 2020-01-08 00846, 2020

      • BilleeKhoj
        or it can be old and information fuzzy
      • 2020-01-08 00814, 2020

      • BilleeKhoj
        or area used to be non-intermediate but a higher level but was subsumed later. etc
      • 2020-01-08 00846, 2020

      • BilleeKhoj
        can areas be linked with other areas with a date range?
      • 2020-01-08 00800, 2020

      • BilleeKhoj
        so you could like logically say "blup area was a part of thingycountry from 1200-1500" but then "was a part of othercountry 1520-1907" and "became a sovereign state in 1908"
      • 2020-01-08 00824, 2020

      • BilleeKhoj
        eh. ignore the last one
      • 2020-01-08 00849, 2020

      • BilleeKhoj
        that'd be the start date of the area i guess
      • 2020-01-08 00856, 2020

      • BilleeKhoj
        but like
      • 2020-01-08 00818, 2020

      • yvanzo
        Currently AREQ already requires a reference such as Wikidata or Geonames.
      • 2020-01-08 00841, 2020

      • BilleeKhoj
        what do we do for a medieval work written in some place in some date in a region that no longer exists, but does have an equivalent area today
      • 2020-01-08 00800, 2020

      • BilleeKhoj
        but borders are not identical so you should rather use a historical name
      • 2020-01-08 00810, 2020

      • yvanzo
        Historic areas are currently not supported.
      • 2020-01-08 00819, 2020

      • BilleeKhoj
        and that is a huge problem imho
      • 2020-01-08 00834, 2020

      • BilleeKhoj
        not even near historical such. but old ones
      • 2020-01-08 00852, 2020

      • BilleeKhoj
        anyway we do actually have some.. we have East Germany afaik
      • 2020-01-08 00803, 2020

      • BilleeKhoj
        and .. Czechoslovakia?
      • 2020-01-08 00835, 2020

      • BilleeKhoj
        at some point we *have* to enable historical areas.. what happens when today's borders are altered?
      • 2020-01-08 00844, 2020

      • BilleeKhoj
        say Catalonia becomes a country.. how do we deal with that?
      • 2020-01-08 00858, 2020

      • BilleeKhoj
        make a new entity and ignore that it used to be part of Spain until such time
      • 2020-01-08 00800, 2020

      • BilleeKhoj
        ?
      • 2020-01-08 00836, 2020

      • BilleeKhoj
        all old links of artists to this area would be erroneous if moved to new country "Catalonia" when the person f.ex was born and lived and died in "Spain"
      • 2020-01-08 00847, 2020

      • yvanzo
        We should probably replace area fields with area relationships (which have dates).
      • 2020-01-08 00829, 2020
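
To make the "relationships which have dates" idea concrete, here is a small sketch of a dated part-of link and a lookup for a given year. The `PartOf` class and `parents_in_year` function are hypothetical illustrations, not the MusicBrainz schema, and the example data is the made-up "blup area" from earlier in the log.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PartOf:
    """A dated 'child was part of parent' link (illustrative only)."""
    child: str
    parent: str
    begin_year: Optional[int]  # None means the start is unknown or open
    end_year: Optional[int]    # None means the link has not ended


def parents_in_year(rels: List[PartOf], child: str, year: int) -> List[str]:
    """Return every parent whose date range covers `year` (inclusive)."""
    return [
        r.parent
        for r in rels
        if r.child == child
        and (r.begin_year is None or r.begin_year <= year)
        and (r.end_year is None or year <= r.end_year)
    ]


rels = [
    PartOf("blup area", "thingycountry", 1200, 1500),
    PartOf("blup area", "othercountry", 1520, 1907),
]

print(parents_in_year(rels, "blup area", 1300))  # ['thingycountry']
print(parents_in_year(rels, "blup area", 1510))  # []
```

The empty result for 1510 shows where this stays simple-minded: gaps, overlaps and changing borders still need a human decision.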

      • BilleeKhoj
        that works if the old area is equivalent to the new one.
      • 2020-01-08 00832, 2020

      • BilleeKhoj
        not always so
      • 2020-01-08 00858, 2020

      • BilleeKhoj
        Norway of pre-Kalmar Union had a part of what is now Sweden
      • 2020-01-08 00801, 2020

      • BilleeKhoj
        (as an example)
      • 2020-01-08 00841, 2020

      • BilleeKhoj
        f.ex a writer from there. how do you do? do you write "was from Norway (daterange)" or "was from Sweden (daterange)"
      • 2020-01-08 00857, 2020

      • BilleeKhoj
        what if they lived during the change?
      • 2020-01-08 00806, 2020

      • BilleeKhoj
        what if they changed their own opinion of it?
      • 2020-01-08 00815, 2020

      • BilleeKhoj
        what if data is inconclusive
      • 2020-01-08 00810, 2020

      • yvanzo
        that's life, capturing the complexity of border changes is beyond MB goals IMHO
      • 2020-01-08 00834, 2020

      • BilleeKhoj
        for the "Herjedalen" scenario it's fairly ok to just use Norway with the date range (as it was a part of that then) but for situations where the old area stopped existing. or a new area was created
      • 2020-01-08 00801, 2020

      • reosarevok
        I just use Estonia for Soviet Estonia people
      • 2020-01-08 00810, 2020

      • yvanzo
        if there is a floss project that handles border changes and historic area data, we should surely use it though.
      • 2020-01-08 00836, 2020

      • BilleeKhoj
        yvanzo: yes and no. it's unfortunately something we're going to have to deal with eventually as the data we collect gets better and better. music is not too infrequently closely connected to "area", both historical and current(ly changing)
      • 2020-01-08 00841, 2020

      • reosarevok
        Because it's more relevant like that and it's not like they generally chose to be Soviet
      • 2020-01-08 00848, 2020

      • BilleeKhoj
        yep
      • 2020-01-08 00850, 2020

      • reosarevok
        But I can see how it's tricky
      • 2020-01-08 00852, 2020

      • BilleeKhoj
        that makes sense..
      • 2020-01-08 00855, 2020

      • BilleeKhoj
        yes
      • 2020-01-08 00817, 2020

      • reosarevok
        That said, I kind of agree with yvanzo that we should figure out how to deal with music stuff first, and area stuff later :p
      • 2020-01-08 00824, 2020

      • BilleeKhoj
        i mean yea
      • 2020-01-08 00829, 2020

      • reosarevok
        (we still suck at a lot of directly musical bits)
      • 2020-01-08 00835, 2020

      • reosarevok
        bitmap: around btw?
      • 2020-01-08 00836, 2020

      • BilleeKhoj
        (and this is why i think "nuke the unused villages"! :P)
      • 2020-01-08 00849, 2020

      • reosarevok
        But that's effort! :p
      • 2020-01-08 00856, 2020

      • BilleeKhoj
        tsk
      • 2020-01-08 00858, 2020

      • reosarevok
        (and the MBID is probably already in Wikidata, etc)
      • 2020-01-08 00821, 2020

      • BilleeKhoj
        both those would be easy to do with some report regex thingy
      • 2020-01-08 00803, 2020

      • BilleeKhoj
        at least I'd be very interested to see a list of "areas with no link other than to one other (parent) area"
      • 2020-01-08 00821, 2020

      • BilleeKhoj
        and order by listed pop
      • 2020-01-08 00835, 2020

      • reosarevok
        yvanzo: do you have some time to check some PRs?
      • 2020-01-08 00842, 2020

      • yvanzo
        yup!
      • 2020-01-08 00844, 2020

      • reosarevok
        BilleeKhoj: the pop thing is harder, but the first bit is easy. One sec
      • 2020-01-08 00853, 2020
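
The "first bit" (areas with no link other than to their parent area) could look something like this in miniature. It is a toy sketch over in-memory data with hypothetical names (`unused_areas`, `parent_of`, `used`); an actual report would be a query against the MB database, whose schema is not assumed here.

```python
from typing import Dict, List, Set


def unused_areas(parent_of: Dict[str, str], used: Set[str]) -> List[str]:
    """Areas whose only link is the one pointing at their parent area.

    parent_of maps each area to its parent; `used` holds areas that have
    something else attached (artist, event, place, ...).
    """
    has_children = set(parent_of.values())
    return sorted(
        area
        for area in parent_of
        if area not in used and area not in has_children
    )


# Made-up data for illustration.
parent_of = {
    "tiny village": "some county",
    "busy town": "some county",
    "some county": "some state",
}
used = {"busy town"}  # e.g. an artist is from there

print(unused_areas(parent_of, used))  # ['tiny village']
```

Ordering by population is left out on purpose, since that number would have to come from somewhere else.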

      • BilleeKhoj
        and population, not pop music, before you go all dads on me, reosarevok
      • 2020-01-08 00803, 2020

      • reosarevok
        Pop music would be easier!
      • 2020-01-08 00806, 2020

      • BilleeKhoj
        :P :D
      • 2020-01-08 00812, 2020

      • reosarevok
        Since we store that as tags, but not the population :p
      • 2020-01-08 00826, 2020

      • BilleeKhoj
        hm. I actually thought we did :/
      • 2020-01-08 00800, 2020

      • reosarevok
        yvanzo: I really want the genre stuff to get merged. https://github.com/metabrainz/musicbrainz-server/… - do you feel that's problematic? (IMO we can look at how alias stuff will work once we actually implement it)
      • 2020-01-08 00805, 2020

      • BilleeKhoj
        but like I'll shut up if it's like 10