bleh. how do yo udo indent on our wiki (putting a " " before a sentence jsut maes it like in a grey box with monospacefont)
anyone?
SothoTalKer
use :
BilleeKhoj
cheers
SothoTalKer
:-)
c1e0 has quit
cleo joined the channel
cleo is now known as Guest9128
sbvkrishna joined the channel
Guest9128 is now known as c1e0
reosarevok
yvanzo, bitmap: back to Wed meetings, right?
yvanzo
alright!
SothoTalKer
mhhh wet meetings :D
is there no better way than to make a ticket for each new area? :P
reosarevok
Nope, that I know
yvanzo
automating areas management was a suggestion for gsoc btw
prabal has quit
Freso
yvanzo: Do we have a reliable source for this? Neither Wikidata nor Geonames (which are two mostly used now) are good enough.
yvanzo
How so? Aren’t these our current references to create areas?
BilleeKhoj
yea. doing it automatically is how we ended p with half a million 200pop US dupe-name villages ¬___¬
(I still say we should remove any of those unless in use (artist, recording, place event or user is attached) and thne re-add as requested)
I mean I got no issue with some dupenamed us village beign re-added if a concert was held there or an artist came fro mthere or whatever
but lets be realistic? many of those will *never* hav a use in mb
it's also this dilemma: either we remove them, or we add so many villages fro mevery other country. that is impossible and unmanageable
unused villages shoudl be removed and thne readded as needed
that way our database doesn't look like some US village directory or whatever
I mean. to a russian this will look odd probably :D
yvanzo
the idea was not to dup the whole wikidata/geonames, just to automate using it to create missing areas on demand
BilleeKhoj
and.. that worked badly :D
yvanzo
it has not been implemented, or am I missing something?
BilleeKhoj
i thought we were talking aobut old areabot or what
i may be mistaken then
yvanzo
oh, alright, I don't know much about areabot, it has probably been dismissed a while ago
BilleeKhoj
yea it has
also that is why drsaunders is doing it manually
but there is still a lot of unused villages in the database :/
yvanzo
That many?
BilleeKhoj
honestly
i don't know the numbers. but I think there was quite a few
yvanzo
That would be nice to have some feedback about the old areabot, for the record, to prevent having the same issues again.
BilleeKhoj
because and this was the thing with areabot, is that for some reason, us-villages in wikidata was labelled with.. I don't remember. some spesific property which it was to interpret as "this should be added", a thing we hadn't known in beforehand
reosarevok
I think there shouldn't be a problem with that tbh
Right now we show the most used areas on top
yvanzo
I cannot find anything about AreaBot on the wiki. It is just mentioned in comment to MBS-8361
Villages were marked as villages, we were supposed to add those - the problem wasn't that, but that the rest of the world just didn't have proper data
BilleeKhoj
reosarevok: true, and that helps *a LOT* but there is the issue with al lthe wrongly added relationships before that was fixed :/
reosarevok
That's probably better nowadays, but still not great
Wikidata still has crappy coverage of many non-Western areas
BilleeKhoj
indeed
reosarevok
(it used to be AWFUL which is how we had every tiny village in the US but were lacking most big cities in, say, Romania, not to mention something like Indonesia)
BilleeKhoj
but iirc it was also that areas which outside of the us wouldn't be lablled as "village" because the pop was too low, *is* marked as village becasue it's soem "official" designation in the us or something
in addition to what yo ujsut said
reosarevok
I don't remember tbh
BilleeKhoj
I mean. as long as there is something musically related i'm all for adding it. but there are going ot be palces that have a population of 200 and wil lnever
(or when, eventually possibly. we can *then* add it)
reosarevok
I feel having everything is better than not having stuff unless we can find a neat way of auto-adding
But
iNaturalist has an interesting way of dealing with missing species
BilleeKhoj
...
reosarevok
When the user requests them, it automatically searches a few databases, and adds them automatically based on that
BilleeKhoj
cool
reosarevok
Us doing the same based on Wikidata would probably work (I guess that's kinda what yvanzo suggested?)
BilleeKhoj
hmm
atleast having an automatic ticket creation!
reosarevok
Where rather than needing a ticket, you'd have a way to say "search for this area in Wikidata" at the bottom of a MB search
Like we have "Add a work" or whatevs
BilleeKhoj
oohhh
but what if it's not found?
(or, user error)
reosarevok
Well, then we'd still need to add it in Wikidata I guess
We could still keep AREQ for that if needed
But it'd automate a lot of cases I suspect
BilleeKhoj
hm! yea
and if things are well linkied in WD the relationships can be propagated in mb as well
reosarevok
(iNaturalist does it so that if it's not in the automatic DBs, you can request it manually)
BilleeKhoj
sounds liek a plan tbh
(but with the addition that we run a cronjob to remove every village with ### pop that is not in use)
(maybe i'll be wrong and there wil lbe liek 10)
reosarevok
Yeah. iNat adds the parent taxons automatically based on that data, and that should also be doable in MB: "if we have the parent in MB, link it, if not, add the parent, until we get there"
(we'd need to decide what intermediate things to skip, etc, but the general idea should work)
BilleeKhoj
sure
tbf, I don't actually agree with skipping intermediate thnigs. I mean if we are "going to have all the things" and "feel having everything is better than not having stuff" then it shoudl aply to larger areas, and soem historical areas too?
releases, people and recordings can easily have "intermeidate linked area" credit without having a smaller area
or it can be old and information fuzzy
or area used to be non-intermediate but a higher level but was subsumed later. etc
can areas be linked with other areas with a date range?
so you could like logically say "blup area was a part of thingycountry from 1200-1500" but then was a part of othercountry 1520-1907" and "become soverenge state in 1908"
eh. ignore the last one
that'd be the start date of the area i gues
but like
yvanzo
Currently AREQ already requires a reference such as Wikidata or Geonames.
BilleeKhoj
what do we do for a medieval work written in some place in some date in a region that no longer exists, but does have a n equivalent area today
but borders are not identical so you sohudl rather use an historical name
yvanzo
Historic areas are currently not supported.
BilleeKhoj
and that is a huge problem imho
not even near historical such. but old ones
anyway we do actually have some.. we have east germany afaik
and .. chezoslovakia?
at ome point we *have* to enable historical areas.. what happens when todays borders are altered?
say catalonia becomes a country.. how do we deal with that?
make a new entity and igonre that is used ot be aprt of spain until such time
?
all old links of artists to this area would be erranous if moved to new country "catalonia" whne the person f.ex was born and lived and died in "spain"
yvanzo
We should probably replace area fields with area relationships (which have dates).
BilleeKhoj
that works if the old area is equivalent to the new one.
not always so
Norway of pre kalmar-union had a prt of what is now sweden
(as an example)
w.ex a writer fro mthere. how do yo udo? do yo uwrite "was from norway (daterange)" or "was from sweden daterange"
what if they lived during the change?
what if they chnged theri own oppinion of it ?
what if data is unconclusive
yvanzo
taht's live, capturing the complexity of border changes is beyond MB goals IMHO
BilleeKhoj
for the "herjedalen" scenarion it's fairly ok to just use norway with he data range (as it was a prt of that then) but for situations where the old area stopped exisitng. or a new area created
reosarevok
I just use Estonia for Soviet Estonia people
yvanzo
if there is a floss project that handles border changes and historic area data, we should surely use it though.
BilleeKhoj
yvanzo: yes and no. it's unfortunately something we're going to have to deal with eventually as the data we colelct gets better and better. music s not too unoften closely connected to "area" bot historical and current(ly changing)
reosarevok
Because it's more relevant like that and it's not like they generally chose to be Soviet
BilleeKhoj
yep
reosarevok
But I can see how it's tricky
BilleeKhoj
that makes sence..
yes
reosarevok
That said, I kind of agree with yvanzo that we should figure out how to deal with music stuff first, and area stuff later :p
BilleeKhoj
i mena yea
reosarevok
(we still suck at a lot of directly musical bits)
bitmap: around btw?
BilleeKhoj
(and this is why i think "nuke the unused villages"! :P)
reosarevok
But that's effort! :p
BilleeKhoj
tsk
reosarevok
(and the MBID is probably already in Wikidata, etc)
BilleeKhoj
both those would be easy to do with some report regex thingy
atleast I'd be very interested to see a list of "areas ith no link other than to one other (parent)area"
and order by listed pop
reosarevok
yvanzo: do you have some time to check some PRs?
yvanzo
yup!
reosarevok
BilleeKhoj: the pop thing is harder, but the first bit is easy. One sec
BilleeKhoj
and population, not pop music , before you go all dads on me, reosarevok
reosarevok
Pop music would be easier!
BilleeKhoj
:P :D
reosarevok
Since we store that as tags, but not the population :p
BilleeKhoj
hm. I actually thought we did :/
reosarevok
yvanzo: I really want the genre stuff to get merged. https://github.com/metabrainz/musicbrainz-serve... - do you feel that's problematic? (IMO we can look at how alias stuff will work once we actually implement it)