geordi was basically undefined given how many things the eighty different people who had come up with "sort of ingestion" ideas had come up with, so I went for solving needs I understood
nikki
presumably because you want to know what's already matched?
ianmcorvidae
which in this case was the two I mentioned, for editors and for data providers
nikki
I dunno... I guess I'm not sure how you thought it'd work (and/or I'm too tired to understand what you've said)
luks
I'm not good at explaining things, I know that :)
the ultimate goal I saw was adding things to MB automatically
and the rest being just tools to help with that
hawke_1
luks: why would you need an external database to keep track of it, if geordi knows what it has matched?
ianmcorvidae
ah, yeah, that was not in the goals, and is a much harder thing to do right
kepstin-work
we weren't sure we wanted to add stuff "automatically"; geordi is just a tool to make it easier to import/match large external data sources manually
ianmcorvidae
versus making editing easier, which is the editor perspective on geordi
luks
hawke_1: I need a database to know what's processed, possible not matched
kepstin-work
but a bot could certainly ask geordi for unmatched releases, then either import or match them, then tell geordi that it's matched
and rinse, repeat
luks
hawke_1: and I need to have a different app to be able to confirm "candidate" matches I'm not completely sure about
hawke_1
luks: Isn’t everything unmatched a candidate for matching?
ianmcorvidae
I see what luks is saying, that doesn't let you keep track of things that you haven't successfully matched, yeah
luks
hawke_1: possibly, but after some time
ianmcorvidae
there's nothing in geordi for "I tried, but it didn't work, so don't try again for a bit"
kepstin-work
adding support to geordi for storing 'potential' or 'possible' matches might be interesting
but yeah, failed matches would be an issue :/
ianmcorvidae
yeah, indeed
hawke_1
I don‘t quite get what would be the “I” in “I tried but it didn’t work”
ianmcorvidae
automatic matches are already displayed differently than human-done ones, but adding more levels may be a good idea
kepstin-work
if geordi lets an app add arbitrary extra metadata fields, that might be interesting.
hawke_1
that would be any bot/algorithm wouldn’t it?
ianmcorvidae
hawke_1: whatever script
yes
hawke_1
each one would need its own 'tried' flag
kepstin-work
the app could add an 'x-luks-script-match: failed' tag or whatever
and do a search for things without that field?
ianmcorvidae
well, geordi does let you keep track of who or what did the match, so
there's just no way to attach data to an item other than having direct elasticsearch access or submitting a match, which is what the potential-match thing would be for
also: apparently that pep8 test *did* in fact pass
which has me mystified.
kepstin-work
ianmcorvidae: does it fail on your box?
ianmcorvidae
kepstin-work: yes, rather dramatically
kepstin-work
It might be an issue with the test code itself, then
failing to detect a failure :)
ianmcorvidae
lol
kepstin-work
(e.g. counting an 'unable to find executable' as a pass)
ianmcorvidae
hah
I'm silly :P
kepstin-work
hmm. by any chance, is jenkins running the tests from outside of the 'geordi' directory?
Ian McEwen: Use the right directory for the pep8 test.
kepstin-work
there you go, now it's failing properly :)
ianmcorvidae
yup
ruaok joined the channel
Freso
FAILURE, Ian. It's a FAILURE!
ianmcorvidae
yeah
Freso
(MBo failed too, when I added a PEP8 test. >_>)
ianmcorvidae
I need to figure out which things it's yelling about I care about and which rules I should decree aren't part of geordi's style guide :P
(already removed one, for expecting two line breaks before classes (or maybe after imports? I wasn't really sure -- but double linebreaks are gross :P)
Leftmost
I agree.
Freso
Heh.
Leftmost
Excess whitespace makes me sad.
Freso
I want to adhere to Pocoo's style guide, so I'm keeping them.
ianmcorvidae
hehe
Freso
Even as I generally disapprove of excessive whitespace.
ianmcorvidae
I'm of two minds about maybe increasing the line length
Freso
I still think a consistent coding style is > removing whitespace.
ianmcorvidae: I've so far managed to keep all Python files under 80 characters long.
...
ianmcorvidae
yeah, but as pep8 itself says, it's more important within a given project or file
Freso
>_>
ianmcorvidae
hah
Freso
All *lines* in Python files.
ianmcorvidae
insert joke here about writing python like APL? ;)
Freso
:)
Python forcing whitespace indentation is a pretty silly language to make small-yes-extensive one liner scripts in...
...
Ugh.
I'm too tired for this.
I should go to bed.
ianmcorvidae
heh semicolons
ianmcorvidae was clearly writing too much perl before I worked on this
Leftmost
Freso, consistent coding style is good. If I were working on a project that specified extra whitespace, I would probably do it. But for projects I start, I won't.