luks: still alive? got another python-quiz for you :p
2006-10-03 27653, 2006
luks
yep, still here :)
2006-10-03 27608, 2006
canidae
need a workaround for this error "UnicodeEncodeError: 'ascii' codec can't encode characters in position 10-12: ordinal not in range(128)"
2006-10-03 27619, 2006
canidae
apparently i'm getting some string with weird characters
2006-10-03 27634, 2006
luks
use unicode objects
2006-10-03 27643, 2006
yllona
canidae: so you're on a mission to: 1) keep everyone young; 2) give everyone a few grey hairs :)
2006-10-03 27634, 2006
luks
canidae: python uses only ascii for implicit conversions between str and unicode
2006-10-03 27642, 2006
canidae
yllona: i'm not quite sure what my mission is, although i suspect i won't have any hair left on my head if i don't get this working within the end of the week :p'
2006-10-03 27648, 2006
luks
to you have to call e.g. something.encode('utf-8')
2006-10-03 27652, 2006
luks
*so
2006-10-03 27657, 2006
yllona
canidae: :)
2006-10-03 27618, 2006
yllona
yay! luks to the rescue ;)
2006-10-03 27602, 2006
canidae
luks: so... you're suggesting i do a "text.encode('utf-8')" then?
2006-10-03 27617, 2006
luks
what exactly are you doing?
2006-10-03 27649, 2006
canidae
well :)
2006-10-03 27650, 2006
luks
if you need to pass the string to some external library which expects UTF-8 then yes
2006-10-03 27657, 2006
canidae
right
2006-10-03 27600, 2006
canidae
*breathe*
2006-10-03 27637, 2006
canidae
ok, so i got pylucene up & running after alot of pain, i've made a cute index just below 2gb and i'm fetching fields from a document
2006-10-03 27631, 2006
canidae
it works well for most, but at least one artist didn't like when i tried to "print doc.get('artist')"
2006-10-03 27624, 2006
canidae
not quite sure what more i can say, it borks when i try to do that print :)
2006-10-03 27632, 2006
luks
print uses the encoding from your locale
2006-10-03 27652, 2006
luks
unless you redirect the output to a file
2006-10-03 27656, 2006
luks
then it uses ascii
2006-10-03 27641, 2006
canidae
it's a cgi-script, so i guess it outputs to apache :p
2006-10-03 27602, 2006
luks
yeah, then you probably need to encode it explicitly
2006-10-03 27653, 2006
luks
okay, i'm off to bed
2006-10-03 27656, 2006
luks
good night
2006-10-03 27658, 2006
luks
or good morning :)
2006-10-03 27600, 2006
luks has quit
2006-10-03 27619, 2006
canidae
well that's the oddest...
2006-10-03 27601, 2006
canidae
oh wait, it isn't :)
2006-10-03 27633, 2006
canidae
이동준
2006-10-03 27643, 2006
canidae
no wonder it borked if it tried to display that as ascii
2006-10-03 27658, 2006
tro has quit
2006-10-03 27643, 2006
tro joined the channel
2006-10-03 27604, 2006
CarlFK joined the channel
2006-10-03 27618, 2006
yllona has quit
2006-10-03 27644, 2006
Yurim joined the channel
2006-10-03 27608, 2006
Yurim
Hi folks!
2006-10-03 27625, 2006
Amblin- joined the channel
2006-10-03 27610, 2006
Amblin has quit
2006-10-03 27635, 2006
hawke has quit
2006-10-03 27641, 2006
toxickore joined the channel
2006-10-03 27640, 2006
hawke joined the channel
2006-10-03 27611, 2006
Yurim has quit
2006-10-03 27638, 2006
rpedro has quit
2006-10-03 27609, 2006
rpedro joined the channel
2006-10-03 27604, 2006
SoothingR joined the channel
2006-10-03 27642, 2006
MrQwerty joined the channel
2006-10-03 27638, 2006
toxickore has quit
2006-10-03 27604, 2006
FauxFaux has quit
2006-10-03 27652, 2006
FauxFaux joined the channel
2006-10-03 27625, 2006
rpedro has quit
2006-10-03 27611, 2006
rpedro joined the channel
2006-10-03 27657, 2006
dseomn has quit
2006-10-03 27624, 2006
sidd_ has quit
2006-10-03 27629, 2006
Aankhen`` has quit
2006-10-03 27652, 2006
yalaforge joined the channel
2006-10-03 27647, 2006
Freso joined the channel
2006-10-03 27631, 2006
luks joined the channel
2006-10-03 27632, 2006
luks has quit
2006-10-03 27654, 2006
luks joined the channel
2006-10-03 27634, 2006
sidd joined the channel
2006-10-03 27638, 2006
csp joined the channel
2006-10-03 27650, 2006
csp
Am I supposed to leave in spelling mistakes in track titles?
2006-10-03 27613, 2006
FauxFaux
I believe it's quoted as the artists' choice, check the covers?
2006-10-03 27601, 2006
csp
I do have the original CD, but I'm unsure if it's the artist (Jim Croce) choice to name it "It does'nt have to be that way"
2006-10-03 27622, 2006
csp
(while I'm not a native english speaker, I believe it should be "doesn't")
2006-10-03 27640, 2006
FauxFaux
Yeah, that looks horrible. :p
2006-10-03 27627, 2006
csp
I guess it's a stupid compilation.. but it is acually printed that way
2006-10-03 27638, 2006
csp
I think I'll try to make it correct english. Other compilations also use the correct title
2006-10-03 27628, 2006
Synchro joined the channel
2006-10-03 27614, 2006
Rondom joined the channel
2006-10-03 27631, 2006
deadchip
i think it should become a separately listed track with the annotation of being an extraordinary curiosity
2006-10-03 27652, 2006
csp
I could add an annotation to the release mentioning the error, but should the title of the track be the original or the corrected one then?
2006-10-03 27626, 2006
yalaforge
if other releases use the correct title, then it's no artist intent and it should be corrected, IMHO
2006-10-03 27638, 2006
intgr
Freso: Ping?
2006-10-03 27615, 2006
Freso
intgr: Pong!
2006-10-03 27636, 2006
HairMetalAddict
Album cover mistakes are regularly corrected. We want correct info, not the cover designer's inability to spell...
2006-10-03 27650, 2006
intgr
I'm at work (= on Windows) and I'd like to bump Picard to 0.7.1 in my overlay. Could you test it for me?
Now this is odd. I could've sworn I saw a 0.7.1 release announcement on the blog this morning... Or was I still dreaming?
2006-10-03 27624, 2006
yalaforge
it was in mb-users, I think
2006-10-03 27601, 2006
yalaforge
the reason why it's not yet on the ftp server could be a permission problem
2006-10-03 27612, 2006
yalaforge doesn't have upload privs for his packages either
2006-10-03 27636, 2006
luks
actually, no. uploading picard to ftp.mb.org was a temporary solution
2006-10-03 27607, 2006
luks
i'm not sure if i should upload it also there
2006-10-03 27635, 2006
intgr
Helixcommunity sucks.
2006-10-03 27646, 2006
luks
yeah, but it has nice stats :)
2006-10-03 27656, 2006
luks
wait a sec, i'll upload it
2006-10-03 27601, 2006
intgr
Thanks. :)
2006-10-03 27643, 2006
intgr
I wouldn't mind Helixcommunity if one wouldn't have to update that download ID with every new release.
2006-10-03 27618, 2006
deadchip
csp: no it was just a joke
2006-10-03 27642, 2006
deadchip
csp: i wouldn't bother with anything, except maybe making a note or something that on the printed cover of this particular CD the title was spelled wrong
2006-10-03 27611, 2006
deadchip
csp: so people who own the exact same CD are not confused (they'll probably think that it's just a typo as well but this will just let them know they're not the only onse)
2006-10-03 27612, 2006
deadchip
ones*
2006-10-03 27613, 2006
csp
deadchip: sorry the joke didn't work. I'm leaving it corrected now. The collection (out since 1989) isn't popular anyway, so why bother
in that screenshot it matched the titles using LD and some additional code of course
2006-10-03 27601, 2006
intgr
Are you using standard weights?
2006-10-03 27649, 2006
deadchip
i don't know much about the specific LD calculation implementation i've used
2006-10-03 27654, 2006
deadchip
i'd have to check heh
2006-10-03 27635, 2006
intgr
Differences in punctation and higher/lower case should probably have lower weights.
2006-10-03 27603, 2006
intgr
But what kind of matching algorithm does Picard currently use? I thought it was Levenshtein.
2006-10-03 27623, 2006
luks
levenshtein on normalized strings
2006-10-03 27658, 2006
deadchip
oh i didn't know you have it already in
2006-10-03 27639, 2006
intgr
Freso: So, would you be OK with testing 0.7.1?
2006-10-03 27646, 2006
deadchip
but either way, i've tried to do various preprocessing to the strings before calculating the ld and it turned out leaving them like they are seems to be best
2006-10-03 27655, 2006
deadchip
i'm not even normalizing them in any way (lowercasing or anything)
2006-10-03 27648, 2006
Freso
intgr: Sure. :)
2006-10-03 27656, 2006
Rondom has quit
2006-10-03 27609, 2006
Freso
intgr: But please prepend all messages directed to me with my nick so that my highlight goes off. :)
2006-10-03 27617, 2006
intgr
Freso: Ah, ok. :)
2006-10-03 27621, 2006
intgr
Freso: I suppose you can manage the version-bumping yourself?
2006-10-03 27655, 2006
Freso
intgr: Unmasking/-keywording or renaming the ebuild?
2006-10-03 27619, 2006
Freso
The former? Sure. The latter? Sure - but I'd prefer you to do it. :)