in #musicbrainz

20:49 PM
reosarevok

(maaany)
20:50 PM
djce joined the channel
20:50 PM
But honestly, I get most of my music in streaming lately
20:50 PM
kepstin-laptop

yeah, I have some automatic scripts to sync my laptop with my main music collection, grabbing the mp3/vorbis version of stuff I have in flac.
20:51 PM
reosarevok

what
20:51 PM
hell
20:51 PM
http://musicbrainz.org/work/45afb3b2-18ac-4187-...
20:51 PM
Any idea why I get a square instead of the ä in the wiki link?
20:51 PM
kepstin-laptop

I get some korean character
20:51 PM
nikki

it's a korean character
20:51 PM
reosarevok

wtf
20:51 PM
kepstin-laptop

looks like an encoding issue to me.
20:51 PM
nikki

yeah
20:52 PM
reosarevok

Can anyone edit it?
20:52 PM
It looks fine to me in the actual DE wiki...
20:52 PM
nikki

the url is fine
20:52 PM
kepstin-laptop

it's just a display issue.
20:52 PM
nikki

the the decoding bit in the server
20:52 PM
reosarevok

So
20:52 PM
Is it a bug?
20:52 PM
kepstin-laptop

reosarevok: yes
20:52 PM
reosarevok

Could one you write the ticket?
20:52 PM
You clearly know what to say better than I do
20:53 PM
kepstin-laptop

probably related to http://tickets.musicbrainz.org/browse/MBS-2911 which is in this release
20:53 PM
ocharles

Hmm, that's odd
20:53 PM
I guess the display of them isn't using the correct URL escaping, still
20:53 PM
please reopen 2911
20:54 PM
nikki

it doesn't like ä it seems :P http://musicbrainz.org/url/6e3c1cc3-1d52-4b86-9...
20:54 PM
Ä worked fine though
20:56 PM
kepstin-laptop

the character that we're seing there is U+CC44, which is encoded in UTF-8 as 0xEC 0xB1 0x84.
20:56 PM
ocharles

does anyone know what encoding http://de.wikipedia.org/wiki/De_R%C3%A4uber is?
20:56 PM
nikki

what do you mean?
20:56 PM
kepstin-laptop

ocharles: looks like UTF-8
20:56 PM
ocharles

so it's not utf-8?
20:56 PM
hmm
20:56 PM
At the moment we're using "detect" to guess encoding on urls, maybe we should try utf-8 first always, and if it fails, try and guess
20:56 PM
kepstin-laptop

yeah, ä is 0xC3 0xA4 in UTF-8
20:56 PM
nikki

I would just use utf-8 first and if it fails, leave it encoded
20:57 PM
ocharles

nikki: the problem is then we can't display a pretty wikipedia name
20:57 PM
nikki

why not?
20:57 PM
wikipedia uses utf-8
20:57 PM
kepstin-laptop

nikki: some of the wikipedias don't.
20:57 PM
ocharles

nikki: the examples in that ticket isn't utf-8
20:57 PM
nor is that de. url above
20:57 PM
oh, sorry
20:57 PM
that one is
20:57 PM
but the one in the ticket is latin-1
20:58 PM
nikki

actually they do. that example redirects
20:58 PM
I only left it unedited so that I had an example I could find
20:58 PM
ocharles

so we'd have to resolve the URL to display it, which is also not really an option
20:58 PM
here's my real suggestion
20:59 PM
kepstin-laptop

ocharles: you could leave them unencoded, until someone comes along and fixes them to use the correct UTF-8 url.
20:59 PM
nikki

kepstin-laptop: exactly
20:59 PM
ocharles

Add/editing URLs should only allow utf-8 encoding. If it's not utf-8, we present a user with a list of how it would look in various encodings, and they can correct it
20:59 PM
nikki

that's what we did pre-ngs and it worked just fine
20:59 PM
ocharles

In the database, we need to find a list of URLs that aren't utf-8, and just clean them up
20:59 PM
kepstin-laptop

ocharles: presumably there are some legacy sites that don't use UTF-8 in urls tho; you can't just convert them, you'll get 404s.
20:59 PM
ocharles

and we need to ensure that we *only* store utf-8 encoding in the database
21:00 PM
nikki

and when a site only accepts non-utf-8? we tell the user they can't add it because we can't implement somehting we used to have?
21:00 PM
ocharles

ok, then we need to store the encoding
21:00 PM
nikki

I don't know why you're making it so bloody complicated
21:00 PM
ocharles

getting emotional isn't going to help...
21:01 PM
nikki

no, it isn't
21:01 PM
ocharles

i'm not making it complicated, I'm making it correct
21:01 PM
ocharles shrugs
21:01 PM
kepstin-laptop

well, character sets are hard to detect, and requiring a user to manually select one would be quite a pain.
21:01 PM
ocharles

kepstin-laptop: I was going to hide that complexity to the user though
21:02 PM
instead of saying "is this utf-8 or latin-1" just say "which of these looks correct?"
21:02 PM
<option value="encoding scheme">[ url in that encoding ]</option>
21:02 PM
kepstin-laptop

ocharles: so, an imcomplete list? what do you pick if none are correct?
21:02 PM
ijabz joined the channel
21:02 PM
ocharles

kepstin-laptop: how would that be the case?
21:02 PM
kepstin-laptop

ocharles: there are a lot of character sets.
21:03 PM
ocharles

right
21:03 PM
kepstin-laptop

so either a very long list, or an incomplete list :/
21:04 PM
ocharles

well, the list would only display stuff that successful decodes from the bytes in the url to text
21:04 PM
kepstin-laptop

for something that honestly really doesn't matter that much.
21:04 PM
ocharles

it matters if we want to display them human readable
21:04 PM
(in the wikipedia case)
21:04 PM
kepstin-laptop

right now, all the sites where you use a human-readable version take UTF-8.
21:05 PM
and the random urls to things like blogs, etc. don't have a human-readable name - none would make sense, so the character encoding doesn't matter.
21:05 PM
ocharles

so what was nikki trying to suggest? if it doesn't decode, then just display the URL and nothing else?
21:05 PM
kepstin-laptop

yeah.
21:05 PM
ocharles

that's all I needed to hear :)
21:05 PM
ocharles reopens
21:07 PM
reosarevok

kepstin-laptop, does http://imslp.org/wiki/Matth%C3%A4uspassion,_BWV... take UTF-8 too?
21:07 PM
reosarevok would like those to be made like the wikipedia ones
21:07 PM
kepstin-laptop

reosarevok: that is utf-8
21:08 PM
reosarevok

Ok
21:08 PM
ocharles, can you do that? :p
21:08 PM
ocharles

reosarevok: see http://tickets.musicbrainz.org/browse/MBS-2911
21:08 PM
kepstin-laptop

if you have a two-byte sequence where the first byte is %C3 and the second is %80 or higher, it is almost certainly UTF-8.
21:08 PM
Leftmost joined the channel
21:08 PM
anyways, I'm heading off, later :)
21:09 PM
reosarevok

ocharles, I meant making imslp links show like the wiki ones
21:09 PM
ocharles

oh right
21:09 PM
reosarevok

imslp: Title
21:09 PM
ocharles

kepstin-laptop: ta for the help
21:09 PM
reosarevok

(they're all wiki so it should be easy)
21:09 PM
ocharles

reosarevok: you'll need to open an improvement request for that, but it should be fine
21:09 PM
reosarevok

Ok :)
21:12 PM
ijabz joined the channel
21:22 PM
xplt joined the channel
21:24 PM
the_metalgamer joined the channel
21:25 PM
CatCat

ಠ_ಠ
21:25 PM
¬_¬
21:26 PM
..
21:26 PM
SPLIT ARTIST IS WHATS?
21:26 PM
OMG
21:27 PM
godamn i am supposed ot go to bed
21:27 PM
reosarevok

:p
21:27 PM
Start tomorrow!
21:27 PM
reosarevok is not splitting more today so he won't affect you
21:28 PM
ocharles

CatCat: ^_^
21:29 PM
reosarevok

CatCat, remember to deactivate the autoedit option if you're not 100% sure for these though
21:29 PM
(much harder to rejoin later in case of mistake :) )
21:30 PM
ocharles

yea
21:30 PM
reosarevok: I was wondering if it should be an auto edit
21:30 PM
reosarevok

nikki says no
21:30 PM
She might be right
21:31 PM
But let's wait this one week before changing it :p
21:31 PM
ocharles

yea, I think we could have non-auto-edit with a low vote count
21:31 PM
reosarevok

Hmm, that's an option
21:31 PM
ocharles

or just normal voting really
21:31 PM
reosarevok

Yeah, dunno
21:31 PM
I wouldn't mind normal voting
21:32 PM
But I would like that more if people actually voted on my edits more oftne
21:32 PM
*often
21:32 PM
I mean, I have problems bringing my queue to 0 and I have 20 subscribers
21:32 PM
I can't even start to imagine how bad it must be to others
21:34 PM
Elliot joined the channel
21:36 PM
PasNox joined the channel
21:41 PM
ianmcorvidae joined the channel
21:42 PM
kyan

Hello! Would you mind renaming my user account to "aktkele"? http://ephemurl.com/2d/otu Also, please erase http://ephemurl.com/2d/otw and the reference at http://ephemurl.com/7d/otx (erase the username from them). Thanks! :-)
21:43 PM
ocharles

kyan: you'll need to email support [at musicbrainz.org] for that to happen
21:43 PM
kyan

ocharles: Okay, I'll do that. Thanks.
21:44 PM
murdos

reosarevok: I've the same problem, even if I'm one of the editors with the most subscribers (58 currently)
21:44 PM
drsaunde

DOH
21:44 PM
i have to try harder, i think i used to have the most :-)
21:45 PM
reosarevok

drsaunde, you probably need to try LESS hard
21:45 PM
reosarevok imagines some ran away at the edit amount
21:47 PM
murdos

the new behavior of "open edits" view for an editor doesn't help (it now display edits you've already voted for)
21:49 PM
reosarevok

hah
21:50 PM
ocharles, so the modbot remove relationships can be voted on
21:50 PM
Including the ones for my own splits
21:50 PM
Nice :p
21:50 PM
ocharles

hum
21:51 PM
reosarevok

The ones for collabs I mean
21:51 PM
I guess the rest still can't
21:51 PM
ocharles

yea