in #metabrainz

22:36 PM
Leftmost

The developers manually fetch translations from Transifex before making a release.
22:38 PM
beqn_ has quit
22:39 PM
LordSputnik

Leftmost: but dimensions aren't strings :(
22:39 PM
bitmap very much likes the idea of an automated, weekly pull request for mbserver, though
22:39 PM
opatel99

Mineo: https://github.com/opatel99/picard/commit/69999...
22:39 PM
alastairp has quit
22:40 PM
Tecfan_ has quit
22:40 PM
All those changes in the .pot file has me worried I did something wrong.
22:41 PM
kepstin

opatel99: it's mostly just line number changes, other than the new stuff you added. That's normal
22:42 PM
looks like someone added a new string before you but forgot to regenerate the pot file :)
22:42 PM
For "Key"
22:43 PM
opatel99

Oh so that shifted everything...
22:43 PM
kepstin

nah, the line number shifts are just do to various other changes since the last time the file was regenerated. Don't worry about them
22:49 PM
Leftmost

LordSputnik, they're not integers either. They're complex data types that libraries store in a semi-standardized format.
22:50 PM
LordSputnik

Which libraries and what standard?
22:54 PM
Leftmost

The standard I know about is apparently from the Anglo-American Cataloguing Rules, Second Edition used in the US, Canada and UK, superseded in 2010 by a similar format from the Resource Description and Access standard, which is in use in the US. (Not sure about elsewhere.)
22:55 PM
kepstin should go to his library and check out a copy of this book ;)
22:55 PM
LordSputnik

OK, but surely storing as a string opens up all kinds of problems?
22:56 PM
And takes up more space
22:56 PM
kepstin

it would be nice to have machine-readable dimensions; would it be possible to parse/generate the format in question, or is it too variable?
22:56 PM
LordSputnik

And is harder for sorting
22:56 PM
Leftmost

I'm not suggesting we make full use of that format, just that we allow leeway in how things are entered. For one thing, the semantics of our storage is unclear. We don't specify units or a standard means for determining number of pages, which both AACR2 and RDA do.
22:56 PM
LordSputnik

We specify units
22:56 PM
mm for the dimensions and g for the weight
22:57 PM
kepstin

and no matter our storage units, someone should be able to select display units
22:57 PM
Leftmost

Okay. Well, libraries typically measure books in cm but sometimes in mm.
22:58 PM
LordSputnik

Unless we get a ton of complaints I don't really want to change storing whole numbers as integers :P
22:58 PM
Leftmost

Pagination is also something libraries care about.
22:59 PM
That'd be a separate field anyhow, I guess.
22:59 PM
I'm temporarily convinced while I do more research. :-P
22:59 PM
kepstin suspects that most librarians don't go measuring books with micrometers, so mm is probably sufficient resolution... but it wouldn't hurt to have a little more precision just in case ;)
23:00 PM
chirlu`

The Book (disambiguation: 234.56 mm width edition)
23:01 PM
LordSputnik

At greater precisions than mm, the width of the book depends on the amount of pressure you apply :P
23:02 PM
Leftmost: so I'm thinking about OAuth with MB
23:02 PM
Do we want to get back the user's username or email address for performing lookups in our own editor table?
23:02 PM
(or both?)
23:03 PM
I guess an ID would be better
23:03 PM
kepstin

might be nice to have enough precision to store reasonable representations of up to maybe 1/16 inch in mm, which you could probably do with an extra 2 decimal places or so.
23:03 PM
Leftmost

For now, I suggest we stick with what we have until MB actually has an SSO plan.
23:04 PM
LordSputnik

Leftmost: We do have one in the short term
23:04 PM
Gentlecat is doing the same thing for CB
23:04 PM
(afaik)
23:04 PM
Gentlecat

what am I doing? :)
23:06 PM
LordSputnik

Gentlecat: the OAuth stuff we were talking about the other day, when you were asking about whether we decided upon anything at the summit
23:06 PM
Leftmost

We've got a _lot_ of churn in our code right now and I'd like to hit the target I'm looking at (integrating the reworked schema and data package into -site and -ws) and move it after.
23:06 PM
Gentlecat

I don't remember how email address is returned from oauth endpoint
23:07 PM
if it's always verified or not, etc.
23:07 PM
zas

opatel99: don't bother with potfile in Picard, make your PR marking strings to be translated as usual. I'll regenerate and commit the potfile at some point, it will be retrieved by transifex few hours later and strings to be translated will be available for translation
23:07 PM
LordSputnik

Leftmost: OK, good point, let's leave users alone for now
23:07 PM
Gentlecat

besides, there's an issue of keeping everything in sync, which is annoying
23:08 PM
opatel99

zas: Oops... Too late for the current, but will keep that in mind for next time
23:08 PM
Leftmost

On the plus side, I'm pretty satisfied with the schema diagram we have right now and I think we're in a position to do a soft freeze on it once we get it implemented, if that sounds reasonable.
23:09 PM
zas

I will also resync translations in Picard sources, before a release or on regular basis if, as usual, we don't release often enough.
23:09 PM
LordSputnik

Leftmost: I'm going to change password to a CHAR(64)
23:09 PM
opatel99

So no need to worry about it in the future. zas
23:09 PM
Tecfan_ joined the channel
23:10 PM
LordSputnik

Leftmost: Oh, no, CHAR(60) :P
23:10 PM
Leftmost

Oh, good call.
23:11 PM
LordSputnik

Haha, I don't know how editions got a gender ID :P
23:12 PM
zas

Yes. Pot file update has to be done in master branch rather than in pull requests. Else we'll get unneeded conflicts.
23:12 PM
opatel99

Should I delete the branch and resubmit, or is that ok?
23:13 PM
chirlu`

Gentlecat: MB doesn’t even store unverified addresses. :)
23:13 PM
beqn_ joined the channel
23:13 PM
D4RK-PH0ENiX has quit
23:13 PM
alastairp joined the channel
23:13 PM
Gentlecat

well, shows how much I know about it :)
23:13 PM
opatel99

zas:
23:15 PM
opatel99 has quit
23:16 PM
zas

opatel99: in your string it is preferable when possible to keep the same accel key here I would keep the & before the P
23:16 PM
Leftmost

LordSputnik, can we do away with using Handlebars templates stored in the database somehow?
23:17 PM
LordSputnik

Leftmost: Whats the matter with that?
23:17 PM
zas

And you can just drop the Picard.pot changes
23:17 PM
LordSputnik

We need some way of templating the relationship string, and Handlebars is pretty lightweight
23:17 PM
Leftmost

Fair enough. It just seems... dirty to me.
23:18 PM
LordSputnik

If we stored something like "_s_ authored _t_" instead, it's just a different templating format
23:18 PM
zas

I guess we can start to think about a Picard release after the GCI
23:19 PM
LordSputnik

Leftmost: At least with the new schema the handlebars parameters become much cleaner
23:21 PM
Bookzombie

anniezhou301 closed pull request "Login and Register pages converted into react.js" without merge (https://github.com/bookbrainz/bookbrainz-site/p...)
23:21 PM
LordSputnik pushed 1 commits to bookbrainz-sql: https://github.com/bookbrainz/bookbrainz-sql/co...
23:23 PM
LordSputnik

Leftmost: should we shut down Bookzombie?
23:23 PM
Leo_Verto: ^ same question
23:24 PM
Leo_Verto

D:
23:25 PM
Because it's too spammy?
23:27 PM
Leftmost

Maybe we need a #metabrainz-botspam channel.
23:27 PM
chirlu` finds a mistake in the business-relations blog post.
23:28 PM
LordSputnik

Leo_Verto: partly, yeah, but also because we could just use gitter to notify each other about commits
23:28 PM
chirlu`

“hopefully Christina will allows us” -s
23:30 PM
LordSputnik

darwin: you might have an opinion on this: Is there any advantage to using NULL to represent an empty string in a TEXT field in postgres to just storing an empty string?
23:30 PM
Leo_Verto

Um
23:30 PM
D4RK-PH0ENiX joined the channel
23:31 PM
Am I using Gitter wrong? There's only Freso and me in the metabrainz channel and the only message is from me and 2 months old
23:31 PM
chirlu`

LordSputnik: Semantically, NULL means “unknown value”.
23:31 PM
Empty string means “known empty”.
23:31 PM
Leo_Verto

Also I thought keeping all communication transparent and in one place was one of the recent goals
23:32 PM
LordSputnik

chirlu`: I know that
23:33 PM
chirlu`

E.g. in MB, barcode = NULL means we don’t know, barcode = "" means someone ticked the “this release has no barcode” checkbox.
23:34 PM
LordSputnik: Well, then, the answer is “Yes, it is advantageous if the value is unknown rather than empty”.
23:34 PM
LordSputnik

Leo_Verto: true, but bot notifications aren't necessarily communication we want to keep around
23:34 PM
chirlu`: I'm asking from a storage/performance point of view
23:34 PM
I'd guess that NULL is faster because it is stored in-table, while TEXT is stored externally, right?
23:34 PM
chirlu` is answering from a “Premature optimization is the root of all evil” point of view.
23:35 PM
chirlu`: you're right, that's probably a better point of view
23:35 PM
Leo_Verto

Mhm, we could work around this by either having Bookzombie prefix messages with [off] or BrainzBot ignoring all it's messages, but I can see your point
23:36 PM
Bookzombie

leftmostcat pushed 1 commits to bookbrainz-sql: https://github.com/bookbrainz/bookbrainz-sql/co...
23:36 PM
Leftmost

LordSputnik, ^ look okay to you?
23:36 PM
Leo_Verto

Either way, I say !m must stay!
23:37 PM
LordSputnik

Leftmost: Just a sec, also - do you think we need to differentiate between "unknown bio" and "empty bio"? If not, I'll make bio NOT NULL
23:37 PM
Leftmost

My stance is that commit bots and build bots are useful, but in a high-traffic channel can interfere with discussions (particularly for users new to IRC, which may be relevant during GCI.
23:37 PM
LordSputnik

Leo_Verto: !m is BrainzBot though!
23:38 PM
Leftmost

Is bio not already NOT NULL?
23:38 PM
LordSputnik

Nope
23:38 PM
I'm going through checking NULLs atm
23:38 PM
Leftmost

Oh, wait. Why should it be NOT NULL?
23:38 PM
Leo_Verto

Oh, I guess that's solved then :P
23:38 PM
LordSputnik

Leftmost: because I don't think there's a need to distinguish between "unknown bio" and "empty bio"
23:39 PM
Leftmost

Sure, but why store "empty" at all, in that case?
23:40 PM
LordSputnik

Leftmost: well, we have to have the field, unless we have a editor_bio table
23:40 PM
(it's currently at editor.bio)
23:41 PM
Leftmost

I'm just very confused as to why we'd want to store '' instead of NULL, rather than the other way around.
23:42 PM
LordSputnik

Leftmost: because NULL is meaningless here :P
23:43 PM
Leftmost

I'd argue that '' is meaningless here. :-P
23:44 PM
LordSputnik

If the editor fills out their bio, it's a non-empty string. If the editor deletes the filled out bio, it's an empty string (''). If the editor leaves it blank, it's NULL. I don't think we need to distinguish between the latter two
23:44 PM
darwin

LordSputnik: I don't know the answer in postgres, in mysql it barely matters.
23:45 PM
Leftmost

I dunno. It just seems an odd way to store it. To me, NOT NULL hints that we really want something in that field, but I dunno. I agree we don't need to distinguish, but I'd vote for instead: unset = NULL, set = 'blah', deleted = NULL.
23:45 PM
If that sounds wrong to you, go ahead and go NOT NULL.
23:46 PM
darwin

in my world, things which always should be set are NOT NULL
23:46 PM
things which may not always be set may be NULLable, especially if NULL has a different meaning from "0" or "empty string"
23:46 PM
but NULL complicates queries and does not generally save space on disk...
23:47 PM
Leftmost

Yeah, I tend to see NOT NULL as indicating things should always be set.
23:47 PM
chirlu`

You should also consider that the ternary logic is often unintuitive and will lead to bugs. E.g., most people would think that WHERE editor.bio LIKE '%foo%' OR NOT (editor.bio LIKE '%foo%') is going to match every row.
23:48 PM
Techtronix joined the channel
23:48 PM
darwin

chirlu`: ("NULL complicates queries")
23:48 PM
Leftmost

chirlu`, in the instance where it's NULLable?
23:48 PM
chirlu`

Yes.
23:48 PM
Leftmost

Fair enough.
23:50 PM
chirlu`

Example for a bug that bit MB: https://musicbrainz.org/edit/29424653
23:51 PM
The name changed from '' meaning no name to NULL meaning no name.
23:51 PM
Bookzombie

LordSputnik pushed 1 commits to bookbrainz-sql: https://github.com/bookbrainz/bookbrainz-sql/co...
23:55 PM
djpretzel has quit
23:55 PM
LordSputnik

Leftmost: I'm wondering if we can choose a better field name than "is_primary"/"primary" for that alis field
23:57 PM
Leftmost

is_primary_for_locale? :-P
23:58 PM
opatel99 joined the channel
23:59 PM
chirlu`

MB: primary_for_locale