and the first graphic there suggests what our flow should be, I suppose.
so, asking in apache-spark for recommendations on how to set up a brand new cluster for our needs would be really good.
and, I am thinking that I want to postpone further policy work until October -- in September I want to actually do technical work and move this stuff along.
maybe we can do a sprint-like thing and move LB along a few notches in September.
iliekcomputers
yes, please.
I'll ask on the spark channel and see what they say.
ruaok goes to lurk in their channel
ok
About AB, I was planning to create a full dump using `manage.py`, import it into frank, then during downtime create an incremental dump and import it. There are a few private tables (user, api_key, etc.) which are small enough to dump manually during downtime, I would guess.
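(An aside on the plan above: a rough sketch of those migration steps in Python. The `manage.py` subcommand names and flags here are placeholders and not the actual AB CLI; the pg_dump/psql line is just one way the private tables could be copied by hand.)

```python
import subprocess

# Placeholder sketch of the migration plan described above. The manage.py
# subcommands are hypothetical stand-ins; "frank" is the target host from
# the conversation.

def run(cmd):
    """Run a shell command, raising if it exits non-zero."""
    subprocess.run(cmd, shell=True, check=True)

# 1. Full dump on the current host, imported into frank ahead of the downtime.
run("python manage.py dump full /tmp/ab-full")                # hypothetical command
run("ssh frank 'python manage.py import_dump /tmp/ab-full'")  # hypothetical command

# 2. During downtime: incremental dump of everything written since the full
#    dump, imported on top of it.
run("python manage.py dump incremental /tmp/ab-incr")         # hypothetical command
run("ssh frank 'python manage.py import_dump /tmp/ab-incr'")  # hypothetical command

# 3. Private tables (user, api_key, ...) are small, so copy them by hand.
run("pg_dump -t user -t api_key acousticbrainz | ssh frank 'psql acousticbrainz'")
```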
demonimin has quit
But the thing is that full dumps don't seem to add entries into the incremental dump table at all. I'm not sure if that is intentional or not.
seems like incremental dumps are supposed to be a different series (1, 2, 3)?
I ran an incremental dump on spike a few days ago, but it got stuck with no logs. So I wanted to get some context before jumping into the dumps code again.
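(A guess at what is going on with the two series: if the incremental dump ids come from their own table and the full-dump code path never inserts a row into it, the behaviour described above is expected. The table and column names below are assumptions based on the conversation, not read from the AB schema.)

```python
import psycopg2

def next_incremental_dump_id(conn):
    """Incremental dumps take ids 1, 2, 3, ... from their own table, so they
    form a separate series; if the full-dump path never inserts into this
    table, full dumps never appear in (or advance) that series."""
    with conn.cursor() as cur:
        # Table/column names are assumed for illustration.
        cur.execute("SELECT COALESCE(MAX(id), 0) + 1 FROM incremental_dumps")
        return cur.fetchone()[0]

conn = psycopg2.connect(dbname="acousticbrainz")  # connection details assumed
print(next_incremental_dump_id(conn))
```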
ruaok
let me look at the AB schema
demonimin joined the channel
what is the dump naming strategy used right now? incremental dump strategy?
iliekcomputers
there's the full dump and json dumps.
and then there's acousticbrainz-incr-1, acousticbrainz-incr-2
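(For illustration, the incremental series appears to be just a counter appended to a fixed prefix, separate from the full and json dumps; the exact format below is an assumption based on the names mentioned, not taken from the AB dump code.)

```python
def incremental_dump_name(dump_id: int) -> str:
    # e.g. acousticbrainz-incr-1, acousticbrainz-incr-2, ...
    return f"acousticbrainz-incr-{dump_id}"

assert incremental_dump_name(2) == "acousticbrainz-incr-2"
```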