#metabrainz

17:05 PM
iliekcomputers

https://spark.apache.org/docs/2.2.0/streaming-pro… ?

2018-08-09 22110, 2018

17:06 PM
ruaok

yes.

2018-08-09 22121, 2018

17:06 PM
ruaok

and the first graphic there suggests what our flow should be, I suppose.

2018-08-09 22153, 2018

17:06 PM
ruaok

so, asking in apache-spark for recommendations on how to set up a brand new cluster for our needs would be really good.

2018-08-09 22136, 2018

17:07 PM
ruaok

and, I am thinking that I want to postpone further policy work until October -- in sept I want to actually do technical work and move this stuff along.

2018-08-09 22157, 2018

17:07 PM
ruaok

maybe we can do a sprint-like thing and move LB along a few notches in sept.

2018-08-09 22115, 2018

17:09 PM
iliekcomputers

yes, please.

2018-08-09 22150, 2018

17:09 PM
iliekcomputers

I'll ask on the spark channel and see what they say.

2018-08-09 22121, 2018

17:11 PM
ruaok goes to lurk in their channel

2018-08-09 22127, 2018

17:16 PM
iliekcomputers

ok

2018-08-09 22131, 2018

17:18 PM
iliekcomputers

About AB, I was planning to create a full dump using `manage.py`, import it into frank, then during downtime create an incremental dump and import it. There are a few private tables (user, api_key, etc.) which are small enough to dump manually during downtime, I would guess.

2018-08-09 22154, 2018

17:18 PM
demonimin has quit

2018-08-09 22159, 2018

17:18 PM
iliekcomputers

But the thing is that full dumps don't seem to add entries into the incremental dump table at all. I'm not sure if that is intentional or not.

2018-08-09 22144, 2018

17:19 PM
iliekcomputers

seems like incremental dumps are supposed to be a different series (1, 2, 3)?

2018-08-09 22117, 2018

17:21 PM
iliekcomputers

I ran an incremental dump on spike a few days ago, but it got stuck with no logs. So I wanted to get some context before jumping into the dumps code again.

2018-08-09 22157, 2018

17:22 PM
ruaok

let me look at the AB schema

2018-08-09 22138, 2018

17:24 PM
demonimin joined the channel

2018-08-09 22109, 2018

17:26 PM
ruaok

what is the dump naming strategy used right now? incremental dump strategy?

2018-08-09 22158, 2018

17:26 PM
iliekcomputers

there's the full dump and json dumps.

2018-08-09 22122, 2018

17:27 PM
iliekcomputers

and then there's acousticbrainz-incr-1, acousticbrainz-incr-2

2018-08-09 22155, 2018

17:27 PM
ruaok

full dumps use dates in filenames?

2018-08-09 22108, 2018

17:28 PM
ruaok

the incremental serial number is kinda odd.

2018-08-09 22123, 2018

17:28 PM
iliekcomputers

http://acousticbrainz.org/download

2018-08-09 22132, 2018

17:28 PM
iliekcomputers

acousticbrainz-lowlevel-json-20150129.tar.bz2

2018-08-09 22158, 2018

17:28 PM
iliekcomputers

wait, copied the wrong filename, meant to paste this: acousticbrainz-dump-20150129-180408.tar.xz

2018-08-09 22121, 2018

17:34 PM
ruaok

if we name dumps acousticbrainz-dump-inc-20180809-180408.tar.xz ...

2018-08-09 22136, 2018

17:34 PM
ruaok

then it could work -- name by timestamp, not by serial number.

2018-08-09 22100, 2018

17:35 PM
ruaok

in a way I quite like that since you can see what is going on by inspection.

2018-08-09 22112, 2018

17:35 PM
ruaok

but it is hard to tell if you're missing a dump.
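
[Editor's sketch] The trade-off ruaok describes can be illustrated in a few lines of Python. This is only a rough sketch, not AcousticBrainz code: the function names, the regular expression, and the idea of a fixed maximum interval between dumps are all assumptions made for illustration. Only the filename `acousticbrainz-dump-inc-20180809-180408.tar.xz` comes from the discussion. With timestamps the names sort and read well, but a missing dump can only be guessed at by comparing gaps against an expected schedule.

```python
import re
from datetime import datetime, timedelta

# Assumed pattern, derived from the filename proposed in the chat.
INCR_RE = re.compile(r"^acousticbrainz-dump-inc-(\d{8}-\d{6})\.tar\.xz$")
STAMP_FORMAT = "%Y%m%d-%H%M%S"


def incremental_dump_name(created):
    """Build a timestamp-based incremental dump filename (hypothetical helper)."""
    return "acousticbrainz-dump-inc-{}.tar.xz".format(created.strftime(STAMP_FORMAT))


def parse_incremental_dump_name(filename):
    """Return the datetime encoded in the filename, or None if it does not match."""
    match = INCR_RE.match(filename)
    if match is None:
        return None
    try:
        return datetime.strptime(match.group(1), STAMP_FORMAT)
    except ValueError:
        # Digits matched the pattern but do not form a valid date/time.
        return None


def find_gaps(filenames, max_interval):
    """List (earlier, later) pairs of consecutive dumps further apart than max_interval.

    Unlike a serial number, a timestamp cannot prove a dump is missing;
    this only flags suspicious gaps under an assumed dump schedule.
    """
    stamps = sorted(
        stamp
        for stamp in (parse_incremental_dump_name(name) for name in filenames)
        if stamp is not None
    )
    return [(a, b) for a, b in zip(stamps, stamps[1:]) if b - a > max_interval]
```

For example, `find_gaps(names, timedelta(days=1))` would flag any pair of consecutive incremental dumps more than a day apart, assuming dumps are meant to be produced daily.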

2018-08-09 22130, 2018

17:35 PM
demonimin has quit

2018-08-09 22153, 2018

17:42 PM
discopatrick has quit

2018-08-09 22158, 2018

18:41 PM
rxy_ joined the channel

2018-08-09 22158, 2018

18:41 PM
rxy_ has quit

2018-08-09 22136, 2018

18:43 PM
Checking joined the channel

2018-08-09 22139, 2018

18:43 PM
Checking has quit

2018-08-09 22112, 2018

18:50 PM
w3stside18 joined the channel

2018-08-09 22113, 2018

18:50 PM
w3stside18 has quit

2018-08-09 22124, 2018

18:56 PM
Mr_Monke_ has quit

2018-08-09 22129, 2018

19:14 PM
clonak7 joined the channel

2018-08-09 22129, 2018

19:14 PM
clonak7 has quit

2018-08-09 22100, 2018

19:25 PM
Mr_Monkey joined the channel

2018-08-09 22100, 2018

19:29 PM
github joined the channel

2018-08-09 22100, 2018

19:29 PM
github has left the channel

2018-08-09 22138, 2018

19:33 PM
travis-ci joined the channel

2018-08-09 22139, 2018

19:33 PM
travis-ci

metabrainz/picard#3537 (master - 2413504 : Laurent Monin): The build passed.

2018-08-09 22139, 2018

19:33 PM
travis-ci

Change view : https://github.com/metabrainz/picard/compare/7157…

2018-08-09 22139, 2018

19:33 PM
travis-ci

Build details : https://travis-ci.org/metabrainz/picard/builds/41…

2018-08-09 22139, 2018

19:33 PM
travis-ci has left the channel

2018-08-09 22119, 2018

19:34 PM
alastairp

iliekcomputers: hi, I'm heading out camping for the weekend. I'll be around on Monday/Tuesday. Just reading backlog now

2018-08-09 22123, 2018

19:37 PM
travis-ci joined the channel

2018-08-09 22124, 2018

19:37 PM
travis-ci

metabrainz/picard#3537 (master - 2413504 : Laurent Monin): The build passed.

2018-08-09 22124, 2018

19:37 PM
travis-ci

Change view : https://github.com/metabrainz/picard/compare/7157…

2018-08-09 22124, 2018

19:37 PM
travis-ci

Build details : https://travis-ci.org/metabrainz/picard/builds/41…

2018-08-09 22124, 2018

19:37 PM
travis-ci has left the channel

2018-08-09 22145, 2018

19:41 PM
darxun13 joined the channel

2018-08-09 22146, 2018

19:41 PM
darxun13 has quit

2018-08-09 22127, 2018

19:42 PM
Gazooo has quit

2018-08-09 22139, 2018

19:42 PM
Gazooo joined the channel

2018-08-09 22141, 2018

19:50 PM
Death91612 joined the channel

2018-08-09 22141, 2018

19:50 PM
Death91612 has quit

2018-08-09 22153, 2018

19:54 PM
HeinzBoettjer joined the channel

2018-08-09 22156, 2018

19:54 PM
HeinzBoettjer has quit

2018-08-09 22120, 2018

20:01 PM
rsh7 has quit

2018-08-09 22106, 2018

20:23 PM
Mr_Monkey has quit

2018-08-09 22112, 2018

20:23 PM
Mr_Monke_ joined the channel

2018-08-09 22141, 2018

20:29 PM
rsh7 joined the channel

2018-08-09 22145, 2018

20:34 PM
kartikeyaSh has quit

2018-08-09 22123, 2018

21:07 PM
Mr_Monke_ has quit

2018-08-09 22158, 2018

21:11 PM
mon19 joined the channel

2018-08-09 22158, 2018

21:11 PM
mon19 has quit

2018-08-09 22102, 2018

21:20 PM
Mr_Monkey joined the channel

2018-08-09 22132, 2018

21:24 PM
Mr_Monkey has quit

2018-08-09 22145, 2018

21:24 PM
mal0 joined the channel

2018-08-09 22148, 2018

21:24 PM
mal0 has quit

2018-08-09 22117, 2018

21:25 PM
CatQuest

samj1912: do you know why a search for "klaver" shows piano last https://beta.musicbrainz.org/search?query=klaver&… ? ideally it would be higher/highest up

2018-08-09 22125, 2018

21:25 PM
CatQuest

https://beta.musicbrainz.org/instrument/b3eac5f9-…

2018-08-09 22100, 2018

21:26 PM
CatQuest

some sort of counting of the amount of same and similar aliases, e.g. if it has 3 aliases as "klaver" then count that

2018-08-09 22127, 2018

21:26 PM
CatQuest

it also has 3 "klavier" and 3 "klavir"
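
[Editor's sketch] CatQuest's idea of counting identical aliases can be sketched as below. This is purely illustrative and says nothing about how the MusicBrainz search server actually scores results: the function names and the candidate data are hypothetical, apart from the alias counts for piano mentioned in the chat.

```python
from collections import Counter


def alias_match_count(query, aliases):
    """Count how many aliases equal the query, ignoring case."""
    wanted = query.casefold()
    return Counter(alias.casefold() for alias in aliases)[wanted]


def rank_by_alias_matches(query, candidates):
    """Order candidate names so that those with more matching aliases come first.

    `candidates` maps an entity name to its list of aliases (hypothetical shape).
    """
    return sorted(
        candidates,
        key=lambda name: alias_match_count(query, candidates[name]),
        reverse=True,
    )


# Hypothetical data; only the piano alias counts are taken from the chat.
candidates = {
    "hypothetical instrument A": ["klaver"],
    "piano": ["klaver"] * 3 + ["klavier"] * 3 + ["klavir"] * 3,
    "hypothetical instrument B": [],
}
ranking = rank_by_alias_matches("klaver", candidates)
```

Under this toy scoring, piano would come first for the query "klaver" because three of its aliases match exactly.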

2018-08-09 22130, 2018

21:29 PM
anzuof10 joined the channel

2018-08-09 22112, 2018

21:30 PM
foxcookie joined the channel

2018-08-09 22114, 2018

21:30 PM
foxcookie has quit

2018-08-09 22114, 2018

21:32 PM
myth0d19 joined the channel

2018-08-09 22117, 2018

21:32 PM
myth0d19 has quit

2018-08-09 22138, 2018

21:35 PM
anzuof10 has quit

2018-08-09 22104, 2018

21:40 PM
djwhitey_ joined the channel

2018-08-09 22148, 2018

21:41 PM
djwhitey has quit

2018-08-09 22129, 2018

21:48 PM
djwhitey joined the channel

2018-08-09 22146, 2018

21:49 PM
djwhitey_ has quit

2018-08-09 22140, 2018

22:01 PM
djwhitey has quit

2018-08-09 22131, 2018

22:07 PM
Mr_Monkey joined the channel

2018-08-09 22156, 2018

22:07 PM
Carlos061115 joined the channel

2018-08-09 22156, 2018

22:07 PM
Carlos061115 has quit

2018-08-09 22142, 2018

22:08 PM
Mr_Monkey has quit

2018-08-09 22158, 2018

22:08 PM
Mr_Monkey joined the channel

2018-08-09 22105, 2018

22:49 PM
AC`97_ joined the channel

2018-08-09 22105, 2018

22:49 PM
AC`97_ has quit

2018-08-09 22118, 2018

23:31 PM
rsh7 has quit

2018-08-09 22103, 2018

23:51 PM
grit2 joined the channel

2018-08-09 22124, 2018

23:51 PM
Mr_Monkey has quit

2018-08-09 22152, 2018

23:55 PM
grit2 has quit