#musicbrainz-devel

0:02 AM
CallerNo6

I expect the data to prove conclusively that there's only one Sting song.

2014-10-15 28847, 2014

0:02 AM
ianmcorvidae

haha

2014-10-15 28806, 2014

0:03 AM
CallerNo6

I mean, it's an okay song. Don't get me wrong.

2014-10-15 28859, 2014

0:03 AM
Nyanko-sensei joined the channel

2014-10-15 28818, 2014

0:53 AM
JesseW joined the channel

2014-10-15 28839, 2014

1:08 AM
kepstin-laptop__ joined the channel

2014-10-15 28824, 2014

1:09 AM
kepstin-laptop

so, when running abzsubmit, I occasionally get sqlite database locking errors

2014-10-15 28825, 2014

1:12 AM
kepstin-laptop opens https://github.com/MTG/acousticbrainz-client/issues/13

2014-10-15 28823, 2014
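
One common way to soften SQLite "database is locked" errors is to combine the driver's own lock timeout with a small retry loop. The sketch below illustrates that general technique only; it is not taken from acousticbrainz-client, and the helper name `execute_with_retry` and its parameters are hypothetical.

```python
import sqlite3
import time


def execute_with_retry(db_path, sql, params=(), retries=5, delay=0.2):
    """Run one write statement, retrying if the database is locked.

    Hypothetical helper; returns the number of attempts that were needed.
    """
    for attempt in range(retries):
        # timeout= makes SQLite itself wait up to 10 s for a lock to clear.
        conn = sqlite3.connect(db_path, timeout=10.0)
        try:
            with conn:  # commits on success, rolls back on error
                conn.execute(sql, params)
            return attempt + 1
        except sqlite3.OperationalError as exc:
            # Only retry on lock contention; re-raise anything else,
            # and give up once the last attempt has failed.
            if "locked" not in str(exc) or attempt == retries - 1:
                raise
            time.sleep(delay * (attempt + 1))  # simple linear back-off
        finally:
            conn.close()
```

Whether this would help with the errors reported here depends on why the database is being locked, which the log does not say.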

1:29 AM
MightyJay_ joined the channel

2014-10-15 28859, 2014

1:29 AM
demonimin_ joined the channel

2014-10-15 28812, 2014

1:31 AM
Mineo_ joined the channel

2014-10-15 28834, 2014

1:31 AM
zas_ joined the channel

2014-10-15 28816, 2014

1:33 AM
michiwend joined the channel

2014-10-15 28836, 2014

1:33 AM
DjSlash_ joined the channel

2014-10-15 28852, 2014

1:41 AM
michiwend_ joined the channel

2014-10-15 28858, 2014

1:41 AM
nikki_ joined the channel

2014-10-15 28859, 2014

1:42 AM
legoktm joined the channel

2014-10-15 28801, 2014

1:43 AM
legoktm joined the channel

2014-10-15 28838, 2014

2:27 AM
Gentlecat joined the channel

2014-10-15 28836, 2014

2:47 AM
kepstin-laptop

so, 92k unique recordings in abz now

2014-10-15 28836, 2014

2:48 AM
JesseW joined the channel

2014-10-15 28821, 2014

2:49 AM
ianmcorvidae

yup

2014-10-15 28829, 2014

2:49 AM
ianmcorvidae

I should do those graphs, lossy stuff has been growing a lot more lately

2014-10-15 28848, 2014

2:51 AM
kepstin-laptop

my lossless stuff hasn't finished yet

2014-10-15 28800, 2014

2:53 AM
kepstin-laptop

this data set's gonna be a bit more weighted towards japanese pop than most, i think ;)

2014-10-15 28817, 2014

2:53 AM
ianmcorvidae

haha

2014-10-15 28846, 2014

2:55 AM
ianmcorvidae

probably got more estonian hip-hop than the average dataset just by my 6 CDs worth :P

2014-10-15 28836, 2014

2:58 AM
kepstin-laptop

well, it can only improve the results, right?

2014-10-15 28800, 2014

3:02 AM
ianmcorvidae

yup!

2014-10-15 28809, 2014

3:02 AM
ianmcorvidae

I was thinking I should write a crappy recommender to kick us off

2014-10-15 28823, 2014

3:02 AM
ianmcorvidae

something obviously terrible like levenshtein distance of the JSON :P

2014-10-15 28850, 2014
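
For reference, the deliberately terrible baseline joked about here could be sketched as follows: serialise two JSON documents and take the edit distance between the strings. The function names are hypothetical and the documents are stand-ins, not real AcousticBrainz output.

```python
import json


def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # deletion
                current[j - 1] + 1,            # insertion
                previous[j - 1] + (ca != cb),  # substitution
            ))
        previous = current
    return previous[-1]


def json_distance(doc_a, doc_b):
    # Serialise with sorted keys so key order does not affect the distance.
    return levenshtein(json.dumps(doc_a, sort_keys=True),
                       json.dumps(doc_b, sort_keys=True))
```

The distance is dominated by how numbers happen to be printed rather than by anything musical, which is exactly what makes it a baseline anyone can beat.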

3:03 AM
kepstin-laptop

hmm, something with just the low-level data? Could do something silly like just match bpm and key

2014-10-15 28830, 2014

3:04 AM
kepstin-laptop

you like this song in C# major at 140bpm, so you'll obviously like this other one too!

2014-10-15 28829, 2014
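
A minimal sketch of the "match bpm and key" idea, assuming each track has been reduced to a flat dict with `key`, `scale` and `bpm` fields. Those field names, the tolerance, and the sample tracks are assumptions made for illustration, not the actual low-level schema.

```python
def same_bpm_and_key(liked, candidate, bpm_tolerance=2.0):
    """True if two tracks share a key and have nearly the same tempo."""
    return (
        liked["key"] == candidate["key"]
        and liked["scale"] == candidate["scale"]
        and abs(liked["bpm"] - candidate["bpm"]) <= bpm_tolerance
    )


def recommend(liked, catalogue):
    # Return the titles of every catalogue track that "matches".
    return [t["title"] for t in catalogue if same_bpm_and_key(liked, t)]


# Hypothetical data: one liked track and a three-track catalogue.
liked = {"title": "A", "key": "C#", "scale": "major", "bpm": 140.0}
catalogue = [
    {"title": "B", "key": "C#", "scale": "major", "bpm": 141.0},
    {"title": "C", "key": "C#", "scale": "minor", "bpm": 140.0},
    {"title": "D", "key": "F", "scale": "major", "bpm": 90.0},
]
print(recommend(liked, catalogue))  # → ['B']
```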

3:08 AM
ianmcorvidae

that's far more sophisticated than I was thinking XD

2014-10-15 28856, 2014

3:08 AM
ianmcorvidae

I mean, I'm really thinking in the vein of making a truly terrible recommender that anyone can do better than, because I want to goad them into doing so :P

2014-10-15 28802, 2014

3:11 AM
CallerNo6

listeners who like songs with "satan" in the title will probably like other songs with "satan" in the title?

2014-10-15 28822, 2014

3:11 AM
kepstin-laptop wonders if there's something really silly and easy you could do which would on average perform worse than random matching.

2014-10-15 28842, 2014

3:11 AM
ianmcorvidae

hah

2014-10-15 28813, 2014

3:12 AM
CallerNo6

I've been assured that nobody's smart enough to be wrong all the time. But it can't hurt to try?

2014-10-15 28858, 2014

3:13 AM
kepstin-laptop

doesn't have to be all the time

2014-10-15 28803, 2014

3:14 AM
kepstin-laptop

just on average :)

2014-10-15 28836, 2014

3:14 AM
kepstin-laptop

(if you actually got it wrong all the time, you could presumably just flip your rating and get something actually useful)

2014-10-15 28800, 2014

3:17 AM
CallerNo6

hence the expression :-)

2014-10-15 28800, 2014

3:19 AM
CallerNo6

( http://www.blogcatalog.com/discuss/entry/nobody-i… )

2014-10-15 28813, 2014

4:05 AM
KillDaBOB_ joined the channel

2014-10-15 28839, 2014

7:53 AM
ijabz1 joined the channel

2014-10-15 28816, 2014

7:59 AM
ijabz1 joined the channel

2014-10-15 28855, 2014

8:07 AM
ianmcorvidae

past 100k uniques! :D

2014-10-15 28813, 2014

8:17 AM
ruaok

\ø/

2014-10-15 28806, 2014

9:05 AM
djp joined the channel

2014-10-15 28820, 2014

9:07 AM
yeeeargh joined the channel

2014-10-15 28850, 2014

9:42 AM
ruaok

alastairp: do you have a sec to talk about jesus christ your lord and saviour?

2014-10-15 28853, 2014

9:42 AM
ruaok

er wait.

2014-10-15 28804, 2014

9:43 AM
ruaok

how about the schema for the highlevel table? :)

2014-10-15 28819, 2014

9:43 AM
alastairp

I can see how you might confuse them

2014-10-15 28824, 2014

9:43 AM
ruaok

in particular I'm thinking of what version info we should track.

2014-10-15 28824, 2014

9:43 AM
alastairp

they're both world-changing

2014-10-15 28831, 2014

9:43 AM
ruaok

heh. :)

2014-10-15 28851, 2014

9:43 AM
alastairp

are you at the lab, or will we do it here?

2014-10-15 28808, 2014

9:44 AM
ruaok

here. mom is in town and I only have half days while aleta baby-sits mom.

2014-10-15 28820, 2014

9:44 AM
ruaok wishes he was in the lab

2014-10-15 28831, 2014

9:44 AM
alastairp

I don't know what high-level features or algorithms will be in the output

2014-10-15 28846, 2014

9:44 AM
ruaok

yeah, that too.

2014-10-15 28813, 2014

9:45 AM
ruaok

so, my inclination is to store: json, timestamp and essentia_git_sha

2014-10-15 28827, 2014

9:45 AM
ruaok

since, I am thinking that only the AB server should ever calculate high level stuff.

2014-10-15 28839, 2014

9:45 AM
ruaok

is that even a reasonable assumption?

2014-10-15 28842, 2014

9:45 AM
alastairp

split per algorithm?

2014-10-15 28804, 2014

9:46 AM
ruaok

ideally, but I just don't know if the essentia codebase is really ready for that.

2014-10-15 28817, 2014

9:46 AM
ruaok

I think we may just need to start with one version and get a move on.

2014-10-15 28827, 2014

9:46 AM
ruaok

the good thing is that we can re-calculate this at any time.

2014-10-15 28842, 2014

9:46 AM
alastairp

right. that'd be a good start then

2014-10-15 28858, 2014

9:46 AM
ruaok

ok, I'll get moving on that.

2014-10-15 28801, 2014
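
The three columns proposed above (json, timestamp, essentia_git_sha) could be sketched as a table like the one below. This is only an illustration: it uses an in-memory SQLite database so it runs anywhere, and the table name, column names, the extra `mbid` column and the sample values are all assumptions rather than the schema that was actually committed.

```python
import json
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema; only the json/timestamp/git-sha idea comes from the log.
SCHEMA = """
CREATE TABLE highlevel (
    id               INTEGER PRIMARY KEY,
    mbid             TEXT NOT NULL,  -- recording the data belongs to (assumed)
    data             TEXT NOT NULL,  -- high-level JSON document
    submitted        TEXT NOT NULL,  -- when the calculation was stored
    essentia_git_sha TEXT NOT NULL   -- Essentia revision that produced it
)
"""


def store_highlevel(conn, mbid, document, git_sha):
    conn.execute(
        "INSERT INTO highlevel (mbid, data, submitted, essentia_git_sha) "
        "VALUES (?, ?, ?, ?)",
        (mbid, json.dumps(document),
         datetime.now(timezone.utc).isoformat(), git_sha),
    )


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
store_highlevel(conn, "hypothetical-mbid", {"mood": "happy"}, "abc1234")
row = conn.execute("SELECT mbid, essentia_git_sha FROM highlevel").fetchone()
print(row)  # → ('hypothetical-mbid', 'abc1234')
```

Recording the git sha with every row is what makes later re-calculation practical: rows produced by an older revision can be found and recomputed.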

9:47 AM
ruaok

any signs of dima?

2014-10-15 28821, 2014

9:47 AM
alastairp

if there are many algorithms, there's no difference between 1 binary that spits out lots of bits of json, and many binaries that each spit out their own

2014-10-15 28827, 2014

9:47 AM
alastairp

no, but he normally does afternoons, I think

2014-10-15 28800, 2014

9:48 AM
alastairp

I'll try and grab him as soon as I can

2014-10-15 28852, 2014

9:51 AM
ruaok returns from a mom interruption

2014-10-15 28800, 2014

9:52 AM
alastairp

I have to put out some ssl fires on freesound first, but back to this asap

2014-10-15 28808, 2014

9:52 AM
ruaok

ah yes.

2014-10-15 28822, 2014

10:41 AM
LordSputnik joined the channel

2014-10-15 28825, 2014

10:41 AM
ruaok_ joined the channel

2014-10-15 28801, 2014

10:51 AM
Nyanko-sensei joined the channel

2014-10-15 28839, 2014

10:53 AM
ruaok

alastairp: got a moment for a quick sanity check on https://github.com/metabrainz/acousticbrainz-serv… ?

2014-10-15 28847, 2014

10:53 AM
ruaok

all high level related stuff only.

2014-10-15 28829, 2014

10:56 AM
alastairp

ah, I see. that split is pretty cool

2014-10-15 28849, 2014

10:56 AM
alastairp

do you want to do anything about highlevel_json / raw_json table names?

2014-10-15 28807, 2014

10:57 AM
ruaok

unsure.

2014-10-15 28824, 2014

10:57 AM
ruaok

we are not likely to need the split and view as we do for the lowlevel stuff.

2014-10-15 28842, 2014

10:57 AM
ruaok

first question is if ianmcorvidae intended for all the json to go into one table.

2014-10-15 28852, 2014

10:57 AM
ruaok

my gut instinct says to use two tables.

2014-10-15 28800, 2014

10:58 AM
ruaok

for scalability.

2014-10-15 28809, 2014

10:58 AM
ruaok

and then deciding on the names.

2014-10-15 28809, 2014

10:58 AM
alastairp

right

2014-10-15 28837, 2014

10:58 AM
ruaok

but ianmcorvidae is sleeping, right now.

2014-10-15 28857, 2014

10:58 AM
ruaok

but assuming you're ok with the columns in said tables, I'll press on for now.

2014-10-15 28806, 2014

10:59 AM
ruaok

changing table names during the review phase is easy.

2014-10-15 28845, 2014

10:59 AM
ruaok

combining tables less so, but I think having two tables is desirable.

2014-10-15 28800, 2014

11:00 AM
ruaok

we're not losing anything having separate tables.

2014-10-15 28839, 2014

11:01 AM
alastairp

yes, I think 2 is a good idea

2014-10-15 28841, 2014

11:01 AM
alastairp

otherwise, fine

2014-10-15 28852, 2014

11:01 AM
ruaok

ok, I'll keep moving then.

2014-10-15 28803, 2014

11:02 AM
ruaok

not sure I can get a PR up for the high level stuff today, but I'll try.

2014-10-15 28820, 2014

11:02 AM
ruaok

hm.

2014-10-15 28849, 2014

11:02 AM
ruaok

I'll build no locking support into the highlevel stuff.

2014-10-15 28854, 2014

11:03 AM
ruaok

I'm going to assume that there will be one master program that looks at the DB, determines which highlevel data needs to be calculated, fires off a thread that will then calculate the highlevel data.

2014-10-15 28811, 2014

11:04 AM
ruaok

it then takes ending threads and stores the data into the DB.

2014-10-15 28837, 2014
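
The single-master design described above might look roughly like this sketch: one program hands pending items to worker threads and is the only code that writes results, so no locking is needed on the storage side. `calculate_highlevel`, `run_master` and the sample data are hypothetical placeholders, and the real calculation is not specified in the log.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def calculate_highlevel(mbid, lowlevel):
    # Placeholder for the real (unspecified) high-level calculation.
    return {"mbid": mbid, "n_features": len(lowlevel)}


def run_master(pending, store, max_workers=4):
    """pending: dict of mbid -> low-level data that still lacks high-level output.
    store: callable that is only ever invoked from this (master) thread.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(calculate_highlevel, mbid, data)
                   for mbid, data in pending.items()]
        for future in as_completed(futures):
            store(future.result())  # single writer, so no locking needed


results = []
run_master({"a": {"bpm": 140}, "b": {"bpm": 90, "key": "C#"}}, results.append)
print(sorted(r["mbid"] for r in results))  # → ['a', 'b']
```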

11:05 AM
Nyanko-sensei joined the channel

2014-10-15 28840, 2014

11:20 AM
ardoRic

does the vm update the musicbrainz-server code automatically, or should I check it out again ?

2014-10-15 28805, 2014

11:22 AM
ruaok

just do a git pull on it.

2014-10-15 28809, 2014

11:22 AM
ruaok

it doesn't update automatically

2014-10-15 28841, 2014

11:35 AM
KillDaBOB_ joined the channel

2014-10-15 28825, 2014

11:47 AM
chirlu` joined the channel

2014-10-15 28806, 2014

11:48 AM
KillDaBOB joined the channel

2014-10-15 28841, 2014

12:41 PM
Nyanko-sensei joined the channel

2014-10-15 28827, 2014

13:47 PM
ijabz1 joined the channel

2014-10-15 28849, 2014

13:56 PM
kepstin-laptop

so, >100k recordings now :)

2014-10-15 28831, 2014

14:05 PM
alastairp

this is great. 10% of our target in 5 days

2014-10-15 28824, 2014

14:06 PM
alastairp

at this rate that'll be ~400k by the end of the month, so if we get more people running it in the coming week I think 500k or more is really doable

2014-10-15 28806, 2014
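
The ~400k estimate follows from a straight-line projection. The sketch below assumes the submission rate stays constant and that "end of the month" means 31 October; the figures are taken from the log.

```python
from datetime import date

submitted = 100_000             # unique recordings so far
days_elapsed = 5                # "10% of our target in 5 days"
today = date(2014, 10, 15)      # date of this log
month_end = date(2014, 10, 31)  # assumed meaning of "end of the month"

rate_per_day = submitted / days_elapsed
days_left = (month_end - today).days
projected = submitted + rate_per_day * days_left
print(int(rate_per_day), days_left, int(projected))  # → 20000 16 420000
```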

14:07 PM
kepstin-laptop

I've just about hit all the music I have now, though.

2014-10-15 28845, 2014

14:07 PM
kepstin-laptop

keeping the rate up probably really requires getting more people to run the tool :)

2014-10-15 28855, 2014

14:09 PM
alastairp

right, but the only reason we've not opened this up wider is that the tools still have problems

2014-10-15 28823, 2014

14:10 PM
alastairp

rob is confident, and I agree with him, that we can dump this tool on 2-4x as many people immediately

2014-10-15 28842, 2014

14:10 PM
alastairp

which will keep up our submission speed

2014-10-15 28815, 2014

14:13 PM
kepstin-laptop has started to run it on the stuff he only has in lossy formats now

2014-10-15 28826, 2014

14:13 PM
kepstin-laptop

(which is a bunch of touhou arranges, mostly)

2014-10-15 28813, 2014

14:29 PM
Nyanko-sensei joined the channel

2014-10-15 28847, 2014

14:38 PM
ruaok

in fact, I think we should start tapping people on the shoulders quietly and ask them to jump in.

2014-10-15 28802, 2014

14:40 PM
alastairp

right

2014-10-15 28811, 2014

14:45 PM
ruaok

we need to get derwin in on this.

2014-10-15 28814, 2014

14:47 PM
nikki is still working on her stuff

2014-10-15 28845, 2014

14:48 PM
nikki

although when I'll be able to actually run it on *all* of my music is another question

2014-10-15 28823, 2014

14:49 PM
ijabz1

if we can get either an osx or windows version available soon it will be a lot easier to get more users

2014-10-15 28825, 2014

14:49 PM
nikki

(right now I can't do korean stuff, because apparently linux has a bug in its support for korean filenames on hfs filesystems)

2014-10-15 28841, 2014

14:50 PM
JesseW joined the channel

2014-10-15 28841, 2014

14:51 PM
ruaok

ijabz1: that is our goal for friday, if at all possible

2014-10-15 28820, 2014

14:52 PM
ijabz1

great

2014-10-15 28818, 2014

14:55 PM
jesus2099_ joined the channel

2014-10-15 28859, 2014

15:00 PM
alastairp

i wish

2014-10-15 28830, 2014

15:01 PM
LordSputnik

btw, have about 12k lossless tracks for scanning - are there instructions anywhere? :)

2014-10-15 28830, 2014

15:02 PM
yeeeargh

https://chatlogs.musicbrainz.org/musicbrainz-deve… this seems to work pretty good

2014-10-15 28859, 2014

15:03 PM
LordSputnik

ok, will see what I can do later :)

2014-10-15 28808, 2014

15:04 PM
ruaok

LordSputnik: sweet.

2014-10-15 28821, 2014

15:04 PM
LordSputnik has left the channel

2014-10-15 28833, 2014

15:04 PM
hawke1 joined the channel

2014-10-15 28820, 2014

15:11 PM
kepstin-laptop__ joined the channel

2014-10-15 28827, 2014

15:20 PM
drsaunde

ruaok: Not sure what you guys are doing but i'd be happy to help whenever

2014-10-15 28838, 2014

15:20 PM
ruaok

got flacs?

2014-10-15 28800, 2014

15:21 PM
drsaunde

no

2014-10-15 28800, 2014

15:21 PM
ruaok

even if lossy, anything helps at this point.

2014-10-15 28814, 2014

15:21 PM
ruaok

got linux?