#metabrainz

      • himanshu25 joined the channel
      • ruaok
        Shocker. Vueling flight home is... Delayed.
      • himanshu25
        Hi there.. I'm interested in "ListenBrainz: Statistics" project . Please help me getting started or any warm-up tasks?
      • ariscop has quit
      • ariscop joined the channel
      • umeshksingla joined the channel
      • umeshksingla has quit
      • himanshu25 has quit
      • Freso
        You need to stick around if you want any kind of reply.
      • ainishd has quit
      • reosarevok
        At least I got reminded of https://www.youtube.com/watch?v=jVFZLXtq0nE :p
      • "Caught 'em up in the web of the world and it's wide / The YouTubes has more science than Bill Nye the Science Guy" :D
      • umeshksingla joined the channel
      • kartikgupta0909 joined the channel
      • kartikgupta0909
        armalcolite: Can you tell me your name? :p
      • reosarevok
        Freso: we should probably look at the old forum stickies and see if some should be imported
      • Freso
        Sure.
      • ruaok
        reosarevok: I should be able to do the renames later today/tomorrow.
      • reosarevok
        :)
      • But I have learned not to give ETAs if they're not strictly needed :p
      • michiwend joined the channel
      • ruaok
        But first, back into sun and more sanity than this bureaucratic crap city. :-)
      • Freso
        Man. I can't wait for June. Borrowed a replacement laptop, but it's so slow. ;_; I've put in 4 GB of RAM, but it seems like it only recognises 2. And the up key doesn't work.
      • And the mousepad moves the mouse all over the place.
      • ruaok is off
      • Except now it does. Wtf.
      • michiwend has quit
      • michiwend joined the channel
      • alastairp
        I added some "Getting started" subsections to the ideas page, after discussion with Gentlecat last night
      • suggesting what people can do with the AB and LB servers to get started - I was getting the feeling that our blanket "set it up and play with it" was a little too vague
      • Freso
        "set it up" isn't, "play with it" is. :)
      • Freso would love for a student to do LB-1 as part of their "warm-up", hint, hint ;)
      • alastairp
        but I even think "set up LB" is kind of vague without saying why
      • yes, I know "implement LB-1" or similar is a good reason, but I think explicit is better than implicit
      • anyway, I agree. if we can get people coming in saying "hey, I set this up and tried some things, what should I do next" I think that's a lot better than "hey, I want to work on x, what should I do?"
      • Freso
        Agreed.
      • Freso should also really look into getting factoids set up for BrainzBot
      • alastairp
        so, let's see if these comments change the tone of questions around MB and LB
      • s/MB/AB/
      • Freso
        *B :)
      • alastairp
        [AL]B
      • Freso
        (A|L)B
      • alastairp
        well. that's the same as [AL] :)
      • I guess it depends what grammar you use
      • (A|L)+B
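
For reference, a minimal Python sketch of the regex point above: `[AL]B` and `(A|L)B` accept the same strings, while `(A|L)+B` also accepts a repeated prefix. The sample strings are made up for illustration, and Python's `re` grammar is assumed.

```python
import re

# Illustrative project abbreviations plus a few non-matching strings.
samples = ["AB", "LB", "MB", "ALB", "B"]

char_class = re.compile(r"[AL]B")    # character class
alternation = re.compile(r"(A|L)B")  # alternation inside a group
repeated = re.compile(r"(A|L)+B")    # one or more of A or L, then B


def matches(pattern, strings):
    """Return the strings that the pattern matches in full."""
    return [s for s in strings if pattern.fullmatch(s)]


class_hits = matches(char_class, samples)
alt_hits = matches(alternation, samples)
repeat_hits = matches(repeated, samples)

print(class_hits)   # ['AB', 'LB']
print(alt_hits)     # ['AB', 'LB']
print(repeat_hits)  # ['AB', 'LB', 'ALB']
```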
      • I'm trying to clean up GettingStarted to include some less-MBS specific stuff
      • vidhan joined the channel
      • Freso: is the submission template on the wiki?
      • Freso
        GSoC application? Yes.
      • alastairp
        It'd be nice to expose students to it before they apply
      • where?
      • Freso
        Summer of Code/Application Template or something like that. It's linked from the blog post at least.
      • alastairp
        OK, I'll add it to some other wiki pages too
      • Freso
        :)
      • Thank you alastairp.
      • alastairp
        you're welcome, head of community Freso
      • ;)
      • Freso
        :p
      • mrc0mmand_ joined the channel
      • mrc0mmand has quit
      • cetko
        Hi guys, I'm interested in contributing to MetaBrainz via gsoc this year
      • So I came to say 'Hi'
      • :)
      • mrc0mmand_ is now known as mrc0mmand
      • Gentlecat
        hello
      • alastairp
        hi
      • cetko
        Concretely, I'd like to work on AcousticBrainz, because it includes (as I understood it) lightweight machine learning experience
      • Freso
        "Lightweight"...
      • cetko
        :)
      • Freso
        But hi :)
      • alastairp
        we currently use SVM on AcousticBrainz
      • cetko
        alastairp: thanks!
      • Freso: "lightweight" as in "It would suit someone with experience or an interest in machine learning algorithms, though the majority of the project will probably involve creating infrastructure around our existing algorithms." :)
      • alastairp
        we don't have any concrete proposals specifically around the machine learning algorithms
      • right, as you've just seen
      • however if you wanted to propose something we could probably make it fit
      • new algorithms to see how they compare to SVM? Random Forest? Deep learning?
      • cetko
        I'll check the link above and AcousticBrainz docs and see if anything comes to mind
      • Gentlecat
        Freso: wouldn't it discourage people from discussing a project if there is no category for it? doesn't seem like the right approach to me
      • Nyanko-sensei joined the channel
      • D4RK-PH0ENiX has quit
      • cetko
        alastairp: how do Python and Flask come into play with AcousticBrainz since they're listed on the ideas page?
      • alastairp
        the server is written in flask
      • there's an audio feature extractor written in C++ which performs the signal processing and outputs json
      • then the webserver does stuff with it
      • "stuff"
      • ainishd joined the channel
      • ainishd has quit
      • Discourse is pretty buggy
      • umeshksingla has quit
      • Nyanko-sensei has quit
      • D4RK-PH0ENiX joined the channel
      • ariscop has quit
      • sandy_ joined the channel
      • ariscop joined the channel
      • CatQuest
        how do I turn off smilies in my post on Discourse?
      • ariscop has quit
      • sandy_
        Hi ...I am Sandeep bharti currently pursuing my under graduate program in computer science and I am interested in the Listenbrainz:Statistics project as listed in the project ideas for GSoC 2016. Can anyone help me in how to get started ? Thanks in advance :)
      • Gentlecat
        hey. one way to start is to try and install ListenBrainz to get familiar with some things
      • bagl has left the channel
      • cetko
        alastairp: is there a git repo for the webserver code?
      • nevermind, found it!
      • alastairp
        sandy_: there is a "Getting started" section on the Ideas page. You should see if you can follow that at first
      • cetko: cool
      • sandy_
        okay it looks cool. I will follow that for now, thanks :)
      • ariscop joined the channel
      • Gentlecat
        kind of weird that h3 and h4 look exactly the same in the wiki
      • alastairp
        Gentlecat: yeah
      • cetko
        I see an issue notice about the statistics chart on the AcousticBrainz webpage, what kind of issue?
      • alastairp
      • cetko
        oh, I thought that's supposed to be there :)
      • Gentlecat
        finally caught up \o/
      • cetko
        a side note, I've recently dabbled with real-time databases (InfluxDB) for pageview analytics, would that be of any interest for the statistics part of acousticbrainz?
      • alastairp
        AB stats aren't really that critical
      • and their scale is quite small. we have a few rows per hour storing counts, and that's it
      • himanshu25 joined the channel
      • cetko
        yes, but as I understand it, the c++ submission code outputs json, are those raw time-series data?
      • umeshksingla joined the channel
      • or aggregated?
      • alastairp
        ah, no. statistics is just a time series of the number of data submissions we have
      • the output of the feature extractor is not time series
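
A rough sketch of what "a time series of the number of data submissions" with "a few rows per hour" could look like. This is not the AcousticBrainz implementation: the timestamps and the `hourly_counts` helper are hypothetical and only illustrate bucketing submissions by hour.

```python
from collections import Counter
from datetime import datetime

# Made-up submission timestamps, for illustration only.
submission_times = [
    datetime(2016, 3, 1, 10, 5),
    datetime(2016, 3, 1, 10, 40),
    datetime(2016, 3, 1, 11, 15),
]


def hourly_counts(times):
    """Count submissions per hour by truncating each timestamp to the hour."""
    return Counter(t.replace(minute=0, second=0, microsecond=0) for t in times)


counts = hourly_counts(submission_times)

# One row per hour: the hour and how many submissions fell in it.
for hour, n in sorted(counts.items()):
    print(hour.isoformat(), n)
```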
      • kartikgupta0909
        alastairp: Replied to your comments on my ideas. Please have a look whenever possible and we can discuss it further.
      • alastairp
        we take a mean (and other statistics) over the entire contents of the audio file
      • however, we are looking at storing complete time-series data in the future as well (we call it frame-level data)
      • I wonder if a real-time db would be the kind of thing that we can use to store this frame level data
      • cetko
        alastairp: how far in the future? this summer? :)
      • alastairp
        perhaps
      • the extractor generates the data with a configuration option switched on
      • we currently store json in postgres, in a jsonb field
      • however once we generate the extended data it's too large to store in postgres (10MB/minute)
      • we don't actually know how to store it yet. I had a look into protocol buffers, but I'm also interested in looking at other options
      • cetko
        what's the extended data?
      • alastairp
        one sec, let me see if I can find an example
      • himanshu25
        Hi again... sorry I lost my connection then... I'm interested in "ListenBrainz: Statistics" ... So how should I proceed?
      • alastairp
        in fact, you could probably generate it yourself
      • have you downloaded streaming_extractor_music ?
      • himanshu25: take a look at the "Getting started" section for listenbrainz on the wiki
      • see in the json we have summary information - mean, median, variance, first derivative for some values
      • whereas in the frames data, it has 1 value for every 1024 frames of audio (1 second = 44100 frames)
      • so, e.g. the barkbands_crest->mean value is the mean of all of these frames
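
A small sketch of the relationship described above, assuming 44100 frames (samples) per second and one frame-level value per 1024 of them, as stated. The `frame_values` list is made-up data rather than real extractor output, and using the population variance is an assumption about how the summary is defined.

```python
import statistics

SAMPLE_RATE = 44100  # audio frames (samples) per second, as stated above
HOP = 1024           # one frame-level value per this many samples

values_per_second = SAMPLE_RATE / HOP       # a little over 43 per second
values_per_minute = values_per_second * 60

# Made-up frame-level values for one descriptor (e.g. barkbands_crest).
frame_values = [2.0, 4.0, 4.0, 6.0]

# The summary data collapses all frame-level values into a few statistics.
summary = {
    "mean": statistics.mean(frame_values),
    "median": statistics.median(frame_values),
    "var": statistics.pvariance(frame_values),  # assumption: population variance
}

print(round(values_per_second, 2), round(values_per_minute))
print(summary)
```

The point is only that the summary keeps a handful of numbers per descriptor, while the frames data keeps one value per descriptor for every 1024 samples, which is why it grows so quickly with track length.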
      • ainishd joined the channel
      • armalcolite
        kartikgupta0909: just pinged you. :p
      • cetko
        alastairp: so streaming_extractor_music generates both summary and frames data?
      • alastairp
        cetko: right
      • umeshksingla has quit
      • if you look at the usage (run with no arguments) you see that there is an argument for a profile file
      • put this in a file and use it as the profile: https://gist.github.com/alastair/df17f5f641c760...