#metabrainz

/

      • ruaok
        that is exactly what I figured out yesterday.
      • 2020-02-20 05142, 2020

      • ruaok
        let me go read the follow code just to make sure
      • 2020-02-20 05152, 2020

      • iliekcomputers
        yeah, that's what i was gonna say
      • 2020-02-20 05157, 2020

      • ZaphodBeeblebrox
        and the vlc listening works really well honestly
      • 2020-02-20 05131, 2020

      • iliekcomputers
        <3 livegrep
      • 2020-02-20 05135, 2020

      • Raghav has quit
      • 2020-02-20 05152, 2020

      • iliekcomputers
      • 2020-02-20 05143, 2020

      • ruaok
        it looks like you created a separate exchange too.
      • 2020-02-20 05148, 2020

      • ruaok
        "playing_now_exchange": "playing_now",
      • 2020-02-20 05148, 2020

      • ruaok
        "playing_now_queue": "playing_now",
      • 2020-02-20 05152, 2020

      • iliekcomputers
        i have no remembrance at all of how the follow thing worked
      • 2020-02-20 05158, 2020

      • ruaok
        docker-server-configs/consul/LB
      • 2020-02-20 05125, 2020

      • iliekcomputers
        oh
      • 2020-02-20 05127, 2020

      • iliekcomputers
        so
      • 2020-02-20 05148, 2020

      • iliekcomputers
        the playing_now queue is only for listens that people sent in as 'now_playing'
      • 2020-02-20 05155, 2020

      • iliekcomputers
      • 2020-02-20 05107, 2020

      • iliekcomputers
      • 2020-02-20 05110, 2020

      • ruaok
        yeah, I just remembered that.
      • 2020-02-20 05104, 2020

      • iliekcomputers
        k, i'm gonna get food and then look into emails for the stats cron job
      • 2020-02-20 05111, 2020

      • ruaok
        k
      • 2020-02-20 05121, 2020

      • iliekcomputers
        the observability mailer may be spammy for a while.
      • 2020-02-20 05139, 2020

      • ZaphodBeeblebrox
        oh no.
      • 2020-02-20 05154, 2020

      • Raghav joined the channel
      • 2020-02-20 05105, 2020

      • ZaphodBeeblebrox
        amCap1712: I'm finished testing for now. tell me tomorrow/osme other tiem if yo uhave spesific test cases yo uwant me to test
      • 2020-02-20 05152, 2020

      • ruaok
        iliekcomputers: self.create_and_bind_exchange_and_queue(channel, current_app.config['UNIQUE_EXCHANGE'], current_app.config['FOLLOW_LIST_QUEUE'])
      • 2020-02-20 05100, 2020

      • ruaok
        this is the crux of it. makes sense.
      • 2020-02-20 05125, 2020

      • iliekcomputers
        yep
      • 2020-02-20 05114, 2020

      • amCap1712
        ZaphodBeeblebrox: pausing/resuming, stopping midway, repeat, playlist. thats all. if all of that is tested i don't have anything more in mind
      • 2020-02-20 05141, 2020

      • ruaok
        oh, another correction
      • 2020-02-20 05148, 2020

      • ruaok
        1. start shadowing the UNIQUE queue to insert all listens (starting timestamp x)
      • 2020-02-20 05151, 2020

      • amCap1712
        iliekcomputers: ruaok: do you think there could be some other edge cases which might require to tested separately?
      • 2020-02-20 05102, 2020

      • ruaok
        2. directly import from influx all listens inserted to influx before timestamp x
      • 2020-02-20 05103, 2020

      • ZaphodBeeblebrox
        I think i did all these. i guess palylist. but i'm tired and stuff i need ot eat too
      • 2020-02-20 05109, 2020

      • amCap1712
        ZaphodBeeblebrox: tommorrow lets test on windows then
      • 2020-02-20 05122, 2020

      • amCap1712
        yeah sure. thanks for the help
      • 2020-02-20 05122, 2020

      • ZaphodBeeblebrox
        i'll try to move the windows machine . alright!
      • 2020-02-20 05126, 2020

      • ruaok
        amCap1712: the answer is yes. but, what are we talking about?
      • 2020-02-20 05137, 2020

      • ZaphodBeeblebrox
        hey no problem. this is awesome!!
      • 2020-02-20 05120, 2020

      • amCap1712
        ruaok: edge cases to test the plugin for. like one might be to stop the plugin before it is half finished
      • 2020-02-20 05122, 2020

      • ZaphodBeeblebrox
        two different things beeing listened too at the same time (diff machines, diff programs, etc)
      • 2020-02-20 05128, 2020

      • amCap1712
        and should not be submitted
      • 2020-02-20 05132, 2020

      • ZaphodBeeblebrox
        i cna test this tomorrow! windows and mac both play!
      • 2020-02-20 05104, 2020

      • ruaok
        amCap1712: if the track played 30 seconds or more, it is considered a play.
      • 2020-02-20 05106, 2020

      • ZaphodBeeblebrox
        both should listenz. after all I 'm listening ot both :D
      • 2020-02-20 05128, 2020

      • ZaphodBeeblebrox
        so. if playing , say 40 seocnds thne double clicknig again on it
      • 2020-02-20 05129, 2020

      • ZaphodBeeblebrox
        i cna test that
      • 2020-02-20 05132, 2020

      • amCap1712
        ruaok: even if it less than half the duration?
      • 2020-02-20 05152, 2020

      • ruaok
        30 seconds is just the standard in the industry.
      • 2020-02-20 05109, 2020

      • ruaok
        which seems BS to me, but...
      • 2020-02-20 05159, 2020

      • amCap1712
        that just a constant i'll have to modify.
      • 2020-02-20 05118, 2020

      • ruaok
        then for now, keep it simple and set the constant to 30.
      • 2020-02-20 05119, 2020

      • amCap1712
        but i think it gives the idea of what i am trying to convey by different test cases
      • 2020-02-20 05140, 2020

      • ZaphodBeeblebrox
        i have songs that are >30 secs whixh i wish would listenz
      • 2020-02-20 05141, 2020

      • ruaok
        yes, but I can't really think of any off the top of my head.
      • 2020-02-20 05147, 2020

      • ruaok
        alastairp is good at these things.
      • 2020-02-20 05109, 2020

      • ZaphodBeeblebrox
        (ok so I made up that listening eg "scrobbling" is called "listenz" .. i uges it's dumb)
      • 2020-02-20 05136, 2020

      • amCap1712
        i'll post on the discourse for asking others to test.
      • 2020-02-20 05138, 2020

      • ZaphodBeeblebrox
      • 2020-02-20 05113, 2020

      • amCap1712
        once we are satisfied with the testing, i can send the patch upstream for merging
      • 2020-02-20 05118, 2020

      • ZaphodBeeblebrox
        yes
      • 2020-02-20 05137, 2020

      • ruaok
        iliekcomputers: for when you get back: https://gist.github.com/mayhem/fb944b7bde300643cb…
      • 2020-02-20 05138, 2020

      • iliekcomputers
        i tend towards the simple one
      • 2020-02-20 05153, 2020

      • iliekcomputers
        considering it's writing stuff at 20k/s right now?
      • 2020-02-20 05130, 2020

      • iliekcomputers
        a full import is 4-5 hours?
      • 2020-02-20 05141, 2020

      • ruaok
        I have no idea if writing time series data in a willy nilly fashion will yield an optimal layout on disk.
      • 2020-02-20 05153, 2020

      • ruaok
        not sure if timescale is smart enough to re lay things out.
      • 2020-02-20 05112, 2020

      • iliekcomputers
        hmm, will have to look into that, i guess.
      • 2020-02-20 05139, 2020

      • iliekcomputers
        but
      • 2020-02-20 05108, 2020

      • ruaok
        the rate I am getting is about 880k inserts per minute.
      • 2020-02-20 05128, 2020

      • iliekcomputers
        listens will always come in willy nilly fashion afterwards anyways
      • 2020-02-20 05135, 2020

      • ruaok
        6 hours.
      • 2020-02-20 05103, 2020

      • ruaok
        true dat, but some at least 300M we might have a more performant setup.
      • 2020-02-20 05116, 2020

      • ruaok
        let me read their best practices guide and see if I can glean anything.
      • 2020-02-20 05146, 2020

      • iliekcomputers shrugs
      • 2020-02-20 05118, 2020

      • iliekcomputers
        the scale we're scaling for, 300M would be small potatoes
      • 2020-02-20 05135, 2020

      • ruaok
        also true.
      • 2020-02-20 05141, 2020

      • iliekcomputers
        that's my intuition, maybe reality is different
      • 2020-02-20 05148, 2020

      • ruaok
        aight, let me put it this way... I really want to know all the caveats this time.
      • 2020-02-20 05100, 2020

      • ruaok
        as you said, switching in 3 years would be much harder.
      • 2020-02-20 05109, 2020

      • iliekcomputers
        yeah, that makes sense.
      • 2020-02-20 05121, 2020

      • ruaok
        and I agree with zas' take on scaling.
      • 2020-02-20 05131, 2020

      • ruaok
        partition users by server.
      • 2020-02-20 05106, 2020

      • ruaok
        so when one node gets full, we add more and migrate users over.
      • 2020-02-20 05139, 2020

      • ruaok
      • 2020-02-20 05152, 2020

      • ruaok
        see TIP under insert.
      • 2020-02-20 05158, 2020

      • iliekcomputers
        yup, designing for sharding is a good idea
      • 2020-02-20 05149, 2020

      • ruaok
        also warning under UPDATE.
      • 2020-02-20 05100, 2020

      • ruaok
        that is the "import form last.fm" will become a problem case.
      • 2020-02-20 05138, 2020

      • iliekcomputers
        that is updates though
      • 2020-02-20 05150, 2020

      • iliekcomputers
        oh wait it says inserts, sorry
      • 2020-02-20 05123, 2020

      • ruaok
        basically messing with anything other than the current chunk, is going to cost you.
      • 2020-02-20 05135, 2020

      • ruaok
        one way or another. read should continue to be fine though.
      • 2020-02-20 05128, 2020

      • ruaok
        iliekcomputers: we've seen this before, no?
      • 2020-02-20 05128, 2020

      • ruaok
      • 2020-02-20 05157, 2020

      • iliekcomputers
        in AB, i think
      • 2020-02-20 05104, 2020

      • Raghav has quit
      • 2020-02-20 05122, 2020

      • ruaok
        > js = js.replace("\u0000", "")
      • 2020-02-20 05123, 2020

      • iliekcomputers
        that's json v jsonb maybe
      • 2020-02-20 05130, 2020

      • ruaok
        doesn't seem to be doing the trick though.
      • 2020-02-20 05146, 2020

      • ruaok
        I am using jsonb -- did you look at the schema in the pasted code?
      • 2020-02-20 05103, 2020

      • iliekcomputers
        yeah, just did
      • 2020-02-20 05105, 2020

      • ruaok
        4 columns.
      • 2020-02-20 05113, 2020

      • ruaok
        with inserted_at it will be 5.
      • 2020-02-20 05139, 2020

      • ruaok
        and with PG JSONB support the data still remains queryable. and indexable.
      • 2020-02-20 05135, 2020

      • adhawkins joined the channel
      • 2020-02-20 05153, 2020

      • Gore has quit
      • 2020-02-20 05133, 2020

      • prabal has quit
      • 2020-02-20 05158, 2020

      • Gore joined the channel
      • 2020-02-20 05132, 2020

      • Etua joined the channel
      • 2020-02-20 05159, 2020

      • Raghav joined the channel
      • 2020-02-20 05123, 2020

      • BrainzGit
        [listenbrainz-server] paramsingh opened pull request #740 (master…param/email-when-new-stats-start-getting-written): Send a mail when new user stats start getting written into the db from spark https://github.com/metabrainz/listenbrainz-server…
      • 2020-02-20 05125, 2020

      • ruaok
        how philosophical... TIME_TO_CONSIDER_STATS_AS_OLD
      • 2020-02-20 05158, 2020

      • Raghav has quit
      • 2020-02-20 05102, 2020

      • iliekcomputers
        😂
      • 2020-02-20 05111, 2020

      • iliekcomputers
        ugh
      • 2020-02-20 05118, 2020

      • iliekcomputers
        didn't push the actual email
      • 2020-02-20 05139, 2020

      • iliekcomputers
        i've deployed this to spark-reader-beta anyways, hopefully we'll see some email tomorrow
      • 2020-02-20 05107, 2020

      • ruaok
        ok, my script now gets past those 0x0000 errors.
      • 2020-02-20 05119, 2020

      • ruaok
        hopefully a full db in place in the morning.
      • 2020-02-20 05133, 2020

      • Etua has quit
      • 2020-02-20 05159, 2020

      • iliekcomputers
        10000 additions since the last LB release
      • 2020-02-20 05102, 2020

      • iliekcomputers
        🤷🏽
      • 2020-02-20 05120, 2020

      • ruaok
        10000 new listens?
      • 2020-02-20 05150, 2020

      • ruaok
        oh changed code?
      • 2020-02-20 05127, 2020

      • iliekcomputers
        yeah
      • 2020-02-20 05128, 2020

      • iliekcomputers
      • 2020-02-20 05144, 2020

      • iliekcomputers
        most of it is spark tbh
      • 2020-02-20 05150, 2020

      • ruaok
        exactly, that.
      • 2020-02-20 05102, 2020

      • ruaok
        still, we need to get back to regular releases.
      • 2020-02-20 05114, 2020

      • BrainzGit
        [listenbrainz-server] paramsingh merged pull request #740 (master…param/email-when-new-stats-start-getting-written): Send a mail when new user stats start getting written into the db from spark https://github.com/metabrainz/listenbrainz-server…
      • 2020-02-20 05104, 2020

      • BrainzGit
        [listenbrainz-server] paramsingh opened pull request #741 (master…param/fix-push-scripts): Fix spark image push scripts https://github.com/metabrainz/listenbrainz-server…
      • 2020-02-20 05112, 2020

      • kieto joined the channel
      • 2020-02-20 05144, 2020

      • prabal joined the channel
      • 2020-02-20 05140, 2020

      • ephem3r0l joined the channel
      • 2020-02-20 05129, 2020

      • supersandro2000 has quit
      • 2020-02-20 05147, 2020

      • supersandro2000 joined the channel
      • 2020-02-20 05156, 2020

      • ephem3r0l is now known as ephemer0l
      • 2020-02-20 05106, 2020

      • kieto has quit