#metabrainz

      • supersandro2000 has quit
      • supersandro20002 joined the channel
      • iliekcomputers
        Lotheric: that's the spotify player being noisy, are you opted into the Spotify integration? If not, it should not be there.
      • arcade_droid has quit
      • zarcade_droid joined the channel
      • zarcade_droid is now known as arcade_droid
      • arcade_droid is now known as zarcade_droid
      • Lotheric
        I linked my account, yeah
      • ok got it
      • unlinked myself and relinked using both play and record
      • the message is gone
      • :)
      • thanks
      • Chinmay3199 joined the channel
      • zarcade_droid has quit
      • zarcade_droid joined the channel
      • m00n
        so, is it a standard/convention that tags begin with a lower letter?
      • and if yes, is it possible to change the name of a tag without changing the value across a selection?
      • I'm adding "book" and "series" tags to some mp3s
      • in the tag list panel Picard lists Artist as being capitalized, but then the tag itself is lower case. However, if I create a custom tag it appears in the list panel verbatim.
      • which is what confused me
      • zarcade_droid is now known as arcade_droid
      • arcade_droid is now known as zarcade_droid
      • zarcade_droid is now known as arcade_droid
      • supersandro20002 has quit
      • supersandro2000 joined the channel
      • arcade_droid has quit
      • i figured it out
      • supersandro2000 has quit
      • supersandro2000 joined the channel
      • m00n has quit
      • m00n joined the channel
      • BrainzGit
        [listenbrainz-server] vansika merged pull request #852 (vansika/candidate-recordings…candidate-set-for-all): Generate candidate sets for all users at once. https://github.com/metabrainz/listenbrainz-serv...
      • [listenbrainz-server] ishaanshah opened pull request #856 (master…time_range_spark): LB-575: Add support for more time ranges for "Top Artists" (Spark and DB Insert) https://github.com/metabrainz/listenbrainz-serv...
      • BrainzBot
        LB-575: Add support for more time ranges for "Top Artists" https://tickets.metabrainz.org/browse/LB-575
      • supersandro2000 has quit
      • supersandro2000 joined the channel
      • ishaanshah[m]
        iliekcomputers: can you have a look once before I update the tests?
      • BrainzGit
        [listenbrainz-server] vansika opened pull request #857 (vansika/candidate-recordings…dataframe-doc): update documentation in create_dataframes https://github.com/metabrainz/listenbrainz-serv...
      • [listenbrainz-server] vansika opened pull request #858 (master…update-recommendation-schema): update recommendation schema to use recording mbid https://github.com/metabrainz/listenbrainz-serv...
      • supersandro2000 has quit
      • supersandro2000 joined the channel
      • mzfr joined the channel
      • diru1100
        Mo''in'!
      • v6lur joined the channel
      • iliekcomputers
        ruaok: morning!
      • I have two small pull requests on LB that I want to merge today. Will you have time to take a look?
      • ishaanshah[m]: hey! I'll look at your PR soon-ish.
      • Gazooo has quit
      • Gazooo joined the channel
      • ruaok
        iliekcomputers: can do. More urgent than the next couple of hours?
      • iliekcomputers
        Nah, not that urgent. Just want to deploy them to cron today.
      • ruaok
        K
      • iliekcomputers
        Thanks!
      • ruaok
        done!
      • BrainzGit
        [listenbrainz-server] mayhem merged pull request #853 (master…artist-relations): Add artist/artist_credit relations https://github.com/metabrainz/listenbrainz-serv...
      • [listenbrainz-server] paramsingh merged pull request #854 (master…param/better-notification-emails-for-dumps): Improve dump creation notification email subject https://github.com/metabrainz/listenbrainz-serv...
      • [listenbrainz-server] paramsingh merged pull request #855 (master…param/delete-old-dumps-from-temp-dir): LB-588: Remove old dumps from the /mnt/dumps as well https://github.com/metabrainz/listenbrainz-serv...
      • BrainzBot
        LB-588: Remove old dumps from /mnt/dumps/tmp/archives https://tickets.metabrainz.org/browse/LB-588
      • iliekcomputers
        Thanks ruaok
      • shivam-kapila
        Morning!!
      • Chinmay3199 has quit
      • Mr_Monkey is so into working on BrainzPlayer he didn't realise it's Saturday…
      • Mr_Monkey
        Where did the week go?
      • iliekcomputers
        ishaanshah[m]: i've commented on your PR. I think it'd be a good idea for us to catch up once you get the time.
      • v6lur has quit
      • ruaok
        Mr_Monkey: my week exactly!
      • ishaanshah[m]
        iliekcomputers hey, I am around
      • v6lur joined the channel
      • iliekcomputers
        ishaanshah[m]: hey
      • ishaanshah[m]
        hi
      • iliekcomputers
        did you see my comments on the PR?
      • ishaanshah[m]
        Yes
      • I agree with all of them, I will make the changes tomorrow
      • Are the listens in parquet files sorted in ascending order?
      • iliekcomputers
        i don't think so.
      • ishaanshah[m]
        wrt listened_at
      • _lucifer
        ruaok: does meb have an organization google cloud account?
      • iliekcomputers
        you'll have to do a max operation first, i guess.
      • ishaanshah[m]
        Oh, can we do that
      • ruaok
        _lucifer: yes
      • iliekcomputers
        ishaanshah[m]: yeah, i'd just load the latest month parquet file and find the max for listened_at
      • ishaanshah[m]
        Because where queries will be simple because of that
      • iliekcomputers
        this is scala, but there should be an equivalent.
      • ishaanshah[m]
        I was asking for the where queries, min max should be fine
      • iliekcomputers
        i don't understand the question.
      • ishaanshah[m]
        Just an optimisation I thought of, not necessary to be done right now
      • I mean if we could tell spark that listens are in ascending order the where queries would just binary search ig
      • iliekcomputers
        oh
      • yeah.
      • the sort would be really expensive though
      • ishaanshah[m]
        Yeah, Ig we will have to modify the export to do that
      • iliekcomputers
        hmm
      • the dump process is almost insanely inefficient rn
      • ishaanshah[m]
        It's an optimisation, maybe we should look at it after we set up a proper pipeline
      • iliekcomputers
        yeah, true.
      • ishaanshah[m]
        I will open a ticket for it
      • iliekcomputers
        cool, thx!
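
The optimisation ishaanshah[m] raises above (range lookups becoming a binary search once listens are ordered by listened_at) can be illustrated outside Spark with a plain sorted list. This is only a sketch of the idea; it says nothing about how Spark or the parquet dumps would actually behave, and `listens_in_range` is a hypothetical helper name.

```python
from bisect import bisect_left, bisect_right
from typing import List


def listens_in_range(sorted_listened_at: List[int], start_ts: int, end_ts: int) -> List[int]:
    """Return the timestamps t with start_ts <= t <= end_ts.

    Assumes sorted_listened_at is already in ascending order; both
    boundaries are found by binary search instead of a full scan.
    """
    lo = bisect_left(sorted_listened_at, start_ts)   # first index with t >= start_ts
    hi = bisect_right(sorted_listened_at, end_ts)    # first index with t > end_ts
    return sorted_listened_at[lo:hi]
```

As noted in the conversation, the catch is the cost of producing the sorted data in the first place.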
      • ishaanshah[m]
        Other than that you had some things in mind to discuss?
      • iliekcomputers
        nah, just wanted to make sure we're aligned on the stats calculation comments I had.
      • ishaanshah[m]
        Oh, I agree with the separation part
      • iliekcomputers
        for the interval of calculation specifically.
      • ishaanshah[m]
        It's better to have different commands, because we will have a lot more stats now
      • iliekcomputers
        we should find the sunday before the latest listen and then calculate stats for that interval
      • or the 1st of the month before the latest listen and then stats for that interval
      • ishaanshah[m]
        But wouldn't that make the graphs static for the whole month?
      • iliekcomputers
        sorry, i'm not clear i guess.
      • suppose the latest listen is from May 16
      • then the calculation should be from May 1 to May 16
      • ishaanshah[m]
        That wouldn't exactly be last month, right?
      • iliekcomputers
        yeah.
      • ishaanshah[m]
        I was thinking if 16th may is the latest listen
      • Last week = 9th-16
      • Month = 16th april - 16th may
      • iliekcomputers
        hmm.
      • ishaanshah[m]
        That's what we are doing right now
      • I mean the PR
      • iliekcomputers
        i think the stats would be more useful if they're for a particular month vs a more-or-less random range.
      • like stats for the month of May vs stats for half of April + half of May
      • i'd say go with what I said for now, we'll change it if people want something different.
      • should be easy enough to change.
      • ishaanshah[m]
        Yep, Yep
      • What about week and year
      • iliekcomputers
        same concept for week to week, like Sunday to Sunday, we could show it on the site as stats for the week of May 11.
      • ishaanshah[m]
        should week be sunday to sunday too?
      • iliekcomputers
        and for year to year, stats for 2020
      • vs stats for may 2019 to may 2020
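
The calendar-aligned intervals iliekcomputers proposes above (start of the current week, month or year, up to the latest listen) could be computed roughly as follows. This is a minimal sketch, not the code from the PR: `stats_range` is a hypothetical name, and it assumes the week starts on the most recent Sunday on or before the latest listen.

```python
from datetime import datetime, timedelta
from typing import Tuple


def stats_range(latest_listen: datetime, period: str) -> Tuple[datetime, datetime]:
    """Return (start, end) of the calendar-aligned interval ending at latest_listen."""
    # Truncate to midnight so the interval starts at the beginning of a day.
    day = latest_listen.replace(hour=0, minute=0, second=0, microsecond=0)
    if period == "week":
        # weekday(): Monday=0 ... Sunday=6, so this counts days since the last Sunday.
        start = day - timedelta(days=(day.weekday() + 1) % 7)
    elif period == "month":
        start = day.replace(day=1)
    elif period == "year":
        start = day.replace(month=1, day=1)
    else:
        raise ValueError(f"unknown period: {period}")
    return start, latest_listen
```

With the example from the conversation, a latest listen on May 16, 2020 gives a month interval of May 1 to May 16 rather than April 16 to May 16.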
      • ishaanshah[m]
        Hmm
      • That eliminates the need for where queries :D
      • iliekcomputers
        heh
      • ishaanshah[m]
        So we speed up the pipeline 🎉
      • Another thing, I think we should look into more columns in db after some time
      • When we know more about the stats we are gonna calculate
      • iliekcomputers
        yeah, seeing the changes in the sql in the PR, i think more columns will be needed.
      • ishaanshah[m]
        Yeah, the query has become really complex and hard to understand
      • iliekcomputers
        true
      • ishaanshah[m]
        Also does PR #853 mean we will have a MB database on LB now?
      • iliekcomputers
        we have a connection in prod.
      • on the dev env we don't
      • ishaanshah[m]
        So we can make direct db calls to MB
      • iliekcomputers
        in LB we can, in spark rn, we can't
      • ishaanshah[m]
        Oh, noice
      • I was gonna need that for some of the stats :D
      • Anyways, I will make the changes you asked and update the tests :)
      • iliekcomputers
        awesome, thanks!
      • yvanzo
        reosarevok: thanks, published!
      • _lucifer
        ruaok: can you set up a google cloud project for me with editor access
      • i want to connect with firebase test lab
      • it requires the project billing to be enabled but i do not intend to exceed the free quota