#metabrainz

/

      • lucifer
        yes
      • 2022-10-19 29237, 2022

      • lucifer
        oh there's also the option to compress the cache with zstd instead of gzip.
      • 2022-10-19 29253, 2022

      • lucifer
        in your experience with MLHD stuff, is zstd faster than gzip?
      • 2022-10-19 29228, 2022

      • alastairp
        yes, do it
      • 2022-10-19 29244, 2022

      • lucifer
        compression level?
      • 2022-10-19 29203, 2022

      • alastairp
        ~7 is a pretty good compromise
      • 2022-10-19 29208, 2022

      • alastairp
        are we looking for speed or size?
      • 2022-10-19 29217, 2022

      • lucifer
        speed
      • 2022-10-19 29226, 2022

      • alastairp
        I guess that's the tradeoff - speed means that building is faster but pushing the cache is slower
      • 2022-10-19 29238, 2022

      • lucifer
        note that the action will have to download/upload the cache as well
      • 2022-10-19 29241, 2022

      • lucifer
        right
      • 2022-10-19 29227, 2022

      • alastairp
        zstd -3 is ~the same compression level as gzip but a million times faster
      • 2022-10-19 29259, 2022

      • lucifer
        i see.
      • 2022-10-19 29213, 2022

      • alastairp
        once you go up to ~ -9 you probably need threads to keep up with the speed. I don't know how many cores are in the workers, or if you can set that parameter
      • 2022-10-19 29238, 2022

      • alastairp
      • 2022-10-19 29248, 2022

      • alastairp
        I don't have values for gzip for the same thing, I should do that
      • 2022-10-19 29221, 2022

      • lucifer
        2 cores iirc.
      • 2022-10-19 29249, 2022

      • lucifer
        cool, lets try 7 if its slow then try 3.
      • 2022-10-19 29202, 2022

      • alastairp
      • 2022-10-19 29217, 2022

      • alastairp
        so we could try 9 with 2 threads too
      • 2022-10-19 29220, 2022

      • alastairp
        but yeah, let's test it
      • 2022-10-19 29231, 2022

      • Hellow1 has quit
      • 2022-10-19 29243, 2022

      • BrainzGit
        [listenbrainz-server] 14amCap1712 opened pull request #2211 (03master…multi-stage-build): Split production image into multiple stages https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29221, 2022

      • lucifer
        mayhem: when you have time, please review LB#2199,2205
      • 2022-10-19 29221, 2022

      • BrainzBot
        Add lookups to retrieve spotify track ids using metadata or recording mbids: https://github.com/metabrainz/listenbrainz-server…, Update spotify playlist export: https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29255, 2022

      • lucifer
        monkey, for #2205 has some frontend changes as well so would be nice to get your review on it as well.
      • 2022-10-19 29228, 2022

      • mayhem
        on 2199 right now
      • 2022-10-19 29251, 2022

      • lucifer
        ah great, thanks
      • 2022-10-19 29218, 2022

      • BrainzGit
        [troi-recommendation-playground] 14amCap1712 opened pull request #74 (03main…remove-msid): Remove artist_msid and release_msid https://github.com/metabrainz/troi-recommendation…
      • 2022-10-19 29242, 2022

      • BrainzGit
        [listenbrainz-server] 14amCap1712 merged pull request #2208 (03master…remove-msid): Remove artist msid and release msid https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29210, 2022

      • lucifer
        alastairp: oh another feature, you might find useful. https://github.blog/2022-10-03-highlights-from-gi…
      • 2022-10-19 29259, 2022

      • Hellow1 joined the channel
      • 2022-10-19 29218, 2022

      • Hellow1 has quit
      • 2022-10-19 29216, 2022

      • bitmap
        zas: we could add the suffix, I don't think there was a reason to leave them out
      • 2022-10-19 29215, 2022

      • zas
        bitmap: ok, I'll do it, and change grafana panels accordingly
      • 2022-10-19 29251, 2022

      • reosarevok
        bitmap: oh, I see that Moose anyway implies strict + warnings
      • 2022-10-19 29258, 2022

      • reosarevok
        So we're probably not too bad in that regard anywya
      • 2022-10-19 29201, 2022

      • reosarevok
        *anyway
      • 2022-10-19 29245, 2022

      • reosarevok
        Most of our stuff missing stricture is tests
      • 2022-10-19 29238, 2022

      • reosarevok
        But we should probably look into using both https://metacpan.org/pod/Perl::Critic::Policy::Te… and https://metacpan.org/pod/Perl::Critic::Policy::Te… unless there's a reason not to
      • 2022-10-19 29247, 2022

      • reosarevok
      • 2022-10-19 29237, 2022

      • bitmap
        reosarevok: I see no reason not to require use strict/warnings everywhere
      • 2022-10-19 29246, 2022

      • reosarevok
        Ok :)
      • 2022-10-19 29235, 2022

      • lucifer
        mayhem: let's close LB#2092 ?
      • 2022-10-19 29236, 2022

      • BrainzBot
      • 2022-10-19 29236, 2022

      • Hellow1 joined the channel
      • 2022-10-19 29242, 2022

      • lucifer
        mayhem: for LB#2199, i didn't add a cron job to rebuild it periodically. how often should this index be rebuilt?
      • 2022-10-19 29242, 2022

      • BrainzBot
        Add lookups to retrieve spotify track ids using metadata or recording mbids: https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29231, 2022

      • mayhem
        weekly I thin. on or near fridays?
      • 2022-10-19 29245, 2022

      • BrainzGit
        [listenbrainz-server] 14mayhem closed pull request #2092 (03master…spotify-metadata-cache): Spotify metadata cache (WIP) https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29200, 2022

      • lucifer
        1 a.m. every friday?
      • 2022-10-19 29216, 2022

      • mayhem
        sure.
      • 2022-10-19 29237, 2022

      • mayhem
        but that begs the question: how can we be sure that we will have the latest data?
      • 2022-10-19 29250, 2022

      • mayhem
        does spotify have a "new releases" page that we can harvest IDs from?
      • 2022-10-19 29213, 2022

      • lucifer
        it has a new releases endpoint but its limited to 100 per market iirc.
      • 2022-10-19 29254, 2022

      • mayhem
        without pagination?
      • 2022-10-19 29213, 2022

      • lucifer
        with afaik.
      • 2022-10-19 29230, 2022

      • mayhem
        well, 100 is better than nothing. we should add a cron job to check that list daily...
      • 2022-10-19 29251, 2022

      • lucifer
      • 2022-10-19 29257, 2022

      • lucifer
        yeah, nothing at offset 100.
      • 2022-10-19 29220, 2022

      • mayhem
        well, lets start with that.
      • 2022-10-19 29254, 2022

      • lucifer
        we can poll that endpoint for each country daily.
      • 2022-10-19 29226, 2022

      • lucifer
        i am wondering where to put this. i think we wanted a way to be able to queue track ids to the cache in general.
      • 2022-10-19 29254, 2022

      • mayhem
        this is where a dedicated RMQ queue would've been advantageous.
      • 2022-10-19 29247, 2022

      • lucifer
        we do have a dedicated queue fwiw. the exchange is same.
      • 2022-10-19 29213, 2022

      • lucifer
        let me see if we can also write to just that queue as well.
      • 2022-10-19 29221, 2022

      • mayhem
        yes, but I mean a queue that speaks something other than listens as the main format.
      • 2022-10-19 29209, 2022

      • lucifer
        i see, yes makes sense.
      • 2022-10-19 29204, 2022

      • v6lur joined the channel
      • 2022-10-19 29206, 2022

      • Hellow1 has quit
      • 2022-10-19 29217, 2022

      • alastairp
      • 2022-10-19 29240, 2022

      • alastairp
        at the moment we're only passing user_ref if confidental=True
      • 2022-10-19 29252, 2022

      • alastairp
        but it's needed in regular data views: https://github.com/metabrainz/critiquebrainz/pull…
      • 2022-10-19 29249, 2022

      • alastairp
        previously we used `user.id`, I think it makes sense here that `user.musicbrainz_username` should be public too, along with user_ref. email + license_choice if confidental
      • 2022-10-19 29201, 2022

      • lucifer
        alastairp: ah i see. yes makes sense.
      • 2022-10-19 29226, 2022

      • lucifer
        musicbrainz_username is public everywhere so makes sense to be public here as well.
      • 2022-10-19 29216, 2022

      • alastairp
        yeah, exactly. it's been private for at least 6 years, but not sure why that decision was initially made.
      • 2022-10-19 29228, 2022

      • BrainzGit
        [musicbrainz-server] 14reosarevok opened pull request #2707 (03master…MBS-12671): MBS-12671: Convert Edit user page to React https://github.com/metabrainz/musicbrainz-server/…
      • 2022-10-19 29209, 2022

      • lucifer
        mayhem: ok, i checked, we can't resuse that queue without sending items to other queues as well. so i guess a new queue it is then?
      • 2022-10-19 29223, 2022

      • lucifer
        we can keep 2 queues if we don't want to touch ts writer. add a new queue only for manually seeding album ids and keep using existing one for listens.
      • 2022-10-19 29207, 2022

      • Hellow1 joined the channel
      • 2022-10-19 29252, 2022

      • Maxr1998 joined the channel
      • 2022-10-19 29200, 2022

      • Maxr1998_ has quit
      • 2022-10-19 29231, 2022

      • mayhem
        sounds good.
      • 2022-10-19 29250, 2022

      • lucifer
        to clarify, 2 queues sounds good?
      • 2022-10-19 29200, 2022

      • mayhem
        yes
      • 2022-10-19 29204, 2022

      • lucifer
        👍
      • 2022-10-19 29242, 2022

      • alastairp
        bitmap: btw, kafka pruned 60gb of data overnight
      • 2022-10-19 29250, 2022

      • BrainzGit
        [listenbrainz-server] 14amCap1712 merged pull request #2199 (03master…spotify-release-index): Add lookups to retrieve spotify track ids using metadata or recording mbids https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29233, 2022

      • BrainzGit
        [troi-recommendation-playground] 14amCap1712 merged pull request #74 (03main…remove-msid): Remove artist_msid and release_msid https://github.com/metabrainz/troi-recommendation…
      • 2022-10-19 29246, 2022

      • lucifer
        time for a troi release!
      • 2022-10-19 29245, 2022

      • BrainzGit
        [troi-recommendation-playground] release 03v-2022-10-19 has been published by 14amCap1712: https://github.com/metabrainz/troi-recommendation…
      • 2022-10-19 29214, 2022

      • bitmap
        alastairp: wow! do you have an idea of what is being stored in kafka? I thought postgres was the primary data store and these were "unprocessed" events
      • 2022-10-19 29215, 2022

      • alastairp
        nope, no idea what's in there. I agree, I think that pg should be the primary store too
      • 2022-10-19 29246, 2022

      • lucifer
        what's the retention policy of the kafka instance?
      • 2022-10-19 29203, 2022

      • alastairp
        I'm looking through the logs and can't find any clear indication about what happened. So I'm not sure if it's the config items we added, or the cleanup that you did, or it goes and re-packs db files in the background?
      • 2022-10-19 29214, 2022

      • alastairp
        lucifer: theoretically it was always configured to 24h
      • 2022-10-19 29215, 2022

      • lucifer
        events aren't deleted asap after processing but only when the retention policy is met afaik.
      • 2022-10-19 29232, 2022

      • alastairp
        but yesterday we tried to add a "no more than 50gb" limit too (I later lowered it down to 40gb)
      • 2022-10-19 29211, 2022

      • alastairp
        but I'm not completely sure how these limits work when both of them are in place, or exactly how kafka splits the db up for doing cleanups
      • 2022-10-19 29248, 2022

      • alastairp
        also unsure about when it decides to delete something in respect to when it starts up. the forum posts that bitmap and I found seemed to indicate that it'd start pruning "immediately", but it took about 4h from when I restarted it yesterday
      • 2022-10-19 29202, 2022

      • alastairp
        and no log informations about which policy triggered this operation in the end (size or time)
      • 2022-10-19 29239, 2022

      • Hellow1 has quit
      • 2022-10-19 29256, 2022

      • lucifer
        alastairp: i see. iiuc, that limit is still per topic. i am interpreting the output of kafka-configs in container correctly.
      • 2022-10-19 29242, 2022

      • mayhem
        lucifer: what else should I review?
      • 2022-10-19 29208, 2022

      • lucifer
      • 2022-10-19 29209, 2022

      • mayhem
        done.
      • 2022-10-19 29218, 2022

      • mayhem
        am I caught up yet?
      • 2022-10-19 29223, 2022

      • mayhem
        what an avalance!
      • 2022-10-19 29206, 2022

      • lucifer
        yup caught up!
      • 2022-10-19 29212, 2022

      • lucifer
        thanks for all the reviews!
      • 2022-10-19 29249, 2022

      • mayhem
        thanks for the mountains of code needed to get this feature done. Even Kirsten (user Damselfish) thanks you -- she is quite excited to get this feature going.
      • 2022-10-19 29206, 2022

      • lucifer
        hehe. nice!
      • 2022-10-19 29253, 2022

      • lucifer
        mayhem: on that last PR, thoughts on https://github.com/metabrainz/listenbrainz-server… ?
      • 2022-10-19 29251, 2022

      • mayhem
        clearing the setting makes sense to me. what other options do you see?
      • 2022-10-19 29241, 2022

      • lucifer
        we could do nothing, the access token is removed on disconnecting so it would become null in the query. the benefit of not doing anything is that if the user reconnects then they won't have to enable this again.
      • 2022-10-19 29212, 2022

      • lucifer
        and changing permissions from say record only listens to play only or both. or any other combination involves a disconnect.
      • 2022-10-19 29256, 2022

      • mayhem
        Not doing anything, so that the user can disconnect, then reconnect and not lose out sounds best.
      • 2022-10-19 29209, 2022

      • Hellow2 joined the channel
      • 2022-10-19 29211, 2022

      • lucifer
        👍
      • 2022-10-19 29233, 2022

      • mayhem
        can we check the missing access token before doing the troi work to generate a playlist?
      • 2022-10-19 29255, 2022

      • mayhem
        carrying out the wolrk and then not being able to deliver it would be wasteful.
      • 2022-10-19 29222, 2022

      • lucifer
        yes can do that. but we still want to generate a playlist in any case, no? just that for null token don't attempt to upload
      • 2022-10-19 29248, 2022

      • lucifer
        *don't attempt to upload to spotify but still upload to LB.
      • 2022-10-19 29251, 2022

      • mayhem
        yes, good point. thats all good then.
      • 2022-10-19 29219, 2022

      • Hellow2
        Hi :) Thank's to all that answered my questions. I have been able to completly implement the tagger today.. So thanks <33
      • 2022-10-19 29238, 2022

      • mayhem
        great to hear that!
      • 2022-10-19 29250, 2022

      • mayhem didn't know the world needed another tagger in 2022.
      • 2022-10-19 29210, 2022

      • Hellow2
        yea actually not, but it is a complete music cli downloader cuz I am tired of stuff that I listen to not being on spotify xD The tagger is just a part of it
      • 2022-10-19 29236, 2022

      • mayhem
        ah, I see. good to hear.
      • 2022-10-19 29243, 2022

      • Hellow2
        yep :D
      • 2022-10-19 29206, 2022

      • mayhem
        I've been talking about the growing backlash against spotify and how people are embracing their music collections again. a very good development, if you ask me.
      • 2022-10-19 29239, 2022

      • mayhem
        we're clearly keen to get people to be more engaged with music and if spotify is making people numb to music, then by all means, lets change and improve!
      • 2022-10-19 29205, 2022

      • Hellow2
        I've really agree. Though I don't see many people that don't use spotify. To listen to much music you need lots of storage, you need to sync you're library between phone and laptop. The syncing may be easy with scp and simmilar but the storage can be a huge problem. Also the affort of building a good enough library are probaply far to high for the masses to ditch spotify.
      • 2022-10-19 29233, 2022

      • Hellow2
        anyways if anyone wants to check out the downloader I made feel free to do so. I'd appreciate some feedback https://github.com/HeIIow2/music-downloader :3
      • 2022-10-19 29222, 2022

      • lucifer
        alastairp: hi! 2 comments on https://github.com/metabrainz/listenbrainz-server… pending resolution. do you agree with it or should i make changes?
      • 2022-10-19 29251, 2022

      • BrainzGit
        [musicbrainz-server] 14reosarevok opened pull request #2708 (03master…RequireUseStrict): Require use strict / use warnings with Perl::Critic https://github.com/metabrainz/musicbrainz-server/…
      • 2022-10-19 29244, 2022

      • alastairp
        lucifer: those are both fine
      • 2022-10-19 29203, 2022

      • lucifer
        thanks! 👍
      • 2022-10-19 29232, 2022

      • BrainzGit
        [listenbrainz-server] 14amCap1712 merged pull request #2155 (03master…inc-update-mb-metadata-cache): Incrementally update mb_metadata_cache https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29247, 2022

      • chrisshepherd has quit
      • 2022-10-19 29251, 2022

      • ssam has quit
      • 2022-10-19 29242, 2022

      • chrisshepherd joined the channel
      • 2022-10-19 29231, 2022

      • ssam joined the channel
      • 2022-10-19 29206, 2022

      • BrainzGit
        [listenbrainz-server] release 03v-2022-10-19.0 has been published by 14github-actions[bot]: https://github.com/metabrainz/listenbrainz-server…
      • 2022-10-19 29239, 2022

      • ssam has quit
      • 2022-10-19 29247, 2022

      • chrisshepherd has quit
      • 2022-10-19 29254, 2022

      • chrisshepherd joined the channel
      • 2022-10-19 29255, 2022

      • ssam joined the channel
      • 2022-10-19 29225, 2022

      • akshaaatt
        Hi lucifer, riksucks, Pratha-Fish , yellowhatpro , ansh , Shubh, I’ll be kn Delhi starting tomorrow for the next few weeks. I have your MetaBrainz summit t-shirts and lots of metabrainz stickers for you guys! Please connect with me on how you would like to collect them. I would prefer we all meet at one place on a day, chat around about MetaBrainz, have fun and use this as an excuse for a get together!