#metabrainz

/

      • vardhan joined the channel
      • vardhan has quit
      • _BrainzGit
        [listenbrainz-server] 14amCap1712 merged pull request #3278 (03master…refactor-dumps-2): Refactor dumps code https://github.com/metabrainz/listenbrainz-serv...
      • vardhan joined the channel
      • Kladky joined the channel
      • lucifer[m]
        mamanullah: i was able to setup funkwhale locally, can you share the error you were getting again?
      • mamanullah7[m]
        <lucifer[m]> "mamanullah: i was able to..." <- lucifer: as such in terminal i dont thing i'm getting error
      • when i tried to open about my pod its completely blank!
      • mamanullah7[m] uploaded an image: (265KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/aBWChotYqHKAQdLlUHKNedWB/error.png >
      • mamanullah7[m] uploaded an image: (208KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/CUfzhcUEkUsRwrZuPBxvnnCv/pod.png >
      • vardhan has quit
      • mamanullah7[m] uploaded an image: (898KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/EVAdryndMsAPlEihmaMdFBwx/Screenshot%202025-05-21%20at%2011.15.05%E2%80%AFPM.png >
      • lucifer[m]
        m.amanullah7: that's expected if you don't have any local music library configured in funkwhale
      • to generate some fake data
      • mamanullah7[m]
        lucifer[m]: i've added this but once again i'll try!
      • mamanullah7[m] uploaded an image: (786KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/NHdaYbEOXQeaKHIVHjvVMFSV/Screenshot%202025-05-22%20at%2012.13.54%E2%80%AFPM.png >
      • mamanullah7[m] uploaded an image: (179KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/PEMgltZIaAzFHAcIOavEqUJT/Screenshot%202025-05-22%20at%2012.15.33%E2%80%AFPM.png >
      • lucifer[m]
        for DNS you have to run,... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • oh you are on mac.
      • mamanullah7[m]
        yes
      • lucifer[m]
        search that page for Add the DNS search domain and choose the mac option.
      • there are a couple of blog posts that you could help you set it up.
      • mamanullah7[m]
        okay let me check!
      • lucifer[m]
        mayhem: do you happen to have a funkwhale instance still around? that m.amanullah7 could use for testing.
      • i do see some public instances at https://www.funkwhale.audio/join/ so we can explore using one of them if suitable.
      • Sophist-UK has quit
      • nobiz has quit
      • mamanullah7[m] uploaded an image: (213KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/kCyWxFvwopbiQsVXoYOhUczO/Screenshot%202025-05-22%20at%201.22.57%E2%80%AFPM.png >
      • mamanullah7[m]
        lucifer: i did somehow 😒 but even after generating fake data also i'm getting `An unexpected error occurred.`... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • * lucifer: i did somehow 😒 but even after generating fake data also i'm getting `An unexpected error occurred.`... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • nobiz joined the channel
      • adhawkins[m]
        Trying it now. I do get the same collation messages while the reindex is going on. Will monitor the next replication to see if that fixes it.
      • Sintharu joined the channel
      • Sintharu
        Hi
      • adhawkins[m]
        That seems to have done the trick bitmap. Maybe this step could be done automatically whenever a new version of the container is installed?
      • mayhem[m]
        No, but outsidecontext might
      • ApeKattQuest joined the channel
      • Maxr1998_ joined the channel
      • Maxr1998 has quit
      • monkey[m]
        <Sintharu> "Hi..." <- Hello Sintharu (IRC)
      • <mamanullah7[m]> "Hey monkey aerozol i needed a..." <- Overall looks good. I don't have much else to say other than what aerozol said: I expect the visual style would follow what we currently do on the connect services page. The LastFM one has extra text inputs for a good example.
      • And like LastFM, maybe there will be a need for an 'edit' button that is active only when you are connected to these services? Say if you want to change the URL for example.
      • mamanullah7[m]
        <monkey[m]> "Overall looks good. I don't have..." <- okay i missed that edit one! i'll make sure to add this!
      • Thanks monkey aerozol i'll take care suggestions and i'll reach u out for further review!
      • petitminion joined the channel
      • davic has quit
      • spynxic has quit
      • spynxic joined the channel
      • davic joined the channel
      • Sintharu has quit
      • mglubb[m]
        Hi yvanzo . Just wanted to say that I'm happy with re-indexing now that I've applied the latest SIR updates and tuned its configuration. Seems to be in the same ballpark as it was before, in terms of time. Possibly a bit quicker. Thank you for your service!
      • davic has quit
      • Sintharu joined the channel
      • Sophist-UK joined the channel
      • davic joined the channel
      • Kladky has quit
      • Kladky joined the channel
      • Sintharu has quit
      • Sintharu joined the channel
      • holycow23[m] joined the channel
      • holycow23[m]
        Hey lucifer, I was actually looking into the TimeScale DB to run the a couple of queries over the listens and noticed that it doesn't have the `artist_mbid` or the `recording_mbid` to it, so how exactly do I fetch the same on local for the listens?
      • <holycow23[m]> "Hey lucifer, I was actually..." <- Also needed some more assistance with the working of stats, so could we get on a quick zoom call maybe?
      • s/more/help/, s/assistance//
      • petitminion has quit
      • petitminion joined the channel
      • yvanzo[m]
        Hi mglubb, glad it works for your mirror, sharing your thanks with bitmap and lucifer who made it possible too.
      • lucifer: suggested small changes
      • lucifer[m]
        yvanzo: just approved them, thanks!
      • <holycow23[m]> "Also needed some more assistance..." <- let's try to work it out over chat first and if it doesn't clear up we can do a call later.
      • holycow23[m]
        lucifer[m]: Cool
      • yvanzo[m]
        Great, on releasing sir then!
      • lucifer[m]
        <holycow23[m]> "Hey lucifer, I was actually..." <- those will come from the mapping data, i'll put up a branch with sample dumps import tomorrow and then you can do a join to `mapping.mb_metadata_cache` table to get artist mbids for listens.
      • holycow23[m]
        Okay cool
      • _BrainzGit
        [sir] 14yvanzo merged pull request #167 (03master…pyproject): Migrate to pyproject.toml and fix docs build https://github.com/metabrainz/sir/pull/167
      • yvanzo[m]
        Also updated repo webhooks
      • holycow23[m]
        The cron job for the stats how often does it run, I found the file running the cron at /docker/services/cron/crontab, basically I wanna know so if the weekly is chosen then does it run weekly or does it run daily and update the individual time ranges at once?
      • julian45[m]
        lucifer: you might already be aware, but just a heads up that the `stable` view of the sir docs (which seems to be the default when browsing to the RTD pages) doesn't yet have the documentation updates you've made, e.g., the [setup page](https://sir.readthedocs.io/en/stable/setup/index.html) still refers to `python2`
      • lucifer[m]
        julian45: i am not sure if yvanzo has released the new version yet.
      • yvanzo[m]
        On it…
      • lucifer[m]
        just checked RTD dashboard, once the release is done stable should update automatically.
      • _BrainzGit
        [sir] release 03v4.0.1 has been published by 14yvanzo: https://github.com/metabrainz/sir/releases/tag/...
      • lucifer[m]
        the link preview here is probably cached but it has updated to 4.0.1 now.
      • petitminion has quit
      • yvanzo[m]
        I disabled link preview, personally.
      • lucifer[m]
        ah okay
      • holycow23[m]
        * lucifer: The cron job for the stats how often does it run, I found the file running the cron at `/docker/services/cron/crontab`, basically I wanna know so if the weekly is chosen then does it run weekly or does it run daily and update the individual time ranges at once?
      • <holycow23[m]> "The cron job for the stats how..." <- also the `get_aggregate_query` is based on the listens table so, over a period of time won't it have all the listens for a period of time or do you filter the listens for a period and then generate the results?
      • outsidecontext[m
        <lucifer[m]> "mayhem: do you happen to have..." <- I still do. But I assume you will need OpenSubsonic support (for the MBIDs), as troi requires it?
      • I currently run the release version of funkwhale, but the opensubsonic support is not yet released.
      • lucifer[m]
        <holycow23[m]> "also the `get_aggregate_query..." <- can you point me to the query on github?
      • yvanzo[m]
        zas: Something went wrong with Solr backup, running it in 3min again.
      • pite_ joined the channel
      • _BrainzGit
        [listenbrainz-server] 14miki-tebe opened pull request #3283 (03master…add-shuffle): Add a shuffle button to shuffle tracks that are in the queue https://github.com/metabrainz/listenbrainz-serv...
      • holycow23[m]
        <lucifer[m]> "can you point me to the query on..." <- You could refer to [this](https://github.com/metabrainz/listenbrainz-server/blob/master/listenbrainz_spark/stats/incremental/user/listening_activity.py#L27)
      • pite has quit
      • lucifer[m]
        holycow23: yes the filtering is done here: https://github.com/metabrainz/listenbrainz-serv...
      • holycow23[m]
        lucifer[m]: And what is the frequency of the stats update is it done daily?
      • lucifer[m]
        yes
      • holycow23[m]
        Okay
      • Now for example, if I need to do the era stats, it will be based on the release date so that also will be fetched from the dump right?
      • But this is the MB Dump right not the Spark dump
      • lucifer[m]
        [@holycow23:matrix.org](https://matrix.to/#/@holycow23:matrix.org) not the MB dump but directly from the MB db, we already have that stat for year in music iiuc.
      • we have the postgres queries to retrieve the data from the MB db
      • in listenbrainz_spark/postgres.
      • holycow23[m]
        Yes correct the era one is already in the "Your year in Music 20xx" done but for testing it on local, I could use the MB Dump since I can't use the MB db directly
      • lucifer[m]
        That data is cached in hdfs and refreshed daily before stats cron job run.
      • The json dump format is different from the database.
      • holycow23[m]
        So, for developement of the stats how would I access the DB?
      • lucifer[m]
        So it would not work. For this stat, I can export the data and you can import it in your local database.
      • For other stats, I can provide you with access to a full mb db replica. You can develop and test your queries on that and then I can export the data for using that query.
      • holycow23[m]
        Cool, lemme know how to access the mb db replica
      • lucifer[m]
        You can also connect your local spark cluster to a full mb db replica hosted on our servers. Or run a spark cluster on wolf. But that can be slower.
      • holycow23[m]
        Could you guide me with connecting the local spark cluster with the full replica?
      • Also how do you write queries connecting two different databases, or do you just run individual queries on both?
      • lucifer[m]
        The two databases as in?
      • holycow23[m]
        Listens would be timescale_db and information regarding the songs would be
      • * Listens would be timescale_db and information regarding the songs would be mb_db
      • lucifer[m]
        The listens are imported using dumps in spark
      • The addition metadata is brought in from MB db
      • And then joined together and processed in spark
      • holycow23[m]
        So you don't use the timescale_db?
      • lucifer[m]
        No that is not used for statistics at all
      • holycow23[m]
        aah got it
      • got it
      • lucifer[m]
        It is only used directly for listens page.
      • All the stats, recommendations etc. is done in spark where listen data is imported from dumps.
      • holycow23[m]
        Got it, so spark has the listens as well as the info related to recordings, so you run queries over spark and then update the stats
      • lucifer[m]
        Yes.
      • holycow23[m]
        Thank you so much
      • Also till when will the dump be generated?
      • lucifer[m]
        The sample data dump?
      • holycow23[m]
        The spark dump
      • lucifer[m]
        The spark dumps are generated daily for production
      • Do you mean the metadata like release dates etc?
      • holycow23[m]
        Okay so how exactly would I proceed with my project since I will need the metadata with the listens
      • To run queries
      • lucifer[m]
        Should be ready early next week.
      • holycow23[m]
        Okay, is there anything that I can do before the dump is ready, wanted to start a little early actually
      • lucifer[m]
        The data can be exported today but I need to add the import code in spark.
      • holycow23[m]
        Okay
      • lucifer[m]
        I think you can write the api and frontend side of things meanwhile.
      • Using some hardcoded dummy data for testing.
      • holycow23[m]
        Okay
      • Thanks
      • petitminion joined the channel
      • _BrainzGit
        [musicbrainz-server] 14mwiencek opened pull request #3546 (03production…mbs-14032): MBS-14032: Build temporary `release_first_release_date` table for MBS-13966 https://github.com/metabrainz/musicbrainz-serve...
      • BrainzBot
        MBS-14032: The schema 30 upgrade script for MBS-13966 requires release_first_release_date to have been built https://tickets.metabrainz.org/browse/MBS-14032
      • MBS-13966: Calculate first release dates for empty release groups https://tickets.metabrainz.org/browse/MBS-13966