in #metabrainz

1:56 AM
vardhan joined the channel
3:05 AM
vardhan has quit
5:42 AM
_BrainzGit

[listenbrainz-server] 14amCap1712 merged pull request #3278 (03master…refactor-dumps-2): Refactor dumps code https://github.com/metabrainz/listenbrainz-serv...
5:55 AM
vardhan joined the channel
6:12 AM
Kladky joined the channel
6:12 AM
lucifer[m]

mamanullah: i was able to setup funkwhale locally, can you share the error you were getting again?
6:20 AM
mamanullah7[m]

<lucifer[m]> "mamanullah: i was able to..." <- lucifer: as such in terminal i dont thing i'm getting error
6:20 AM
when i tried to open about my pod its completely blank!
6:24 AM
mamanullah7[m] uploaded an image: (265KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/aBWChotYqHKAQdLlUHKNedWB/error.png >
6:27 AM
mamanullah7[m] uploaded an image: (208KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/CUfzhcUEkUsRwrZuPBxvnnCv/pod.png >
6:29 AM
vardhan has quit
6:30 AM
mamanullah7[m] uploaded an image: (898KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/EVAdryndMsAPlEihmaMdFBwx/Screenshot%202025-05-21%20at%2011.15.05%E2%80%AFPM.png >
6:35 AM
lucifer[m]

m.amanullah7: that's expected if you don't have any local music library configured in funkwhale
6:35 AM
you can run https://docs.funkwhale.audio/develop/developer/...
6:36 AM
to generate some fake data
6:39 AM
mamanullah7[m]

lucifer[m]: i've added this but once again i'll try!
6:44 AM
mamanullah7[m] uploaded an image: (786KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/NHdaYbEOXQeaKHIVHjvVMFSV/Screenshot%202025-05-22%20at%2012.13.54%E2%80%AFPM.png >
6:45 AM
mamanullah7[m] uploaded an image: (179KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/PEMgltZIaAzFHAcIOavEqUJT/Screenshot%202025-05-22%20at%2012.15.33%E2%80%AFPM.png >
6:46 AM
lucifer[m]

for DNS you have to run,... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
6:47 AM
oh you are on mac.
6:47 AM
mamanullah7[m]

yes
6:47 AM
lucifer[m]

search that page for Add the DNS search domain and choose the mac option.
6:48 AM
there are a couple of blog posts that you could help you set it up.
6:49 AM
mamanullah7[m]

okay let me check!
6:50 AM
lucifer[m]

mayhem: do you happen to have a funkwhale instance still around? that m.amanullah7 could use for testing.
6:51 AM
i do see some public instances at https://www.funkwhale.audio/join/ so we can explore using one of them if suitable.
7:30 AM
Sophist-UK has quit
7:30 AM
nobiz has quit
7:53 AM
mamanullah7[m] uploaded an image: (213KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/kCyWxFvwopbiQsVXoYOhUczO/Screenshot%202025-05-22%20at%201.22.57%E2%80%AFPM.png >
7:54 AM
mamanullah7[m]

lucifer: i did somehow 😒 but even after generating fake data also i'm getting `An unexpected error occurred.`... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
7:55 AM
* lucifer: i did somehow 😒 but even after generating fake data also i'm getting `An unexpected error occurred.`... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
7:56 AM
nobiz joined the channel
7:58 AM
adhawkins[m]

Trying it now. I do get the same collation messages while the reindex is going on. Will monitor the next replication to see if that fixes it.
7:59 AM
Sintharu joined the channel
7:59 AM
Sintharu

Hi
8:17 AM
adhawkins[m]

That seems to have done the trick bitmap. Maybe this step could be done automatically whenever a new version of the container is installed?
8:37 AM
mayhem[m]

No, but outsidecontext might
8:40 AM
ApeKattQuest joined the channel
8:50 AM
Maxr1998_ joined the channel
8:50 AM
Maxr1998 has quit
9:04 AM
monkey[m]

<Sintharu> "Hi..." <- Hello Sintharu (IRC)
9:12 AM
<mamanullah7[m]> "Hey monkey aerozol i needed a..." <- Overall looks good. I don't have much else to say other than what aerozol said: I expect the visual style would follow what we currently do on the connect services page. The LastFM one has extra text inputs for a good example.
9:13 AM
And like LastFM, maybe there will be a need for an 'edit' button that is active only when you are connected to these services? Say if you want to change the URL for example.
9:25 AM
mamanullah7[m]

<monkey[m]> "Overall looks good. I don't have..." <- okay i missed that edit one! i'll make sure to add this!
9:25 AM
Thanks monkey aerozol i'll take care suggestions and i'll reach u out for further review!
9:50 AM
petitminion joined the channel
10:41 AM
davic has quit
10:51 AM
spynxic has quit
10:52 AM
spynxic joined the channel
11:04 AM
davic joined the channel
11:07 AM
Sintharu has quit
11:10 AM
mglubb[m]

Hi yvanzo . Just wanted to say that I'm happy with re-indexing now that I've applied the latest SIR updates and tuned its configuration. Seems to be in the same ballpark as it was before, in terms of time. Possibly a bit quicker. Thank you for your service!
11:19 AM
davic has quit
11:21 AM
Sintharu joined the channel
11:22 AM
Sophist-UK joined the channel
11:37 AM
davic joined the channel
11:49 AM
Kladky has quit
11:49 AM
Kladky joined the channel
11:58 AM
Sintharu has quit
12:31 PM
Sintharu joined the channel
13:03 PM
holycow23[m] joined the channel
13:03 PM
holycow23[m]

Hey lucifer, I was actually looking into the TimeScale DB to run the a couple of queries over the listens and noticed that it doesn't have the `artist_mbid` or the `recording_mbid` to it, so how exactly do I fetch the same on local for the listens?
13:18 PM
<holycow23[m]> "Hey lucifer, I was actually..." <- Also needed some more assistance with the working of stats, so could we get on a quick zoom call maybe?
13:30 PM
s/more/help/, s/assistance//
13:43 PM
petitminion has quit
13:49 PM
petitminion joined the channel
13:52 PM
yvanzo[m]

Hi mglubb, glad it works for your mirror, sharing your thanks with bitmap and lucifer who made it possible too.
13:54 PM
lucifer: suggested small changes
13:55 PM
lucifer[m]

yvanzo: just approved them, thanks!
13:55 PM
<holycow23[m]> "Also needed some more assistance..." <- let's try to work it out over chat first and if it doesn't clear up we can do a call later.
13:56 PM
holycow23[m]

lucifer[m]: Cool
13:56 PM
yvanzo[m]

Great, on releasing sir then!
13:57 PM
lucifer[m]

<holycow23[m]> "Hey lucifer, I was actually..." <- those will come from the mapping data, i'll put up a branch with sample dumps import tomorrow and then you can do a join to `mapping.mb_metadata_cache` table to get artist mbids for listens.
13:57 PM
holycow23[m]

Okay cool
13:59 PM
_BrainzGit

[sir] 14yvanzo merged pull request #167 (03master…pyproject): Migrate to pyproject.toml and fix docs build https://github.com/metabrainz/sir/pull/167
14:00 PM
yvanzo[m]

Also updated repo webhooks
14:00 PM
holycow23[m]

The cron job for the stats how often does it run, I found the file running the cron at /docker/services/cron/crontab, basically I wanna know so if the weekly is chosen then does it run weekly or does it run daily and update the individual time ranges at once?
14:05 PM
julian45[m]

lucifer: you might already be aware, but just a heads up that the `stable` view of the sir docs (which seems to be the default when browsing to the RTD pages) doesn't yet have the documentation updates you've made, e.g., the [setup page](https://sir.readthedocs.io/en/stable/setup/index.html) still refers to `python2`
14:07 PM
lucifer[m]

julian45: i am not sure if yvanzo has released the new version yet.
14:08 PM
yvanzo[m]

On it…
14:08 PM
lucifer[m]

just checked RTD dashboard, once the release is done stable should update automatically.
14:09 PM
https://sir.readthedocs.io/en/stable/setup/inde...
14:09 PM
_BrainzGit

[sir] release 03v4.0.1 has been published by 14yvanzo: https://github.com/metabrainz/sir/releases/tag/...
14:10 PM
lucifer[m]

the link preview here is probably cached but it has updated to 4.0.1 now.
14:10 PM
petitminion has quit
14:10 PM
yvanzo[m]

I disabled link preview, personally.
14:12 PM
lucifer[m]

ah okay
14:16 PM
holycow23[m]

* lucifer: The cron job for the stats how often does it run, I found the file running the cron at `/docker/services/cron/crontab`, basically I wanna know so if the weekly is chosen then does it run weekly or does it run daily and update the individual time ranges at once?
14:19 PM
<holycow23[m]> "The cron job for the stats how..." <- also the `get_aggregate_query` is based on the listens table so, over a period of time won't it have all the listens for a period of time or do you filter the listens for a period and then generate the results?
14:19 PM
outsidecontext[m

<lucifer[m]> "mayhem: do you happen to have..." <- I still do. But I assume you will need OpenSubsonic support (for the MBIDs), as troi requires it?
14:23 PM
I currently run the release version of funkwhale, but the opensubsonic support is not yet released.
14:26 PM
lucifer[m]

<holycow23[m]> "also the `get_aggregate_query..." <- can you point me to the query on github?
14:27 PM
yvanzo[m]

zas: Something went wrong with Solr backup, running it in 3min again.
14:35 PM
pite_ joined the channel
14:35 PM
_BrainzGit

[listenbrainz-server] 14miki-tebe opened pull request #3283 (03master…add-shuffle): Add a shuffle button to shuffle tracks that are in the queue https://github.com/metabrainz/listenbrainz-serv...
14:37 PM
holycow23[m]

<lucifer[m]> "can you point me to the query on..." <- You could refer to [this](https://github.com/metabrainz/listenbrainz-server/blob/master/listenbrainz_spark/stats/incremental/user/listening_activity.py#L27)
14:38 PM
pite has quit
14:42 PM
lucifer[m]

holycow23: yes the filtering is done here: https://github.com/metabrainz/listenbrainz-serv...
14:44 PM
holycow23[m]

lucifer[m]: And what is the frequency of the stats update is it done daily?
14:45 PM
lucifer[m]

yes
14:46 PM
holycow23[m]

Okay
14:47 PM
Now for example, if I need to do the era stats, it will be based on the release date so that also will be fetched from the dump right?
14:47 PM
But this is the MB Dump right not the Spark dump
15:06 PM
lucifer[m]

[@holycow23:matrix.org](https://matrix.to/#/@holycow23:matrix.org) not the MB dump but directly from the MB db, we already have that stat for year in music iiuc.
15:07 PM
we have the postgres queries to retrieve the data from the MB db
15:07 PM
in listenbrainz_spark/postgres.
15:08 PM
holycow23[m]

Yes correct the era one is already in the "Your year in Music 20xx" done but for testing it on local, I could use the MB Dump since I can't use the MB db directly
15:08 PM
lucifer[m]

That data is cached in hdfs and refreshed daily before stats cron job run.
15:08 PM
The json dump format is different from the database.
15:09 PM
holycow23[m]

So, for developement of the stats how would I access the DB?
15:09 PM
lucifer[m]

So it would not work. For this stat, I can export the data and you can import it in your local database.
15:11 PM
For other stats, I can provide you with access to a full mb db replica. You can develop and test your queries on that and then I can export the data for using that query.
15:12 PM
holycow23[m]

Cool, lemme know how to access the mb db replica
15:12 PM
lucifer[m]

You can also connect your local spark cluster to a full mb db replica hosted on our servers. Or run a spark cluster on wolf. But that can be slower.
15:13 PM
holycow23[m]

Could you guide me with connecting the local spark cluster with the full replica?
15:14 PM
Also how do you write queries connecting two different databases, or do you just run individual queries on both?
15:14 PM
lucifer[m]

The two databases as in?
15:15 PM
holycow23[m]

Listens would be timescale_db and information regarding the songs would be
15:15 PM
* Listens would be timescale_db and information regarding the songs would be mb_db
15:15 PM
lucifer[m]

The listens are imported using dumps in spark
15:16 PM
The addition metadata is brought in from MB db
15:16 PM
And then joined together and processed in spark
15:16 PM
holycow23[m]

So you don't use the timescale_db?
15:16 PM
lucifer[m]

No that is not used for statistics at all
15:17 PM
holycow23[m]

aah got it
15:17 PM
got it
15:17 PM
lucifer[m]

It is only used directly for listens page.
15:17 PM
All the stats, recommendations etc. is done in spark where listen data is imported from dumps.
15:18 PM
holycow23[m]

Got it, so spark has the listens as well as the info related to recordings, so you run queries over spark and then update the stats
15:18 PM
lucifer[m]

Yes.
15:19 PM
holycow23[m]

Thank you so much
15:19 PM
Also till when will the dump be generated?
15:19 PM
lucifer[m]

The sample data dump?
15:19 PM
holycow23[m]

The spark dump
15:20 PM
lucifer[m]

The spark dumps are generated daily for production
15:20 PM
Do you mean the metadata like release dates etc?
15:20 PM
holycow23[m]

Okay so how exactly would I proceed with my project since I will need the metadata with the listens
15:20 PM
To run queries
15:22 PM
lucifer[m]

Should be ready early next week.
15:23 PM
holycow23[m]

Okay, is there anything that I can do before the dump is ready, wanted to start a little early actually
15:23 PM
lucifer[m]

The data can be exported today but I need to add the import code in spark.
15:23 PM
holycow23[m]

Okay
15:24 PM
lucifer[m]

I think you can write the api and frontend side of things meanwhile.
15:24 PM
Using some hardcoded dummy data for testing.
15:25 PM
holycow23[m]

Okay
15:25 PM
Thanks
15:33 PM
petitminion joined the channel
16:05 PM
_BrainzGit

[musicbrainz-server] 14mwiencek opened pull request #3546 (03production…mbs-14032): MBS-14032: Build temporary `release_first_release_date` table for MBS-13966 https://github.com/metabrainz/musicbrainz-serve...
16:05 PM
BrainzBot

MBS-14032: The schema 30 upgrade script for MBS-13966 requires release_first_release_date to have been built https://tickets.metabrainz.org/browse/MBS-14032
16:05 PM
MBS-13966: Calculate first release dates for empty release groups https://tickets.metabrainz.org/browse/MBS-13966