lucifer, mayhem: Hello! We have a new user whose dashboard is completely broken due to the timescale sporadic listens issue. In this case it is drastic: the request completely fails and the page does not load.... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/download/chatbrainz.org/QCjuPSaEMDwRUfRUSODNAAOI>)
2025-08-21 23316, 2025
d4rkie has quit
2025-08-21 23345, 2025
d4rkie joined the channel
2025-08-21 23309, 2025
lucifer[m]
[@monkey:chatbrainz.org](https://matrix.to/#/@monkey:chatbrainz.org) yes that would work, i think the point of contention was whether the same behaviour should be enforced for api users.
2025-08-21 23329, 2025
monkey[m]
Yes, API is also going to be an issue in this (albeit extreme) case
2025-08-21 23336, 2025
monkey[m]
Not sure what we should do then, but I assume something similar.
2025-08-21 23336, 2025
monkey[m]
There was talk of storing timestamps of when the next listen starts, if memory serves?
2025-08-21 23336, 2025
monkey[m]
Then we could return a pagination-style next_ts or something in the API response
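A minimal sketch of what such a response could look like; the next_ts field name and the payload shape are assumptions for illustration, not the actual ListenBrainz API contract:

```python
# Hypothetical shape for a pagination-style listens response.
def build_listens_response(listens, next_listen_ts):
    """Return a partial page of listens plus a cursor for the next fetch."""
    return {
        "listens": listens,          # whatever was found within the scan budget
        "count": len(listens),
        # Timestamp of the next known listen, so the client can resume
        # there instead of paging through empty time ranges.
        "next_ts": next_listen_ts,
    }

# Example: the scan found only 3 listens before hitting its pass limit,
# but the next listen is known to be at 1325376000 (2012-01-01 UTC).
page = build_listens_response([{"listened_at": 1735689600}] * 3, 1325376000)
assert page["next_ts"] == 1325376000
```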
2025-08-21 23354, 2025
lucifer[m]
yes right
2025-08-21 23351, 2025
Karlifornia
i love that people still use IRC for work purposes/to contact one another to confer about things
2025-08-21 23303, 2025
mayhem[m]
> For the dashboard we had talked about having a max number of passes and returning early with a bool indicator if we didn't reach 25 listens, then we can show a load more button on the front-end. Would that work?
2025-08-21 23311, 2025
mayhem[m]
that is what we did previously and everyone hated it.
2025-08-21 23330, 2025
mayhem[m]
Karlifornia (IRC): most of us are now on matrix, but we have holdouts on IRC.
2025-08-21 23354, 2025
mayhem[m]
monkey: I suggested to lucifer that we insert hints into the DB on a periodic basis. so, if we're fetching dates for one year and we're approaching a large gap, say 2 years, the system should insert a record of sorts that says, next listen is at <this> timestamp.
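A minimal sketch of that hint idea, assuming a hypothetical listen_gap_hint table and a psycopg2-style connection; all names, the schema, and the threshold are illustrative:

```python
# Scan a user's sorted listen timestamps and record large gaps, so future
# range queries can jump straight to the next listen instead of crawling
# empty buckets. Assumes a unique constraint on (user_id, gap_start_ts).
GAP_THRESHOLD = 365 * 24 * 3600  # e.g. only hint gaps larger than one year

def record_gap_hints(conn, user_id, listen_timestamps):
    ts = sorted(listen_timestamps)
    with conn.cursor() as cur:
        for earlier, later in zip(ts, ts[1:]):
            if later - earlier > GAP_THRESHOLD:
                cur.execute(
                    """
                    INSERT INTO listen_gap_hint (user_id, gap_start_ts, next_listen_ts)
                    VALUES (%s, %s, %s)
                    ON CONFLICT (user_id, gap_start_ts)
                    DO UPDATE SET next_listen_ts = EXCLUDED.next_listen_ts
                    """,
                    (user_id, earlier, later),
                )
    conn.commit()
```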
2025-08-21 23341, 2025
lucifer[m]
yes we can do that but it will be hard to keep that up to date because we allow historical imports.
2025-08-21 23302, 2025
lucifer[m]
say a hint says the next listen is in 2022 but then the user submits listens for 2023.
2025-08-21 23329, 2025
monkey[m]
Yes, I believe this is probably how this particular case ended up happening: looks like imported data from 2007-2012, then they started sending new listens in 2025
2025-08-21 23332, 2025
lucifer[m]
unless we invalidate the 2022 hint properly in the database we will miss the 2023 listens.
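A sketch of the invalidation lucifer describes, reusing the hypothetical listen_gap_hint table from above: a listen that lands inside a hinted gap must shrink the hint, or queries that trust it will skip the new data (the 2023 listens in this example):

```python
def on_listen_inserted(conn, user_id, listened_at):
    # Shrink any hint whose gap contains the new listen: the gap now ends
    # at the new listen, so next_listen_ts must move earlier. (The tail of
    # the old gap, new listen -> old next_listen_ts, can be re-hinted later.)
    with conn.cursor() as cur:
        cur.execute(
            """
            UPDATE listen_gap_hint
               SET next_listen_ts = %s
             WHERE user_id = %s
               AND gap_start_ts < %s
               AND next_listen_ts > %s
            """,
            (listened_at, user_id, listened_at, listened_at),
        )
    conn.commit()
```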
2025-08-21 23305, 2025
mayhem[m]
lucifer[m]: the script that manages these hints needs to handle this case.
2025-08-21 23309, 2025
monkey[m]
mayhem[m]: Wait did we do this already? You mean a while back, or?
2025-08-21 23316, 2025
mayhem[m]
and we should have a way to recompute them all for a user, after an import.
2025-08-21 23328, 2025
lucifer[m]
mayhem: it would need to happen as soon as the listen insert happens imo
2025-08-21 23332, 2025
mayhem[m]
monkey: that was the very first implementation.
2025-08-21 23338, 2025
monkey[m]
I see
2025-08-21 23359, 2025
mayhem[m]
lucifer[m]: if we can insert them without a background process, even better!
2025-08-21 23308, 2025
lucifer[m]
iiuc the hints will be a nullable column in the listens table?
2025-08-21 23342, 2025
mayhem[m]
I think that could work. when we do an insert for a user, we know their min/max timestamps already, yes? if a new insert is more than X days away, insert a hint. at least, that is the easy case.
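The "easy case" as a sketch: at insert time the user's current max timestamp is already known, so a listen arriving far beyond it marks both edges of a new gap. X and all names here are illustrative assumptions:

```python
X_DAYS = 90                      # gap threshold; the real value is a tuning choice
GAP_SECONDS = X_DAYS * 24 * 3600

def maybe_hint_on_insert(conn, user_id, new_ts, known_max_ts):
    # If the new listen lands well past everything we have, the stretch
    # between the old max and the new listen is a known-empty gap.
    if new_ts - known_max_ts > GAP_SECONDS:
        with conn.cursor() as cur:
            cur.execute(
                "INSERT INTO listen_gap_hint (user_id, gap_start_ts, next_listen_ts) "
                "VALUES (%s, %s, %s) ON CONFLICT DO NOTHING",
                (user_id, known_max_ts, new_ts),
            )
        conn.commit()
```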
2025-08-21 23300, 2025
mayhem[m]
lucifer[m]: separate table, I would think.
2025-08-21 23301, 2025
monkey[m]
Could a hybrid solution work? 1. scan a max number of times, return early if we hit the max, 2. despite having returned the response, keep scanning the DB until we hit the next listen 3. store that listen's ts temporarily as a hint for future table scans
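A self-contained toy model of this hybrid, using an in-memory listen store; MAX_PASSES, the bucket size, and the hint cache are all illustrative assumptions:

```python
BUCKET = 30 * 24 * 3600   # per lucifer below, the current scan window is 30 days
MAX_PASSES = 5

hint_cache = {}           # (user_id, before_ts) -> ts of the next older listen

def fetch_page(listens_by_user, user_id, before_ts, want=25):
    """Scan back one bucket at a time, giving up after MAX_PASSES."""
    ts = sorted(listens_by_user.get(user_id, []), reverse=True)
    found, passes, lo = [], 0, before_ts
    while len(found) < want and passes < MAX_PASSES:
        hi, lo = lo, lo - BUCKET
        found.extend(t for t in ts if lo <= t < hi)
        passes += 1
    if len(found) < want:
        # Steps 2-3: keep looking past the response and remember where the
        # next listen actually is, so a later request can jump straight there.
        older = [t for t in ts if t < lo]
        if older:
            hint_cache[(user_id, before_ts)] = max(older)
    return found[:want]
```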
2025-08-21 23319, 2025
mayhem[m]
what if all listens have a previous_ts field?
2025-08-21 23331, 2025
mayhem[m]
and that is always kept up to date?
2025-08-21 23335, 2025
mayhem[m]
could we manage that?
2025-08-21 23356, 2025
mayhem[m]
because this problem becomes easy if you have previous_ts
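A toy illustration of why previous_ts makes this easy: each listen points at the one before it, so paging follows the chain and never scans empty years. The dict-based store and timestamps are purely illustrative:

```python
listens = {
    1735689600: {"track": "c", "previous_ts": 1325376000},  # 2025-01-01
    1325376000: {"track": "b", "previous_ts": 1167609600},  # 2012-01-01
    1167609600: {"track": "a", "previous_ts": None},        # 2007-01-01
}

def walk_back(latest_ts, count):
    """Follow the previous_ts chain instead of scanning time buckets."""
    out, ts = [], latest_ts
    while ts is not None and len(out) < count:
        out.append(ts)
        ts = listens[ts]["previous_ts"]
    return out

assert walk_back(1735689600, 3) == [1735689600, 1325376000, 1167609600]
```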
2025-08-21 23341, 2025
monkey[m]
I hate to think about adding that to the giant SQL queries we have, but if lucifer says it's possible...
2025-08-21 23359, 2025
lucifer[m]
the issue is keeping it up to date. delete and insert need to be updated.
2025-08-21 23311, 2025
lucifer[m]
doable, yes; not sure how fast it will be.
2025-08-21 23312, 2025
d4rkie has quit
2025-08-21 23315, 2025
mayhem[m]
yes, keeping it up to date is the tricky part.
2025-08-21 23333, 2025
lucifer[m]
i would rather do a separate table on the granularity of a month or year.
2025-08-21 23342, 2025
d4rkie joined the channel
2025-08-21 23343, 2025
mayhem[m]
sure.
2025-08-21 23351, 2025
lucifer[m]
that maintains info as to whether a user has listens in that month or year.
2025-08-21 23307, 2025
lucifer[m]
not as optimal but easier to keep up to date
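A sketch of that per-month presence idea: one entry per user and month that has at least one listen. Coarser than exact hints, but a plain insert keeps it correct; all names are illustrative:

```python
from datetime import datetime, timezone

def month_of(ts):
    d = datetime.fromtimestamp(ts, tz=timezone.utc)
    return d.year * 12 + (d.month - 1)        # simple monotonic month key

def mark_listen(presence, user_id, listened_at):
    # Called from the insert path; idempotent, so imports can re-mark months
    # without special cases. A delete only ever leaves a false positive,
    # which costs one wasted scan, never missed data.
    presence.setdefault(user_id, set()).add(month_of(listened_at))

def months_to_scan(presence, user_id, before_month, limit):
    """Only these months need a real table scan, newest first."""
    return sorted((m for m in presence.get(user_id, ()) if m < before_month),
                  reverse=True)[:limit]
```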
2025-08-21 23310, 2025
mayhem[m]
delete is not too hard. we have the previous_ts, we'd need to scan for next_ts, which is problematic.
2025-08-21 23317, 2025
mayhem[m]
so previous_ts and next_ts?
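With both fields the listens form a doubly linked list, so a delete becomes a local relink instead of a scan; a toy sketch over a dict store like the one above, assuming each node also carries next_ts:

```python
def delete_listen(listens, ts):
    node = listens.pop(ts)
    prev_ts, next_ts = node["previous_ts"], node["next_ts"]
    if prev_ts is not None:
        listens[prev_ts]["next_ts"] = next_ts      # skip over the deleted listen
    if next_ts is not None:
        listens[next_ts]["previous_ts"] = prev_ts  # and link back the other way
```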
2025-08-21 23319, 2025
monkey[m]
Ahhh, and use that as a hint to skip some time buckets?
2025-08-21 23339, 2025
mayhem[m]
lucifer: oh! I like that idea.
2025-08-21 23341, 2025
monkey[m]
(I meant lucifer's proposal)
2025-08-21 23300, 2025
mayhem[m]
how many buckets can we shove into 64 bits?
2025-08-21 23305, 2025
mayhem[m]
how future proof is that?
2025-08-21 23315, 2025
monkey[m]
That's a sentence I never thought I'd read.
2025-08-21 23321, 2025
mayhem[m]
what if we had just that to decode into a map of where we need to query?
2025-08-21 23307, 2025
mayhem[m]
what is our current bucket size, lucifer ?
2025-08-21 23338, 2025
lucifer[m]
30 days
2025-08-21 23337, 2025
monkey[m]
So 5 years of monthly ticks in 64 bits?
2025-08-21 23305, 2025
mayhem[m]
ok, let's make it future proof then.
2025-08-21 23325, 2025
mayhem[m]
we use a binary field. and we keep appending bytes to it as we progress in time.
2025-08-21 23359, 2025
mayhem[m]
each bit represents one month since LAST_FM_START
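A sketch of that growing bitfield: one bit per month since LAST_FM_START, stored as bytes that can be extended forever. A fixed 64 bits would only cover 64 months (about 5.3 years), hence the append-as-we-go design. LAST_FM_START is assumed here to be 2002-01-01; substitute the real epoch:

```python
from datetime import datetime, timezone

LAST_FM_START = datetime(2002, 1, 1, tzinfo=timezone.utc)   # assumed epoch

def month_index(ts):
    d = datetime.fromtimestamp(ts, tz=timezone.utc)
    return (d.year - LAST_FM_START.year) * 12 + (d.month - LAST_FM_START.month)

def set_bit(bitfield: bytearray, ts):
    """Got a new listen: set the right bit, growing the field if needed."""
    i = month_index(ts)
    byte, bit = divmod(i, 8)
    if byte >= len(bitfield):
        bitfield.extend(b"\x00" * (byte - len(bitfield) + 1))
    bitfield[byte] |= 1 << bit

def has_listens(bitfield, ts):
    i = month_index(ts)
    byte, bit = divmod(i, 8)
    return byte < len(bitfield) and bool(bitfield[byte] & (1 << bit))
```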
2025-08-21 23318, 2025
monkey[m]
We also have the user's earliest TS; we could calculate based on that, but we'd need to update it when an import/deletion changes it
2025-08-21 23305, 2025
mayhem[m]
the large bitfield I was describing is easy to update and easy to fetch/store.
2025-08-21 23311, 2025
lucifer[m]
if we are doing a separate table, then it's just one row per user and we can store ints or datetimes, etc.
2025-08-21 23314, 2025
mayhem[m]
got a new listen, just set the right bit.
2025-08-21 23321, 2025
lucifer[m]
bitfield is fine too sure.
2025-08-21 23345, 2025
mayhem[m]
the bitfield is super dense and we don't need to know much more than these search hints.
2025-08-21 23359, 2025
mayhem[m]
yeah, I think this idea has legs.
2025-08-21 23312, 2025
lusciouslover has quit
2025-08-21 23302, 2025
lusciouslover joined the channel
2025-08-21 23315, 2025
monkey[m]
How does that translate on the front-end and API side? does it mean we transparently continue scanning the DB, skipping the empty buckets, so it's seamless to users?
2025-08-21 23314, 2025
mayhem[m]
it should be transparent to the UI. just ask for data, the backend will know how to fetch it.
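One way the skipping could stay invisible, building on month_index from the bitfield sketch above: the fetch loop consults the hints to pick the next non-empty month, and the endpoint's request/response shape never changes. The helper name is an assumption:

```python
def next_nonempty_window(bitfield, before_ts):
    """Walk months backwards from before_ts, skipping hinted-empty ones."""
    i = month_index(before_ts)        # from the bitfield sketch above
    while i >= 0:
        byte, bit = divmod(i, 8)
        if byte < len(bitfield) and bitfield[byte] & (1 << bit):
            return i                  # only this month needs a real DB scan
        i -= 1
    return None                       # no older listens exist at all
```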
2025-08-21 23355, 2025
monkey[m]
That sounds perfect, definitely better than the other solutions mentioned
2025-08-21 23313, 2025
fettuccinae[m] joined the channel
2025-08-21 23313, 2025
fettuccinae[m]
Would there be any difference if we let timescale scan the db for x number of latest listens of a user instead of changing the scanning window manually for each pass?
2025-08-21 23342, 2025
fettuccinae[m]
* Would there be any difference if we let timescale db scan for x number of latest listens of a user instead of changing the scanning window manually for each pass?
2025-08-21 23357, 2025
mayhem[m]
letting timescale search for x listens is exactly the problem. if there are gaps in the listen history this could take quite some time.
2025-08-21 23313, 2025
fettuccinae[m]
i meant we run the query multiple times before finding x listens; will there be any difference if we run it only once? Also, what if we create an index on (user_id, listened_at)?
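For reference, fettuccinae's index suggestion as SQL (wrapped in Python only to keep the examples in one language); the table and index names are assumptions. A composite index lets one user's listens be read newest-first inside each chunk, though it doesn't by itself stop a sparse history from touching many chunks, which is the gap problem above:

```python
CREATE_INDEX = """
CREATE INDEX IF NOT EXISTS listen_user_ts_idx
    ON listen (user_id, listened_at DESC);
"""
```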