#metabrainz


      • xSkkarf[m]
        nvm i figured it out xD. Cursor helped me sort out some docker networking issues.
      • tux0r
        ah, docker being docker again
      • jasje[m]
        <reosarevok[m]> "jasje: let me know if I really..." <- reosarevok: ^^ tqice forwarded email does not contain the users email :(
      • BrainzGit
        [musicbrainz-server] 14reosarevok merged pull request #3495 (03master…mbs-12826): MBS-12826: Speed up release series reordering https://github.com/metabrainz/musicbrainz-serve...
      • [listenbrainz-android] 14hemang-mishra opened pull request #556 (03feat-playlists…PlaylistScreen): MOBILE-214 Playlist-Detail-Screen & Add-Track-To-Playlist-Sheet https://github.com/metabrainz/listenbrainz-andr...
      • reosarevok[m]
        Updating beta
      • HemangMishra[m] joined the channel
      • HemangMishra[m]
        jasje: I have made the final PR of the playlist feature. Let me know if any changes are required. Then we can merge it into the main branch.
      • jasje[m]
        HemangMishra[m]: Need time to test
      • HemangMishra[m] uploaded an image: (736KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/jhCzeuhZjPransvQZQvwedGO/image.png >
      • HemangMishra[m]
        HemangMishra[m]: Also, should I change the gradient or rounded corners of the description section? The Artists page and others have rounded corners, but Profile > Listens doesn't, so I'm not sure which one to follow.
      • reosarevok[m]
        Beta update done
      • rustynova[m]
        Hi! I plan to make some contributions to Melba again now that the project is in a more stable state (and not bound by the rules of GSoC).
      • But I do wonder, is the project on hiatus, or does it just need contributors? There's a pending PR by yellowhatpro that hasn't been touched in 4 months...
      • For now I'd just make some refactoring changes to clean things up a bit (and finally rebase my own PR).
      • yellowhatpro[m] joined the channel
      • yellowhatpro[m]
        Hi, rustynova. Yeah, you can work on the project. I have been busy with some other projects outside of MetaBrainz, so I am not able to work on melba.
      • vardhan_ joined the channel
      • rustynova[m]
        Well, that will be a side project for me too, something to take my mind off other projects. I will try to make at least a commit per week though
      • lucifer[m]
        mayhem: hi! i am doing MLHD+ chunk-by-chunk processing and i think we might be able to do it on the current cluster. it processes one chunk in an hour and there are 16 chunks in total, so the whole run takes about a day. however, each chunk produces an output of 400G to store, and that's without replication; with replication it's 3x. so the current bottleneck is disk space.
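A quick back-of-the-envelope check of the numbers above (16 chunks, roughly 400 GB of output per chunk, one chunk per hour):

```python
# Rough estimate of the MLHD+ processing runtime and disk requirement quoted above.
chunks = 16
hours_per_chunk = 1
gb_per_chunk = 400

runtime_hours = chunks * hours_per_chunk  # 16 hours, i.e. roughly a day
raw_gb = chunks * gb_per_chunk            # 6,400 GB (~6.4 TB) without replication
replicated_3x_gb = raw_gb * 3             # 19,200 GB (~19.2 TB) at 3x replication
replicated_2x_gb = raw_gb * 2             # 12,800 GB (~12.8 TB) if replication is reduced to 2

print(runtime_hours, raw_gb, replicated_3x_gb, replicated_2x_gb)
```

Even with replication reduced to 2, this lines up with the "at least 10 TB" figure given later in the discussion.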
      • mayhem[m]
        ok, let me see what our options are.
      • lucifer[m]
        zas: mayhem: i see hetzner has 5TB storage boxes for $13/month. so in the worst case we keep this around for a month; that's 65 USD, still cheaper than the VMs.
      • mayhem[m]
        (but, squeee, that is exciting!)
      • the storage boxes run via SAMBA and end up being slow and unreliable.
      • lucifer[m]
        i am guessing it's HDD not SSD, so slower, but we have 12 hours of spark free time daily so i could do 3-4 chunks a day.
      • i see
      • mayhem[m]
        it's for storing things, not for working on things.
      • I tried to run navidrome on top of a storage box and it lasted a week before it shat itself.
      • lucifer[m]
        i can disable replication cluster wide or reduce it to 2 for the duration of mlhd processing, but even then i expect at least 10 TB of disk space requirement.
      • mayhem[m]
        we could have disks put into those machines, but it would be a real pain.
      • given all this, I still think we should use new VMs to get this done.
      • lucifer[m]
        i see.
      • mayhem[m]
        and we can attach storage volumes (fast and suitable for working) to the nodes to reach the DB levels we need.
      • lucifer[m]
        is it possible to have multiple disks at a single mount point?
      • mayhem[m]
        no, unix doesn't allow that.
      • why do you need that?
      • lucifer[m]
        actually nvm, i found the way to specify multiple directories/mounts to hdfs
      • mayhem[m]
        if you wanted that you would have to use something like LVM and make a virtual drive.
      • phew.
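For reference, the HDFS setting lucifer is presumably referring to is the datanode data-directory property, which accepts a comma-separated list of paths, so several attached volumes can back one datanode without needing an LVM layer. A minimal hdfs-site.xml sketch (the mount paths here are made up):

```xml
<!-- hdfs-site.xml: blocks are spread across all listed directories on the datanode -->
<property>
  <name>dfs.datanode.data.dir</name>
  <value>/mnt/volume1/hdfs/data,/mnt/volume2/hdfs/data</value>
</property>
```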
      • lucifer[m]
        zas: would it be possible for the new VMs to connect to existing spark servers?
      • i need a smaller server for running the leader, but i guess we could get a smaller vm for that and keep it all isolated from the current cluster.
      • mayhem: so i am thinking CCX53 * 3 + CCX53 * 1 + 10 TB storage volume on each.
      • zas[m]
        Sorry, but I don't have much time today for this, still fighting back against excess traffic on MB
      • lucifer[m]
        sure we can arrange it later.
      • MyNetAz has quit
      • mayhem[m]
        I can create the servers, no problem. any objections, zas?
      • zas[m]
        none :)
      • mayhem[m]
        ok, hang on lucifer, VMs coming.
      • lucifer[m]
        mayhem: we'll need to connect all the VMs in a vlan or make them accessible to each other somehow.
      • mayhem[m]
        yes, I believe running the server setup should take care of all of that.
      • lucifer[m]
        okay cool
      • mayhem[m]
        but I don't actually know how to run that, so maybe we should wait for zas, unless he can tell me how.
      • lucifer[m]
        yeah i think it makes sense to wait until he's free.
      • mayhem[m]
        ok, let us know when you free up, zas
      • zas[m]
        First you need to add the new VMs to Ansible. Since they are Hetzner VMs you can copy https://github.com/metabrainz/metabrainz-ansibl... (a VM), but you need to change the values to match the network settings for this VM. This config doesn't add any virtual network; for that you need to follow https://docs.hetzner.com/cloud/networks/connect... , we haven't done it successfully yet, and it will require tricky firewall settings I guess (we can use Ansible for the initial deployment, and just do the configs manually later on)
      • Each VM should be added to https://github.com/metabrainz/metabrainz-ansibl... (in the right groups; check where the auth VMs are)
      • Each machine we want to connect should be in spark-vnet (this is on Hetzner Robot)
      • For example, jermaine has:... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • monkey[m]
        <aerozol[m]> "I think this was about the..." <- Yeha, I think we have enough stuff to show-and-tell, that would be nice. Not sure we have any feature on the immediate horizon to justify waiting for.
      • Let me know if you want some help, and in what form. For example I can collate everything I think should go in there, and let you filter, massage and post?
      • zas[m]
        michael (which is also on this vnet) can ping jermaine over this network:... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • So the idea is to have your VMs in this network too.
      • mayhem[m]
        oy. I think it will be better to wait for zas -- I think you'll do it much faster than me trying to learn all this.
      • zas[m]
        Actually it would be good if you add VMs to Ansible (this part is easy), and then learn how to deploy (basically run bootstrap.yml playbook then site.yml playbook). You need ssh to be properly configured (ssh root@yourvm should work for bootstrap, and then ssh youruser@yourvm should work for site)
      • You risk nothing by trying (at worst we rebuild the VM and start over). Properly configuring the network is a must (of course, else we lose the connection)
      • for the vswitch part, that's trickier, because we don't have such a setup yet, and I expect some changes will be needed to the network/firewall settings to handle this case in Ansible.
      • Bootstrap needs root access because users aren't created yet, then site can run on user+sudo
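A hedged sketch of the two-step deploy zas describes, assuming a stock ansible-playbook setup; the inventory file name and host alias below are placeholders:

```sh
# First run: no users exist on the VM yet, so bootstrap connects as root.
ansible-playbook -i hosts bootstrap.yml --limit new-spark-vm -u root

# Subsequent runs: site.yml runs as your own user with sudo (add -K if sudo needs a password).
ansible-playbook -i hosts site.yml --limit new-spark-vm
```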
      • mayhem[m]
        well, this sounds more interesting than the thing I am fighting now...
      • <lucifer[m]> "mayhem: so i am thinking CCX53 *..." <- this is actually not clear. one machine with 10TB storage?
      • lucifer[m]
        each.
      • mayhem[m]
        and 3 stock CCX53s?
      • so 4 CCX53 with each 10TB?
      • lucifer[m]
        oh sorry, i made a typo. 3 CCX53's and 1 CCX23. with 10 TB each yes.
      • but i did think of an alternative plan to tackle this dataset too if you want to wait.
      • mayhem[m]
        what is that alternate plan?
      • lucifer[m]
        process each file individually in parallel using duckdb or somesuch, and then just do the final combination step in spark.
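A rough sketch of what that alternative could look like, assuming the MLHD+ chunks are directories of tab-separated files; the paths and column names here are made up. Each file is aggregated independently with DuckDB in a worker process, and only the per-file Parquet outputs would go through Spark for the final combination step:

```python
import duckdb
from concurrent.futures import ProcessPoolExecutor
from pathlib import Path

def process_file(path: str) -> str:
    """Aggregate a single MLHD+ file with DuckDB and write an intermediate Parquet file."""
    out = f"{path}.parquet"
    con = duckdb.connect()  # per-worker in-memory database
    con.execute(f"""
        COPY (
            SELECT user_id, recording_mbid, count(*) AS listen_count  -- hypothetical columns
            FROM read_csv_auto('{path}', delim='\t')
            GROUP BY user_id, recording_mbid
        ) TO '{out}' (FORMAT PARQUET)
    """)
    con.close()
    return out

if __name__ == "__main__":
    files = [str(p) for p in Path("/data/mlhdplus").glob("*/*.txt")]  # made-up layout
    with ProcessPoolExecutor(max_workers=8) as pool:
        outputs = list(pool.map(process_file, files))
    print(f"wrote {len(outputs)} intermediate files for the final Spark combine step")
```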
      • mayhem[m]
        sounds like a lot of code for you to write.
      • And I can't get a dedicated CPU VM. we've reached some limit I need to have raised.
      • lucifer[m]
        hard to tell without taking a shot at it, might be a day or less or more.
      • mayhem[m]
        maybe spend an hour or 2 on it and see if that approach is promising?
      • zas[m]
        If those machines are temporary, we may not add them to Ansible at all, and just do a basic config manually
      • mayhem[m]
        yes, all temp. but we'll see if we actually need them
      • lucifer[m]
        i don't anticipate issues showing up until, say, 10% of the dataset is done, but sure, let me try.
      • zas[m]
        ok, tell me, I'll be happy to help, but I'm very busy right now controlling this absurd traffic surge...
      • mayhem[m]
        zas[m]: focus on the surge. lucifer will try an alternate path.
      • bitmap[m]
        reosarevok: yvanzo: lucifer: hello, are we meeting now?
      • reosarevok[m]
        Hi! I thought we were! :)
      • yvanzo[m]
        I thought it was in 1h from now but I’m around already.
      • monkey[m]
        mayhem, lucifer: An interesting timezone-related issue I'd never seen before was reported (LB-1766): `can't compare offset-naive and offset-aware datetimes`. The ticket was assigned to me, but I don't know if I'm the ideal candidate. Can I assign the ticket to either of you?
      • BrainzBot
        LB-1766: Internal Server Error (500) when accessing listening history https://tickets.metabrainz.org/browse/LB-1766
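For context on LB-1766: that error is Python refusing to compare a naive datetime (no tzinfo) with a timezone-aware one. A minimal reproduction and the usual fix, not the actual ListenBrainz code:

```python
from datetime import datetime, timezone

naive = datetime(2025, 3, 9, 12, 0)   # no tzinfo attached
aware = datetime.now(timezone.utc)    # tzinfo=UTC

# naive < aware  ->  TypeError: can't compare offset-naive and offset-aware datetimes

# Fix: attach a timezone to the naive value (or strip it from both) before comparing.
naive = naive.replace(tzinfo=timezone.utc)
print(naive < aware)
```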
      • reosarevok[m]
        IIRC we agreed on 5 PM CET which would be now
      • bitmap[m]
        you may be right, I probably got confused by the dst shift again
      • reosarevok[m]
        Oh no, it was UTC
      • My bad :)
      • Then yes, in one hour
      • Although if lucifer is around we can always start earlier
      • lucifer[m]
        i am around now if we want to start early