<fettuccinae[m]> "https://listenbrainz.readthedocs..." <- Ok so I need to look into the Spotify reader container?
<suvid[m]> "i had some general queries..." <- > <@suvid:matrix.org> i had some general queries regarding the listens import code... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
The importer service related code is in this directory.
The listenbrainz-spotify-reader container invokes the spotify.py file and runs the importer code
suvid[m]
ohh
and it does it using cronjobs?
lucifer[m]
No
suvid[m]
like at every interval
lucifer[m]
It keeps running continuously at all times
Doing one pass over all users, then another pass over all users, and repeating infinitely
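The loop shape lucifer describes can be sketched roughly like this (a minimal sketch only; `fetch_importable_users` and `import_listens` are hypothetical stand-ins, not the actual ListenBrainz code, and the real service runs with no pass limit):

```python
import time

def fetch_importable_users():
    # hypothetical stand-in for the query that lists users with Spotify linked
    return ["user_a", "user_b", "user_c"]

def import_listens(user):
    # hypothetical stand-in for fetching from Spotify and submitting listens
    return f"imported {user}"

def run_forever(max_passes=None):
    """Sweep over all users, then start over; the real service never stops.

    max_passes exists only so this sketch can terminate for demonstration.
    """
    results = []
    passes = 0
    while max_passes is None or passes < max_passes:
        for user in fetch_importable_users():
            results.append(import_listens(user))
        passes += 1
        time.sleep(0)  # the real service may pause/throttle between passes
    return results

print(run_forever(max_passes=2))
```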
suvid[m]
ohh
lucifer so the spotify-reader container just invokes the spotify.py file? or something else as well?
where can i view what all it calls?
sorry for such a beginner query 😅
lucifer[m]
Just the spotify.py file
mayhem[m]
lucifer: moin! I'm working on the shared memory implementation and I have to say, that will be the way to go. shared memory is great (see postgres).
however, the nmslib only supports persisting indexes to disk. You need to pass a filename -- passing a stream is not supported
I'm obviously trying to avoid the hit of writing to disk only to load into ram again.
A RAM disk would do the trick, but that is a pain to setup.
there are several suggested ways of patching the open call so that it returns a memory stream instead of a file stream, but this code is very likely actual C code, deep in nmslib.
short of a ram disk, can you think of any alternatives?
zas: you might have some insights as well.
lucifer[m]
mayhem: how about save to a file first and then memory map the file?
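lucifer's save-then-mmap suggestion can be sketched with the stdlib alone (a sketch under assumptions: the bytes here stand in for a saved index file; after the first write, reads are typically served from the page cache rather than hitting the disk again):

```python
import mmap
import tempfile

# Stand-in for an index file that would have been written once by the library.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"index-bytes")
    path = f.name

# Memory-map the file read-only; subsequent accesses go through the
# page cache, so the "load" is mostly a RAM operation after the write.
with open(path, "rb") as f:
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    print(mm[:11])
    mm.close()
```

This still pays the initial write, which is exactly the cost mayhem wants to avoid below.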
mayhem[m]
I don't ever want the file to go to disk.
making a ram disk is pretty easy it turns out. that might be the best way.
and it's a speed-up improvement, not critical to have.
lucifer[m]
cool, are you using pyfilesystem?
mayhem[m]
sudo mount -o size=10M -t tmpfs none /mnt/tmpfs
not sure I see the point of pyfilesystem
lucifer[m]
sounds good.
zas[m]
So basically you don't want to persist indexes? Isn't that the default? Can you point me at actual code? https://github.com/nmslib/nmslib/blob/2ae537802... seems to indicate one has to call saveIndex for it to be written to disk, and there's also a loadIndex. But I'm not even sure what indexes we talk about.
mayhem[m]
zas[m]: exactly that -- load and save index to and from disk. I don't ever want to hit the disk, I want a pure ram operation.
speed is of the essence in this case for the new mapping server.
zas[m]
But ... createIndex() seems to use RAM, and load/save are meant to persist those, but aren't those load/save calls under your control? I mean if you don't want to use disk it seems to me that's perfectly possible (just don't use save/loadIndex()). But maybe I miss something, I know nothing about this lib nor your use of it.
Of course the RAM disk solution works (and is easy to set up), but what I don't understand is the "nmslib only supports persisting indexes to disk." part, it seems that bindings say different. You can create an index and don't save it at all.
mayhem[m]
I am building a system where an index needs to be shared with other processes in shared ram. so I need to get an in-ram index and get it into shared ram.
if I could persist to a buffer, I could copy that buffer to shared ram and I am done.
but I can only persist the index to disk. thus the need for the ram disk.
otherwise I get the hit of going to disk and right back from it, when I would prefer to avoid that.
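The "copy that buffer to shared ram" step mayhem wants can be sketched with Python's stdlib `multiprocessing.shared_memory` (a sketch under assumptions: the byte string stands in for the output of nmslib's `saveIndex()` on a tmpfs path, and the segment name is hypothetical):

```python
from multiprocessing import shared_memory

def publish(data: bytes, name: str) -> shared_memory.SharedMemory:
    """Copy a byte buffer into a named shared-memory segment."""
    shm = shared_memory.SharedMemory(create=True, size=len(data), name=name)
    shm.buf[:len(data)] = data
    return shm

def read(name: str, size: int) -> bytes:
    """Attach to the segment by name, as another process would."""
    shm = shared_memory.SharedMemory(name=name)
    try:
        return bytes(shm.buf[:size])
    finally:
        shm.close()

if __name__ == "__main__":
    # Stand-in for the bytes saveIndex() would have written to a tmpfs file.
    fake_index = b"fake-index-bytes"
    seg = publish(fake_index, "lb_index_demo")
    try:
        print(read("lb_index_demo", len(fake_index)))
    finally:
        seg.close()
        seg.unlink()  # free the segment when done
```

The remaining gap is exactly the one discussed here: getting the index bytes into that buffer without `saveIndex()` touching a real disk, hence the ramdisk.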
zas[m]
Ah, I get it, actually you want to persist indexes. ;)
mayhem[m]
I do, yes.
ram disk it is. venga.
zas[m]
So the ramdisk looks the best approach to me, it is simple and reliable and doesn't require any change in the app
lucifer: yesterday I got rid of the shards and just made the simplest uwsgi workers. no sharing of data. I got at most 75 reqs/s out of that.
Now with shared memory its 200 reqs/s.
lucifer[m]
awesome.
mayhem[m]
and if I pre-build the indexes I suspect that is going to be much higher.
this is finally starting to come into focus.
it's amazing that it can do as well as it does, without pre-built indexes.
tomorrow I'll pre-build indexes and add cache management and then we can see the real performance.
but I suspect I'll be greenlighted for finishing all the features, since this may actually work ok.
lucifer[m]
sounds great.
did you add a validation step yet? to make sure it's working correctly.
bitmap[m]
<zas[m]> "bitmap: ^^ not sure what..." <- I didn't see any alert either. and it looks like the container logs for that time period are already gone... I didn't see anything in the PG logs.
[listenbrainz-server] 14ahmvdev closed pull request #3221 (03master…master): LB-1760: Fix the link in the details section to redirect to the correct file. https://github.com/metabrainz/listenbrainz-serv...
question: why isn't critiquebrainz a part of GSOC projects?
It falls behind in terms of UI design but i could literally see it as being an alternative to letterboxd for music
mayhem[m]
hi ahmad!
we've kinda deprecated CritiqueBrainz as a separate project -- overall we feel that a lot of these things should be available to ListenBrainz users, so we're planning on migrating or adding features from CB to LB.
if you wanted to propose a project that takes the useful bits of CB and adds them to LB, we'd consider it.
however, we are very hesitant on accepting UI projects from GSoC students. we have pretty solid design guidelines and coordinating between our designer, the mentor and the student against a tight deadline doesn't work all that well.
So, if you focus on API work, then that should work great. If you want to do UI work, you'll have to be a wizard and really impress us to take a project on.
glucosesniffer[m]
mayhem[m]: well the last line just scared me off, ill stick to existing ideas lmfao
mayhem[m]
sorry, but best to make things clear early on. :)
we've done GSoC so many times that we know what works for us so we pick things that tend to have the best outcomes for all of us.
glucosesniffer[m]
mayhem[m]: i thought itd be easy work so i was thinking of proposing that as a project idea, but honestly if you hadn't clarified i would've gone with it and bitten off more than i could chew. apart from gsoc, if you guys decide to incorporate some features from CB to LB in the future, i would love to contribute
mayhem[m]
glucosesniffer[m: find a non-gui way to contribute then!
julian45[m]: I think I would prefer the GUI option this time. I use these services once every 6 months and GUIs are much more easily discoverable than having to re-learn a CLI.
glucosesniffer[m]
mayhem[m]: lolol
mayhem[m]
but if zas strongly prefers a CLI, then that's fine by me.
glucosesniffer[m]
mayhem[m]: will try!
zas[m]
I'm fine with the simplest one. Not like we will manage a lot of users anyway.
julian45[m]
good to know! unfortunately neither of the options i presented are particularly simple (a bit of the nature of the beast when it comes to identity providers), but each is relatively easy to reach MVP and do ongoing work in.
i do have a follow-up question: the CLI-only option can theoretically handle user auth and SSH key distribution for *nix hosts, but the web GUI-first option can't. if the implementation of this feature was deemed strong enough and usable enough to viably replace the current ansible-based user and key mgmt processes, would that skew things either way?
mayhem[m]
julian45[m]: My nose is not close enough to the grindstone to answer that question. Zas will have a better answer than I.
zas[m]
Clearly, if it is possible to use it instead of Ansible for SSH key deployment, I guess we should opt for the solution that allows that; it would be very convenient and safer. Though how easy is it to set up compared to the current Ansible "solution" (which is far from perfect but rather simple and reliable)?