in #metabrainz

15:12 PM
zas

restarted solr 4
15:13 PM
I stopped 5, I'll restart it once 4 recovered (if it does)
15:16 PM
rimskii[m] joined the channel
15:16 PM
rimskii[m]

<lucifer> "rimskii: hi! let me know when..." <- lucifer (IRC): hi! srry, i saw your message just now, was on flight.
15:16 PM
is it okay if I will ask the questions here? :>
15:17 PM
mayhem

yes
15:17 PM
rimskii[m]: just ask questions; we may not answer immediately, but we do eventually. :)
15:18 PM
zas

I stopped sir-prod for now
15:18 PM
restarting solr 5
15:20 PM
lucifer: the main question is why solr 5 eventually start to read from disk very heavily (like 800MB/s vs 50MB/s for others), when it does, the cluster crashes at some point, I think that's because solr5 gets very slow (load > 100) and it somehow cascades
15:20 PM
BrainzGit

[listenbrainz-server] 14MonkeyDo merged pull request #2883 (03brainzplayer-spa…refactor-recommendations-page): Refactor RecommendationsPage to Functional Component https://github.com/metabrainz/listenbrainz-serv...
15:20 PM
zas

lucifer: see https://stats.metabrainz.org/d/T4MODrIiz/solr-c...
15:20 PM
lucifer

rimskii[m]: yes
15:21 PM
zas

https://usercontent.irccloud-cdn.com/file/Ahso6...
15:22 PM
mayhem

wow, that graph. scary
15:22 PM
lucifer

i see
15:23 PM
rimskii[m]

lucifer (IRC): I'm setting up my stuff now. wanted to ask what about the apple music and soundcloud apis? I have an account to use soundcloud APIs, but not for the musickit one( also its paid. do we have a MB acc for it or should I buy it?
15:24 PM
lucifer

rimskii[m]: we have an apple music developer token you should be able to use methinks.
15:25 PM
rimskii[m]

cool !
15:25 PM
zas

can it be an underlying hardware issue? I mean that's really weird it always happen on solr5 (and sometimes 4) but never on others
15:26 PM
rimskii[m]

I think i will start from importing functions first (for Spotify, Soundcloud and apple music), then go to exports
15:26 PM
zas

I could snapshot the vm, destroy it and recreate it while preserving IPs (I think)
15:27 PM
but first let's see if lucifer can get something from logs
15:28 PM
bumping the rate limit to 60 (from 20/s)
15:29 PM
mayhem

monkey[m]: lucifer : do we know why the feed doesn't load?
15:29 PM
https://usercontent.irccloud-cdn.com/file/aszoY...
15:30 PM
lucifer

it loads for me
15:31 PM
rimskii[m]

lucifer (IRC):... (full message at <https://matrix.moviebrainz.org/_matrix/media/v3...>)
15:31 PM
lucifer

zas: looks like memory issues. every 1s, GC activity is causing 0.3s stalls
15:31 PM
zerodogg has quit
15:31 PM
mayhem

aerozol had the same problem.
15:31 PM
lucifer

rimskii[m]: i see, makes sense. i can help with it in a while.
15:32 PM
rimskii[m]

okay, thanks!
15:33 PM
lucifer

when that happens solr starts reading a lot of data from disks.
15:33 PM
rimskii[m]

also if I want to PR should I first open a ticket for it? if the PR is related to GSoC. I would like to commits PR little by little
15:37 PM
mayhem

rimskii[m]: we open tickets sometimes, but in a lot of cases they are not needed for ordinary PRs. I suspect that lucifer wants you to open a ticket, he'll ask you to do that.
15:37 PM
.. *if* lucifer wants ....
15:42 PM
zas

lucifer: by memory issues do you mean lack of RAM? or something else?
15:47 PM
lucifer

zas: yes. RAM and JVM heap.
15:48 PM
but still not sure why only on two particular nodes.
15:48 PM
zas

perhaps they are slower than others (shared cpu) and it leads to the issue more quickly
15:48 PM
or an underlying hardware issue leads to that
15:48 PM
lucifer

possible yeah
15:49 PM
zas

I think I'll rescale this server: it wil rule out an underlying hardware issue and give it more ram. Extra cost until we moved to new cluster though.
15:49 PM
mayhem

fine.whatever.
15:49 PM
zas

let's try
15:49 PM
I'll stop solr5
15:52 PM
pranav[m] joined the channel
15:52 PM
pranav[m]

akshaaatt (IRC): if ur free post today’s dev meet id like to have a small brief meet re how to proceed with my GSoC project if that’s fine with u..
15:53 PM
akshaaatt

Sure, pranav[m] !
16:01 PM
lucifer

mayhem: i'll try to debug. sentry has been down for a few days so can't check ther.
16:01 PM
zas

solr5 has twice more cores now, but I couldn't get more RAM. Though underlying hardware changed at least. I'll slowly set the rate limit back to normal
16:02 PM
mayhem

ok, no rush I would say. not sure how many people use that page.
16:03 PM
lucifer

can you try loading it now?
16:03 PM
(no fixes just to see if something is logged in prod)
16:05 PM
pranav[m]

lucifer (IRC): for a project of mine that I have to make for my college, I had a doubt re pagination from backend. I have let’s say 10,000-15,000 entries and am using page number pagination.. what shud be my page size and how is it determined in order for the server to not slow down
16:05 PM
BrainzGit

[bookbrainz-site] 14MonkeyDo merged pull request #1081 (03master…BB-760): Fix BB-760 : When elasticsearch configuration is missing, server crashes https://github.com/metabrainz/bookbrainz-site/p...
16:07 PM
lucifer

pranav[m]: no fixed rule for that, depends on what database you are using, how your data is structured, how often data is retrieved and lots of other factors.
16:08 PM
probably start so that most common cases are covered in 1 api call. if that is too much then decrease the page count.
16:08 PM
but to be honest, most applications never reach that scale that you need to worry about such things.
16:09 PM
pranav[m]

Okay, thanks a lot lucifer_ (IRC) will keep this in mind
16:10 PM
lucifer

rimskii[m]: PRs are fine. tickets are not really required in most cases but if you like to create some structure or etc. feel free to.
16:11 PM
lucifer for the troi part. https://github.com/metabrainz/troi-recommendation-playground/blob/main/troi/tools/spotify_lookup.py
16:11 PM
this is an example of how we export playlists from LB to spotify. you can create a similar function here that imports playlists from spotify to LB.
16:12 PM
most of the code you have in LB server should remain the same just live in this repo instead.
16:13 PM
rimskii[m]

got it, thank you very much !
16:14 PM
zerodogg joined the channel
16:15 PM
for the last:... (full message at <https://matrix.moviebrainz.org/_matrix/media/v3...>)
16:17 PM
twodoorcoupe joined the channel
16:23 PM
zas

Hey twodoorcoupe
16:23 PM
twodoorcoupe

zas: what's up
16:24 PM
zas

a lot on Picard front ;) check latest merged PRs
16:24 PM
and trying to restore our search cluster atm
16:24 PM
btw, solr cluster is back, new solr5 has twice cores, so it handles twice more requests than others, everything is back to normal
16:28 PM
lucifer

rimskii[m]: you can finish moving the spotify import for now. i'll look into soundcloud apis and let you know how we should proceed
16:30 PM
zas

twodoorcoupe: if you have any question about Picard, feel free to ask.
16:34 PM
twodoorcoupe

zas: thank you, I saw you have been doing a ton of refactoring lately
16:34 PM
rimskii[m]

<lucifer> "rimskii: you can finish moving..." <- okay, thanks a lot! ll let you know once I finish or mybe need a help xd
16:35 PM
lucifer

sure sounds good
16:35 PM
zas

Yes, we are heading towards Picard3 and there were a lot of things we couldn't do before without breaking too much stuff
16:36 PM
Picard3 will come with new plugin system and so compatibility will be broken anyway. outsidecontext is also experimenting moving to PySide6.
16:38 PM
twodoorcoupe

zas: I was planning to use extension points for registering processing functions. If that is bound to change I can modify it later on
16:39 PM
zerodogg has quit
16:39 PM
zas

The main change is we'll make plugin accessible extension points much more obvious, and we'll provide a "plugin API" to guarantee access to Picard internals.
16:40 PM
relaxoMob joined the channel
16:41 PM
In the past, maintaining plugins compat was always a bit of a problem, because plugins were accessing the Picard code freely. It will be still possible (that's Python), but if we break something not part of the "official" plugin API it'll be not our problem ;)
16:42 PM
Basically every patch had an underlying question: does it break one plugin? -> look for plugins code, etc... kinda impossible to manage with a growing list of plugins
16:44 PM
For your GSoC project, I think you can work on a PR over Picard code, without bothering plugins side while keeping in mind it will be a plugin at the end.
16:46 PM
The new plugin system should be out (I hope) before end of GSoC. I'll focus on it once the cleanup work will be done.
16:47 PM
twodoorcoupe

Makes sense, thank you. Do you and outsidecontext prefer to see a pr for each feature added, let's say each week, or a larger one every once in a while?
16:47 PM
Great, I had already planned to work on plugins side in the final weeks
16:47 PM
zas

We usually prefer to follow devel steps, so opening a draft PR early is often better: we can review and give you feedback along the way
16:48 PM
outsidecontext

hi
16:48 PM
zas

Hey outsidecontext
16:48 PM
twodoorcoupe

Ok, will do
16:49 PM
outsidecontext

twodoorcoupe: ideal would be multiple PRs, but for parts that make sense individually. doesn't need to be exactly weekly or such.
16:50 PM
zas

yes, subdivide your work in small reviewable bits
16:50 PM
outsidecontext

and you can open PRs already early as soon as you think there is something to discuss. Mark the PR as a draft and we can have an active discussion on anything
16:50 PM
twodoorcoupe

Perfect
16:50 PM
outsidecontext

we currently had multiple of those due to zas' many refactorings :)
16:59 PM
mayhem

<BANG>
17:00 PM
hello everyone. I'm a poor excuse for the usual spaniard running the show!
17:00 PM
lets jump right in with the mailed in reviews.
17:00 PM
aerozol says:
17:00 PM
https://www.irccloud.com/pastebin/7ryAarNJ/
17:00 PM
Tarun_0x0 joined the channel
17:01 PM
huh. kiwis say knock on wood, brits say touch wood. interesting.
17:01 PM
anways. reosarevok, who is currently flying somewhere above the EU, says:
17:01 PM
https://www.irccloud.com/pastebin/rFFNFk4K/
17:02 PM
https://www.irccloud.com/pastebin/2e1Ewx0Q/
17:02 PM
then Ericd says:
17:02 PM
Last week mostly played around with feed generation in Python with my feed reader. Found that Payoneer blocked me for not using Gmail during registration and luckily I fixed it :)
17:02 PM
And finally JadeBlueEyes:
17:02 PM
Nothing much from me from last week! I've been very busy with exams, but I've only got three more to go this week!
17:02 PM
After that, I'll work on setting up a local mock for testing sending emails
17:03 PM
That was it for mailed in reviews, lets jump to reviews for the rest of us.
17:03 PM
people up today: ansh, atj, bitmap, lucifer, mayhem, monkey, reosarevok, yvanzo, zas, akshaaatt, ApeKattQuest, outsidecontext, Tarun_0x0, ericd, JadedBlueEyes, kellnerd, pranav, rimskii, theflash_, twodoorcoupe, yellowhatpro, jasje, Pratha-Fish
17:03 PM
kellnerd: go!
17:03 PM
kellnerd

Can I go later? Not ready yet :D
17:03 PM
yvanzo

Sure
17:03 PM
kellnerd

Go yvanzo!
17:03 PM
yvanzo

Hi all!
17:04 PM
Last week I reviewed PRs and helped bitmap with the Event Art Archive (currently in beta).
17:04 PM
With atj we prepared switching search on beta.mb.o to the SolrCloud 9 instance (hopefully soon).
17:04 PM
theflash__ joined the channel
17:04 PM
With lucifer and mayhem we organized a video meeting with all the GSoC contributors and mentors: https://blog.metabrainz.org/2024/05/27/faces-of... (nice to see you all)
17:05 PM
mayhem

oh regarding the internet archive: https://bsky.app/profile/archive.org/post/3ktia...
17:05 PM
yvanzo

Also worked on SIR testing and Solr 9.6 update, upgraded the wiki server to Noble, plus some support and tickets triage.
17:05 PM
Good luck to the IA!
17:06 PM
Fin, go atj?
17:06 PM
atj

hello
17:06 PM
Last week I continued testing and tweaking the new Solr cluster. I configured ZooKeeper ACLs, wrote some scripts to try and balance leaders across the nodes in the cluster, improved the Ansible role and started working on documentation.
17:07 PM
I also worked with yvanzo to prepare the cluster for use as the search provider on beta.
17:08 PM
and bootstrapped the new Matrix server, and assisted lucifer in setting up Borg backups.
17:08 PM
that's all I can remember at this point! lucifer?
17:08 PM
lucifer

sure sounds good!
17:08 PM
hi all!
17:09 PM
last week, i worked on setting up chatbrainz, participated in the gsoc intro meeting, worked on existing LB PRs to fix dump issues, and speedup imports in spark.
17:10 PM
also worked on exploring setting up spark cluster with ansible and some misc LB bugs.
17:10 PM
that's it for me. mayhem next?
17:10 PM
mayhem

hey o
17:10 PM
Annoying week, dealing with lots of stupid stuff. like Hetzer abuse BS -- more on that during that topic meeting.
17:10 PM
I worked a bit on JSPF cleanup, since we identified a few ways in which we didn't meet the spec.
17:11 PM
Had two conference calls, one with researchers who are investigating the fairness of recommendation engines and they are
17:11 PM
interesting in working on that with us -- which is quite interesting. The other call was the GSoC welcome meeting which is all around lovely.
17:11 PM
This week I've got lots of small stuff to look after and then I can finally dig back into LB Local.
17:11 PM
fin. twodoorcoupe go!
17:12 PM
twodoorcoupe

hello folks!
17:12 PM
last week I finished preparing drafts for my gsoc project
17:13 PM
started working on filtering out cover art images by size