in #metabrainz

15:51 PM
wargreen joined the channel
15:58 PM
tykling has quit
15:58 PM
tykling joined the channel
16:06 PM
BrainzGit

[bookbrainz-data-js] 14MonkeyDo merged pull request #320 (03master…import-annotation): Support annotations for imported entities https://github.com/metabrainz/bookbrainz-data-j...
16:06 PM
[bookbrainz-site] 14MonkeyDo merged pull request #1106 (03import-entities…import-annotation-id): chore: Allow annotation revision ids to be null (rebased) https://github.com/metabrainz/bookbrainz-site/p...
16:14 PM
Kladky has quit
16:14 PM
Kladky joined the channel
17:09 PM
rimskii[m] has quit
17:11 PM
pranav[m] has quit
17:30 PM
discordbrainz has quit
17:31 PM
discordbrainz joined the channel
17:36 PM
twodoorcoupe[m] has quit
17:39 PM
atj[m] has quit
17:40 PM
outsidecontext[m has quit
17:40 PM
outsidecontext[m joined the channel
17:57 PM
BobSwift[m] has quit
18:02 PM
kellnerd[m]

<mayhem[m]> "https://blog.metabrainz.org/2024..." <- It took me a while, but I just realized that the 23rd birthday post is from exactly a year ago and today we are already celebrating the 25th... time flies by 😁
18:02 PM
mayhem[m]

wut?
18:02 PM
math, my nemesis.
18:05 PM
well, no one noticed for a whole year, so 🤷
18:13 PM
theflash[m] has quit
18:20 PM
atj[m] joined the channel
18:20 PM
atj[m]

[@lucifer:chatbrainz.org](https://matrix.to/#/@lucifer:chatbrainz.org): it would be good to have the LB Solr cluster managed using Ansible, considering all the work done for the MB cluster
18:20 PM
Is it a standard SolrCloud installation?
18:21 PM
lucifer[m]

atj: there is no cluster yet, we are still experimenting to see if it performs better than typesense (what we have now). if it does then we will create a proper cluster for production. but yes makes sense
18:22 PM
atj[m]

OK, well let me know how it goes. MB cluster seems to work well on ARM VMs which are cheaper and offer better performance.
18:23 PM
Sophist-UK has quit
18:23 PM
Don't set the Solr heap size too high would be my advice. Ideally you want to fit the indexes in page cache but I don't know how big they will be for LB
18:26 PM
lucifer[m]

makes sense, will keep it in mind.
18:28 PM
atj[m]

Are you storing documents in Solr or just IDs?
18:30 PM
Storing the MB document XML in Solr was a mistake IMV, it's resulted in the indexes being much bigger than they need to be and reduces performance significantly
18:30 PM
But then I don't know the whole history behind it.
18:36 PM
mayhem[m] uploaded an image: (546KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/chatbrainz.org/ugbPRzkrNHFCnDzqTPgdgqPa/image.png >
18:36 PM
mayhem[m]

LOLOLOLOL, I feel seen.
18:38 PM
leftmostcatUTC-7 joined the channel
18:38 PM
leftmostcatUTC-7

I feel trapped.
18:39 PM
lucifer[m]

<atj[m]> "Are you storing documents in..." <- JSON docs, only non searchable fields are mbids.
18:39 PM
<atj[m]> "But then I don't know the..." <- indeed, i have discussed with yvanzo in the past to get rid of it.
18:40 PM
i would like to get rid of it too for performance reasons, it slows down writing the response too imo.
18:40 PM
so a perf win in many ways if we can get rid of it.
18:41 PM
should probably restart that discussion now that solr 9 upgrade is done.
18:41 PM
mayhem[m]

<atj[m]> "But then I don't know the..." <- Back in the day we favored not having to do another DB query to fetch the data. we accepted the larger indexes as a tradeoff for less load on our DB server. but we're in a much different place now, in terms of scaling, hosting capabilities and money.
18:41 PM
lucifer[m]

would actually simplify a lot of stuff in Sir too.
18:41 PM
mayhem[m]: also it is XML centric.
18:42 PM
XML requests are served directly iirc.
18:42 PM
whereas JSON requests need to deserialize the xml and reserialize it to JSON to serve the request
18:42 PM
atj[m]

and it's a shitty Java dependency that nobody in their right mind should want to deal with
18:43 PM
lucifer[m]

yeah fair and solr has built in xml support too iirc so if we want xml we can get that directly anyway.
18:44 PM
atj[m]

Built in XML and JSON AFAIU
18:44 PM
lucifer[m]

yup.
18:50 PM
Jigen

badly organised bookshelf >_<
18:50 PM
https://usercontent.irccloud-cdn.com/file/8X35t...
19:45 PM
Maxr1998_ joined the channel
19:47 PM
Maxr1998 has quit
19:48 PM
I regret the amount of time and effort I spent trying to make a fun image to post in my comment
19:48 PM
wordpress mangled the link and the transparency broke anyway.
19:49 PM
REGRET
19:49 PM
:(
19:50 PM
Maxr1998_ has quit
19:53 PM
btw birthday blogpost isn't on twitter? i wanted to retweet it
19:53 PM
Maxr1998 joined the channel
19:57 PM
Maxr1998 has quit
20:13 PM
Sophist-UK joined the channel
20:16 PM
Maxr1998 joined the channel
20:47 PM
anyway https://usercontent.irccloud-cdn.com/file/HHDso...
20:48 PM
ahvalmissaamine

https://usercontent.irccloud-cdn.com/file/h7ceu...
20:53 PM
BrainzGit

[bookbrainz-site] 14kellnerd opened pull request #1107 (03import-entities…import-annotation): Display and preserve annotation of imported entities https://github.com/metabrainz/bookbrainz-site/p...
21:17 PM
[bookbrainz-data-js] 14kellnerd opened pull request #321 (03master…import-annotation): Import annotation https://github.com/metabrainz/bookbrainz-data-j...
21:28 PM
Kladky has quit
21:46 PM
Sophist-UK has quit
21:58 PM
Jade[m]

We have bulk email sending!
21:58 PM
Jade[m] uploaded a video: (37662KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/IAFAcXKdShPDdQbqNrSRFtKm/Screen%20Recording%202024-07-17%20225349.mp4 >
21:58 PM
500 emails in 7745ms
21:58 PM
It sends them in parallel, with a configurable concurrency limit
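
For readers following along, parallel sending with a configurable concurrency limit can be sketched roughly as below. This is only an illustration, not Jade's implementation: the language (Python with `asyncio`), the names `send_all` and `fake_send`, and the default `limit` are all assumptions, and `fake_send` stands in for a real SMTP call.

```python
import asyncio


async def send_all(messages, send, limit=6):
    """Send every message concurrently with at most `limit` sends in flight.

    `send` is any coroutine function taking one message (hypothetical here).
    Results come back in the same order as `messages`.
    """
    gate = asyncio.Semaphore(limit)

    async def send_one(message):
        # The semaphore blocks once `limit` sends are already running.
        async with gate:
            return await send(message)

    return await asyncio.gather(*(send_one(m) for m in messages))


async def fake_send(message):
    # Stand-in for handing one email to an SMTP relay (assumption).
    await asyncio.sleep(0.001)
    return f"sent:{message}"


if __name__ == "__main__":
    batch = [f"mail-{i}" for i in range(500)]
    results = asyncio.run(send_all(batch, fake_send, limit=6))
    print(len(results), "messages handed to the sender")
```

The semaphore is what makes the limit tunable: raising it adds parallelism until the relay or the machine becomes the bottleneck, which matches the 16 vs 8 vs 6 sender comparison in the messages that follow.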
21:59 PM
The performance is probably limited by the SMTP relay atm
21:59 PM
^ that was 16 senders
22:00 PM
with 8 it's 3780 ms, so it was probably overloading the relay
22:01 PM
6 is optimal for my machine, at 3605 ms - or just 7.2 ms per email!!
22:03 PM
wait, this is in debug mode too haha
22:08 PM
Jigen

https://usercontent.irccloud-cdn.com/file/c0t5A...
22:31 PM
Jade[m]

Running it in release mode is significantly faster - 500 emails in 801ms, or 1.602ms per email
22:34 PM
Jade[m] uploaded an image: (6KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/matrix.org/GYXBmyfKDmAKhJdjeyWOtINQ/image.png >
22:34 PM
It scales too - 5000 emails in 7987 ms. At the cost of pegging all cores for that time lol
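
The per-email figures quoted in this thread are just total time divided by message count. A small sketch that redoes the arithmetic for the runs mentioned above (the helper names are made up for illustration; the sender count for the release-mode runs was not stated, so it is left out):

```python
def per_email_ms(emails, total_ms):
    """Average wall-clock milliseconds per email for one run."""
    return total_ms / emails


def emails_per_second(emails, total_ms):
    """Throughput of one run in emails per second."""
    return emails / (total_ms / 1000)


# (label, emails, total milliseconds) as reported in the chat.
runs = [
    ("debug, 16 senders", 500, 7745),
    ("debug, 8 senders", 500, 3780),
    ("debug, 6 senders", 500, 3605),
    ("release", 500, 801),
    ("release", 5000, 7987),
]

for label, emails, total_ms in runs:
    print(
        f"{label}: {per_email_ms(emails, total_ms):.3f} ms/email, "
        f"{emails_per_second(emails, total_ms):.0f} emails/s"
    )
```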
22:40 PM
aerozol[m] has quit