I can increase the limit again from 4096, however I'm not sure how to determine the number of clauses being generated by the queries were failing earlier in order to get an idea of what a suitable value might be.
2024-06-20 17245, 2024
d4rk has quit
2024-06-20 17238, 2024
d4rk joined the channel
2024-06-20 17212, 2024
zas[m]
atj: moooin, it seems those limits can be set globally or per collection in solr9, but documentation doesn't anything about "correct" values.What I know is we get more 500s than before. But perhaps those 500s are expected, since they are triggered but certain queries.
2024-06-20 17231, 2024
zas[m]
s/but/by/
2024-06-20 17200, 2024
atj[m]
I don't think there is a "correct" value because it depends on your schema and various query parameters
2024-06-20 17234, 2024
zas[m]
I don't think those queries come from Picard (to know better we need to find matching request User Agent)
2024-06-20 17258, 2024
atj[m]
the problem we have is that each word in the query is expanded to query every (?) field in the entity
2024-06-20 17212, 2024
zas[m]
yes, I mean correct values for a certain domain, but they don't provide much guidelines
2024-06-20 17234, 2024
atj[m]
so "the~" is expanded to (artistname:the~2)^1.2 | comment:the~2 | barcode:the~2 | (releaseaccent:the~2)^2.0 | (release:the~2)^1.5 | label:the~2 | ngram:the~2 | (alias:the~2)^1.2 | (creditname:the~2)^1.2 | tag:the~2)
2024-06-20 17205, 2024
atj[m]
(for release)
2024-06-20 17218, 2024
atj[m]
i don't really understand how all this works though to be honest
2024-06-20 17236, 2024
atj[m]
feels like you need a PhD in search
2024-06-20 17254, 2024
atj[m] uploaded an image: (69KiB) < https://matrix.chatbrainz.org/_matrix/media/v3/download/chatbrainz.org/lGWgyHRUcwvehDJJkNsmdlct/image.png >
2024-06-20 17255, 2024
atj[m]
this is quite nice to see, huge improvement in latency, both faster and more consistent
2024-06-20 17251, 2024
atj[m]
<zas[m]> "yes, I mean correct values for a..." <- i think i will try doubling it again to 8192 and see if that is enough for the remaining outliers
2024-06-20 17203, 2024
atj[m]
we haven't seen any performance degradation from increasing it to 4096
2024-06-20 17251, 2024
outsidecontext[m joined the channel
2024-06-20 17251, 2024
outsidecontext[m
lucifer, monkey: rdswift noted that we probably should adapt the IRC link in the Picard website footer now that we have moved to matrix. The same would also apply to LB and CB. Had there already been some talk about this? We could replace the IRC link with a link to the matrix room. Or we keep both.
2024-06-20 17211, 2024
zas[m]
atj: yes; performance is much better (more nodes, more powerful), but what I see is that scaling is easier now (thanks to the new structure and Ansible).
2024-06-20 17215, 2024
zas[m]
old cluster nodes will need to be removed ASAP, likely next week (if everything is ok with new cluster).
2024-06-20 17212, 2024
zerodogg
Hi, my weekly playlists from listenbrainz have been stuck since march. I've tried reconnecting spotify several times, without any luck. Anyone I can contact to have look at it?
2024-06-20 17251, 2024
mayhem[m]
hi zerodogg (IRC) we're working on the computing cluster that computes all of this and should have this resolved soon, sorry.
2024-06-20 17240, 2024
lucifer
uhh weird, should have been processed on tuesday itself. checking.
2024-06-20 17207, 2024
zerodogg
mayhem[m]: aha, thanks! :)
2024-06-20 17202, 2024
lucifer
outsidecontext[m: no discussions afaik but i guess we could point to the communications wiki page which lists both matrix and irc
2024-06-20 17224, 2024
Rotab has quit
2024-06-20 17238, 2024
Rotab joined the channel
2024-06-20 17245, 2024
Jade[m] has quit
2024-06-20 17254, 2024
atj[m]
<zas[m]> "atj: yes; performance is much..." <- I'm a bit disappointed that the nodes are reading from the disk so much. Wish the VMs were available with 48GB of RAM
2024-06-20 17240, 2024
atj[m]
I may look at reducing the replication factor to try and reduce the size of each nodes indexes.
2024-06-20 17220, 2024
yvanzo[m]
The fact that the indexer can be prioritized by HAProxy is big improvement given last week-end’s incident.
2024-06-20 17248, 2024
atj[m]
yvanzo[m]: We'll have to wait and see how much difference that makes to be honest!
2024-06-20 17256, 2024
atj[m]
But it's something
2024-06-20 17210, 2024
yvanzo[m]
<zas[m]> "old cluster nodes will need to..." <- We still need it for dumps at the moment.
2024-06-20 17211, 2024
yvanzo[m]
<outsidecontext[m> "lucifer, monkey: rdswift noted..." <- MB website footer has been updated in beta already.
2024-06-20 17218, 2024
rimskii[m]
<lucifer> "rimskii: hmm i see, okay try..." <- once I run this command I should not kill the terminal right?
2024-06-20 17246, 2024
rimskii[m]
I ran this command yesterday (took ~4 hours), and I wanted to test it today
2024-06-20 17256, 2024
rimskii[m]
but seems like I have to create the tables again?
2024-06-20 17256, 2024
mayhem[m]
I see data, rimskii . are you getting an error? if so, what is it?
2024-06-20 17258, 2024
lucifer
rimskii[m]: i see the tables exist on wolf, so no need to create tables again. are you getting any error?
2024-06-20 17212, 2024
mayhem[m]
lololol
2024-06-20 17222, 2024
lucifer
XD
2024-06-20 17222, 2024
rimskii[m]
lol
2024-06-20 17233, 2024
rimskii[m]
i mean its not seeing the table "relation "mapping.canonical_recording_redirect" does not exist"
2024-06-20 17237, 2024
rimskii[m]
but its happening now
2024-06-20 17244, 2024
rimskii[m]
when I run yesterday, it hadn't that issue, but was running infinitely and not outputing anything
2024-06-20 17206, 2024
mayhem[m]
good, that table has 6677454 rows in it. :)
2024-06-20 17225, 2024
rimskii[m]
I am just afraid if I run "docker run --rm -it --network musicbrainz-docker_default metabrainz/mbid-mapping python3 manage.py canonical-data --use-mb-conn" it will start rebuilding the whole db again
2024-06-20 17235, 2024
lucifer
yes, do not run it again.
2024-06-20 17215, 2024
lucifer
just run the ssh command and start up labs_api container and it should all work.
ah, ok, on MB. yes, but the footer is different. Picard, LB and CB all share the same footer layout, hence I asked. When we change that it should be consistent
2024-06-20 17205, 2024
Maxr1998 joined the channel
2024-06-20 17238, 2024
Maxr1998_ has quit
2024-06-20 17250, 2024
monkey[m]
<yvanzo[m]> "just misread L instead of M" <- MistenBrainz
2024-06-20 17237, 2024
zas[m]
atj: what's unclear to me is why some nodes are reading from disk much more than others. I guess it relates to cores/shards. That said, it reads from fast SSDs so that's not much an issue. Though the biggest core is recording with ~75Gb, all others are much smaller, the second next one is release-group with 7.6Gb, this should fit in ram... does solr use all available ram?
2024-06-20 17252, 2024
yvanzo[m]
outsidecontext: Got it but it still is the same link :)
2024-06-20 17205, 2024
outsidecontext[m
yvanzo: Nearly, or not yet. The footers of the other projects currently link directly to the metabrainz room via kiwiirc.com. But as lucifer suggested we probably should change this to go to the doc page instead.
atj: it would be interesting to increase the memory for one solr instance and see if it makes any difference regarding disk I/O. Current value is 8g, I would suggest to set it to 12g on one node, and observe changes.
rimskii[m]: i will need to create some more tables manually on wolf by dumping data from production. i will let you know in a few hours.
2024-06-20 17239, 2024
rimskii[m]
ahhh i see
2024-06-20 17248, 2024
rimskii[m]
okay thank you !
2024-06-20 17257, 2024
rimskii[m]
i will work on other thing then
2024-06-20 17243, 2024
rimskii[m]
<lucifer> "rimskii: i will need to create..." <- wait can’t I just import the sql data dump you sent me?
2024-06-20 17200, 2024
lucifer
rimskii[m]: that is for apple music, not spotify.
2024-06-20 17223, 2024
d4rk-ph0enix joined the channel
2024-06-20 17224, 2024
d4rk has quit
2024-06-20 17231, 2024
zas[m]
yvanzo: what needs to be done to move dumps to new cluster?
2024-06-20 17209, 2024
d4rk-ph0enix has quit
2024-06-20 17242, 2024
d4rk joined the channel
2024-06-20 17228, 2024
Jade[m] joined the channel
2024-06-20 17228, 2024
Jade[m]
bitmap: I just realised I misread the timings and this *GSoC Session: ‘Contributor Evaluations with GSoC Admins’* thing is at the same time as our weekly catch up
2024-06-20 17238, 2024
Jade[m]
Are you OK with having our meeting an hour later, or should I just record the event to watch later?
2024-06-20 17254, 2024
bitmap[m]
Jade: no problem, I'm fine with meeting later :) just ping me when you're around
[musicbrainz-server] 14reosarevok opened pull request #3293 (03master…MBS-13630): MBS-13630: Prioritize "Voting is closed" as no vote rights reason in AE elections https://github.com/metabrainz/musicbrainz-server/…
2024-06-20 17204, 2024
minimal joined the channel
2024-06-20 17232, 2024
d4rk has quit
2024-06-20 17257, 2024
d4rk joined the channel
2024-06-20 17206, 2024
reosarevok[m]
yvanzo: https://github.com/metabrainz/musicbrainz-server/… is supposed to be ready now after bitmap made some improvements. Do you have any time to review / test? :) I'll test it further too but I expect we should all review this before we consider merging it
2024-06-20 17227, 2024
reosarevok[m]
(probably should release the EAA beta first and merge this to the next beta anyway, but that's a second matter)
2024-06-20 17212, 2024
yvanzo[m]
Hi reosarevok: Nice to see it updated, I will review it in priority among MBS PRs, hopefully this week.
2024-06-20 17254, 2024
yvanzo[m]
zas: (1) Adapt the MBS container to pack the new Solr backup format. (2) Move mirrors from Solr standalone to SolrCloud (and still allow them to either build collections or load backups).
2024-06-20 17211, 2024
yvanzo[m]
Cluster-side, atj already deployed NFS and I tested that.
2024-06-20 17218, 2024
yvanzo[m]
We might need to allow some connection from the MBS container to the SolrCloud cluster.
2024-06-20 17237, 2024
yvanzo[m]
That should be all for our infrastructure. (2) is mirrors only.
2024-06-20 17245, 2024
bitmap[m]
I copied the EAA types to https://wiki.musicbrainz.org/Event_Art/Types yesterday BTW. But I couldn't come up with an introductory description (besides "the event art type describes the type of event art" 😅)
2024-06-20 17220, 2024
yvanzo[m]
bitmap: I cannot find the EAA types in the translation source strings.
2024-06-20 17256, 2024
bitmap[m]
You're right, it's missing from extract_pot_db. I'll add it
2024-06-20 17206, 2024
yvanzo[m]
reosarevok: I have suggestions for the EAA type descriptions, how should I proceed, making separate revisions of the wiki page above?
2024-06-20 17255, 2024
reosarevok[m]
The descriptions are stored in the DB
2024-06-20 17214, 2024
reosarevok[m]
So it seems easiest to just discuss them here and then make the change on both places as needed
2024-06-20 17240, 2024
reosarevok[m]
I hope they don't make them longer because IIRC aerozol was already a bit annoyed with the length of some of them :D
reosarevok: OK, for _Poster_: I suggest replacing “A poster” with “Usually vertical image” to go along with the description of _Banner_.
2024-06-20 17242, 2024
reosarevok[m]
aerozol: ^ opinion? :) I'm okay with it as long as you are
2024-06-20 17231, 2024
yvanzo[m]
Should we rather discuss these in the MB channel?
2024-06-20 17226, 2024
reosarevok[m]
Are there a lot of suggestions? If not, we already started here :) But if there are we can move
2024-06-20 17240, 2024
yvanzo[m]
reosarevok: It is 3:30 AM for our favorite kiwi.
2024-06-20 17255, 2024
reosarevok[m]
Yeah, I know, I was expecting to wait for feedback
2024-06-20 17254, 2024
reosarevok[m]
"A poster" does seem a bit silly, just dunno if "Usually vertical image" helps with poster as much as horizontal helps with banner, since poster is probably more obvious than banner? But we can have it, unless there's a lot of non-vertical posters :D
2024-06-20 17252, 2024
d4rk has quit
2024-06-20 17217, 2024
d4rk joined the channel
2024-06-20 17250, 2024
BrainzGit
[musicbrainz-server] 14reosarevok merged pull request #3293 (03master…MBS-13630): MBS-13630: Prioritize "Voting is closed" as no vote rights reason in AE elections https://github.com/metabrainz/musicbrainz-server/…
2024-06-20 17257, 2024
yvanzo[m]
Describing foo with A foo just doesn’t help.
2024-06-20 17235, 2024
atj[m]
<yvanzo[m]> "That should be all for our..." <- I didn't set up `rrsync` yet, is that a requirement?
2024-06-20 17245, 2024
d4rk has quit
2024-06-20 17210, 2024
d4rk joined the channel
2024-06-20 17214, 2024
yvanzo[m]
atj: Good catch! Only SSH access will be required.
2024-06-20 17230, 2024
atj[m]
<zas[m]> "atj: what's unclear to me is why..." <- The documentation indicates that Solr uses MMAP to read the Lucene indexes so that they can be stored in the Linux page cache.
2024-06-20 17241, 2024
reosarevok[m]
yvanzo: I agree with that, it's just there to make it into a full sentence 😅 But your wording might be better here