#metabrainz

/

      • BobSwift[m]
        throughbirds: I suspect that you might be a bit late to get started with your proposal. See https://wiki.musicbrainz.org/Development/Summer... for more information.
      • HSOWA joined the channel
      • Jigen joined the channel
      • ApeKattQuest has quit
      • LupinIII has quit
      • function1 has quit
      • function1 joined the channel
      • genius3000 has quit
      • genius3000 joined the channel
      • minimal has quit
      • vardhan joined the channel
      • nanobri[m] has quit
      • vardhan_ joined the channel
      • vardhan has quit
      • pite has quit
      • d4rkie has quit
      • d4rkie joined the channel
      • Kladky joined the channel
      • vardhan_ has quit
      • vardhan_ joined the channel
      • lucifer[m]
        throughbirds: i think a very basic implementation of the dashboard that given a room, retrieves the latest messages, stores it in the database and displays it to the user would be good. julian45 might have more thoughts.
      • vardhan_ has quit
      • SigHunter has quit
      • SigHunter joined the channel
      • mayhem[m]
        lucifer: kiss ran out of disk.
      • a zstd compress is running, which will likely fail now.
      • I'll try and remove an old dump
      • site back up now.
      • lucifer[m]
        [@mayhem:chatbrainz.org](https://matrix.to/#/@mayhem:chatbrainz.org) I see. I'll look at it in a bit.
      • derat[m]
        reosarevok bitmap aerozol seeking your opinions on how release editor documentation bubble text should be phrased for https://tickets.metabrainz.org/browse/MBS-5944! some options:... (full message at <https://matrix.chatbrainz.org/_matrix/media/v3/...>)
      • BrainzBot
        MBS-5944: Release editor: add documentation bubbles for extra fields
      • derat[m]
        (the second and third ones are existing strings. i think the second one needs a hyphen in "identically-named" and the third one shouldn't have a comma splice)
      • reosarevok[m]
        Well, the third probably could be two sentences instead. I don't have a clear preference but it would make sense to make them somewhat consistent. I think we might have yet another case where it's just "Blah blah blah blah ({doc_link|documentation})" as well?
      • I'm not sure of the second needs a hyphen or it's more of your love affair with hyphens, but :D
      • Ideally we'd come up with one style we like and try to adapt all to it - I'd love to see what aerozol thinks would be the best option
      • derat[m]
        oh yeah, i guess there's also this one:
      • 4. Review the **list of packaging types** for help.
      • reosarevok[m]
        💖✨️consistency✨️💖
      • derat[m]
        /artist/create uses "Select any type from the list to see its description. If the work doesn’t seem to match any type, just leave this blank." and then shows per-option text. i don't love that approach since it increases the number of places that need to be updated if/when the guidelines change
      • bonus points for the "work" typo when selecting an artist type
      • reosarevok[m]
        The per-option text there is loaded from the database, in any case
      • derat[m]
        ah, interesting
      • reosarevok[m]
        All types are supposed to have definitions, although not all of them do rn
      • And that's supposed to be the most official one - ideally the docs would also use the same strings automatically
      • That said, that only works for a few things, like relationships and types
      • Since most guidelines and the like are just text on a doc page :)
      • derat[m]
        there's a separate issue of displaying a bubble for the artist field. i think that it uses KnockoutArtistCreditEditor, and i have no idea how that interacts with the controlsBubble knockout stuff that's used for the regular inputs in root/release/edit/information.tt
      • (fwiw, apologies for all of my changes adding more knockout junk that'll make it harder to move the release editor to react :-/)
      • reosarevok[m]
        That bit is where bitmap can certainly help better :)
      • BrainzGit
        [musicbrainz-server] 14derat opened pull request #3508 (03master…type_doc): Make type_bubble() refer to "entity" rather than "work" https://github.com/metabrainz/musicbrainz-serve...
      • lucifer[m]
        mayhem: no extra files or anything this time, we need more space to do the full dumps. the uncompressed data is too large even if after zstd compression it takes only 200250 GB.
      • s/200250/200-250/
      • BrainzGit
        [musicbrainz-server] 14reosarevok merged pull request #3508 (03master…type_doc): Make type_bubble() refer to "entity" rather than "work" https://github.com/metabrainz/musicbrainz-serve...
      • mayhem[m]
        how much disk space do we need then?
      • lucifer[m]
        i think 1 TB more would be enough.
      • mayhem[m]
        I think we may need to upgrade servers then. let me check options.
      • another alternative is to add an extra drive to the existing server. it wouldnt need RAID-1 for building dumps, right?
      • lucifer[m]
        mayhem: don't think so.
      • mayhem[m]
        ok.
      • zas: do you know how we can find out what options we have for adding a drive to kiss? we need another TB of non RAID storage.
      • zas[m]
        let me check
      • Not sure, this type of server isn't listed anymore, so we have to contact Sales to know what options we have. But currently it uses 2x1.92TB nvme drives. I guess we can add more, but I fear the price will be high (as most options at Hetzner). What are we targeting to size-wise?
      • We can convert RAID1 to RAID0, if we can afford the risk.
      • mayhem[m]
        1TB
      • and we can afford the risk, since this is for making data dumps. if the drive fails, we run a new dump, no big deal.
      • and another server looks to be more expensive and not needed, so just a single disk for now would be good.
      • zas[m]
        Is a complete system erase possible for this machine? (I can try to do it while keeping data, but ... it may fail)
      • mayhem[m]
        even for just adding one drive?? we want to leave the existing drives alone.
      • I think if we need to prepare for a whole system erase, then we might as well just upgrade. its easier.
      • zas[m]
        ah, I thought you wanted to reclaim 1Tb from existing drives using RAID0. My bad. We can ask for one more drive.
      • mayhem[m]
        great
      • zas[m]
        I sent a request to Hetzner to see which options we have on this type of server.
      • tykling
        they might even be able to do you a usb stick, the biggest in the shop are 64gb but they do have larger ones
      • mayhem[m]
        and we'll be able to create one data dump every 6 months with a USB stick. :)
      • tykling
        :)
      • bitmap[m]
        <derat[m]> "reosarevok bitmap aerozol..." <- I'm not sure strings like "this is the release name" or "please enter the barcode" are very useful, they're just repeating the field name back to the user. so I'd prefer something like the disambiguation one where it's a short summary of what the field is for + a doc link, or alternatively just a bare doc link
      • agreed about the hyphen + comma splice though I don't have a strong opinion about the former :)
      • derat[m]
        bitmap: thanks! if aerozol agrees, i'll go with that style for all of these
      • <derat[m]> "there's a separate issue of..." <- bitmap: do you have any ideas about this? it seems pretty gnarly at first glance, so i might just leave the artist autocomplete bubble-less for now
      • pite joined the channel
      • vardhan_ joined the channel
      • bitmap[m]
        <derat[m]> "bitmap: do you have any ideas..." <- hmm, the simplest option is probably just to use [jquery event delegation](https://learn.jquery.com/events/event-delegation/) (maybe install a handler somewhere in release-editor/init.js) and call `bubble.show()` in there if the AC input is focused
      • although you can't focus that field if a complex AC is entered 🤔
      • derat[m]
        thanks, i'll look into doing that! my main goal with this is just to provide guidance to beginners, so i'm happy enough as long as the bubble gets shown when an initially-empty field is focused
      • lucifer[m]
        mayhem: btw on the similarity data front, i replaced recording_mbids with recording_id's that reduced memory usage 4x and storage space usage 8x.
      • mayhem[m]
        smrt. I do the same in faster_fuzzy.
      • lucifer[m]
        i still need to handle redirects so it might give away some gains but yes lets see.
      • haha cool.
      • i have noticed that in a lot of cases parquet doesn't compresses better than zstd compressed csv's and i think it does have to do a lot with uuids vs numeric ids.
      • because 80% of our data is just that.
      • another thing i noticed is that zstd compresses ~2x better for mlhd if we order by a column.
      • mayhem[m]
        and those are text UUIDs or binary UUIDs?
      • lucifer[m]
        parquet treats them as text i think.
      • BrainzGit
        [musicbrainz-server] 14mwiencek merged pull request #3498 (03master…mbs-12170): MBS-12170: Output containing country code for all artists in web service https://github.com/metabrainz/musicbrainz-serve...
      • lucifer[m]
        or arrow at least. i need to look it up again but last time i checked due to reason it was being treated as text.
      • mayhem[m]
        yeah, I think PG is one of the few tools that does treat them as binary
      • lucifer[m]
        yeah spark doesn't have a uuid type either.
      • i am curious how better the stats/dumps processing for spark can be if we switch to ids instead of uuids and then before sending the stats back to LB, switch id to mbid.
      • the unmapped listens create troubles though so...
      • mayhem[m]
        I would expect it to go as well as the instances you've already seen
      • maybe consider a dense encoding of UUID?
      • lucifer[m]
        hmm could be useful but those aggregating queries also have recording_name etc in group by because mbids are optional for stats anyway.
      • m1gr has quit
      • m1gr joined the channel
      • minimal joined the channel
      • mayhem[m]
        lucifer: signging up for a deezer account was a bad idea. much spam and the unsubscribe page is 404.
      • lucifer[m]
        Rip. I don't like deezer too because I can't even view their developer docs without vpn and even that is aggressively blocked.
      • suvid[m] joined the channel
      • suvid[m]
        <lucifer[m]> "Rip. I don't like deezer too..." <- I think that's cuz deezer is not present in India right?
      • lucifer[m]
        Yes.
      • vardhan_ has quit
      • Sophist-UK joined the channel
      • aerozol[m]
        derat: bitmap reosarevok Agreed with bitmap that I would cut out the parts that are just repeating what the field is. Unless it addresses a common problem - for instance if users often get confused about digital barcodes: “Please enter the barcode from the physical packaging or the digital storefront of the release” (just an example, I imagine reo will have an idea of what fields new users really struggle with/what could benefit
      • from being spelled out)
      • For tutorial/guide stuff I would always use short sentences if possible. It reads less natural, but for those with bad English it is much easier to read. You can google “plain English” for the philosophy behind that. I have also embedded it into our writing styleguide, which everyone has forgotten about I’m sure :P https://github.com/metabrainz/design-system/blo...
      • All the “pleases” are definitely consistent with reosarevok polite style throughout our guides and docs! 🥰
      • DarthGandalf is now known as [LLM]DarthGandal
      • mthax_ joined the channel
      • mthax has quit
      • lucifer[m]
        zas: hi! let me know when you are around tomorrow, i would like a CCX63 VM with a 5 TB storage volume.
      • SigHunter has quit
      • SigHunter joined the channel
      • mayhem[m]
        <lucifer[m]> "zas: hi! let me know when you..." <- Back in my uni days we toured amdahl, a fault tolerant mainframe computer manufacturer. they had a silo with a robot arm and gazillions of tapes and tape readers... it was quite impressive. and it stored 1/4TB. and you just casually asked for a 5TB machine, like nothing. lolololol
      • lucifer[m]
        😆
      • Kladky has quit
      • nelgin
        Is it easy/difficult/imposssible to get the actual cover art into the docker image?
      • kepstinbrainz joined the channel
      • kepstinbrainz
        nelgin (IRC): metabrainz doesn't store the actual cover art, that's all hosted by archive.org - my understanding is that it's far too much data to reasonably put into a docker container.
      • [LLM]DarthGandal is now known as DarthGandalf
      • nelgin
        Ah ok, fair enough.
      • BrainzGit
        [musicbrainz-server] 14derat opened pull request #3509 (03master…bubble_sizing): Decrease excess vertical padding in documentation bubbles. https://github.com/metabrainz/musicbrainz-serve...