#metabrainz

      • alastairp
        (because that PR changed a _lot_ of lines)
      • 2021-04-27 11753, 2021

      • _lucifer
        yeah makes, it's now just adding a handler in initial conf and be done with it.
      • 2021-04-27 11758, 2021

      • _lucifer
        *makes sense
      • 2021-04-27 11741, 2021

      • leroivi has quit
      • 2021-04-27 11750, 2021

      • _lucifer
        i was thinking of removing the pytest deps from requirements_spark.txt by adding another step to build a test image in Dockerfile.spark. thoughts?
      • 2021-04-27 11726, 2021

      • alastairp
        yeah, no problem. though can we reuse requirements_dev to install them?
      • 2021-04-27 11712, 2021

      • _lucifer
        no actually we have them in requirements_spark.txt as well.
      • 2021-04-27 11721, 2021

      • _lucifer
        ah yes right we can
      • 2021-04-27 11736, 2021

      • MRiddickW joined the channel
      • 2021-04-27 11727, 2021

      • alastairp
        yeah, great. I just didn't want to hard-code dependencies in Dockerfile, and I didn't want _another_ requirements file
      • 2021-04-27 11756, 2021

      • mckean has quit
      • 2021-04-27 11724, 2021

      • mckean joined the channel
      • 2021-04-27 11726, 2021

      • reosarevok
        yvanzo: around?
      • 2021-04-27 11734, 2021

      • _lucifer
        alastairp: what do you think we should do about pyspark dep? adding to requirements_development.txt seems wrong. if we keep in requirements_spark.txt it'll be for sake of tests only.
      • 2021-04-27 11753, 2021

      • alastairp
        ah, interesting question
      • 2021-04-27 11704, 2021

      • _lucifer
        also, i saw you added `sentry_sdk[pyspark]` instead of just `sentry_sdk` any particular reason?
      • 2021-04-27 11724, 2021

      • alastairp
        I think it makes sense to keep it in requirements_spark.txt, it feels like the better place for it
      • 2021-04-27 11752, 2021

      • alastairp
        mmm, good point. sentry_sdk[pyspark] ensures that pyspark is also installed (it makes it a dependency)
      • 2021-04-27 11729, 2021

      • _lucifer
        i don't think we want that as pyspark should be provided externally by spark
      • 2021-04-27 11737, 2021

      • _lucifer
        and we uninstall it just after anyways
      • 2021-04-27 11753, 2021

      • alastairp
        but if the spark setup provides pyspark without us needing to add it, perhaps just sentry_sdk is OK
      • 2021-04-27 11703, 2021

      • alastairp
        why do we uninstall it?
      • 2021-04-27 11719, 2021

      • alastairp
        I don't know how the spark/pyspark thing works
      • 2021-04-27 11739, 2021

      • alastairp
        how does spark provide pyspark? does it put it in sys.path? is it before or after the rest of our dependencies?
      • 2021-04-27 11739, 2021

      • _lucifer
        because we pack the deps in venv in a zip and send it to all spark workers
      • 2021-04-27 11740, 2021

      • yvanzo
        reosarevok: yup
      • 2021-04-27 11756, 2021

      • reosarevok
        yvanzo: see my comments on the PR :)
      • 2021-04-27 11744, 2021

      • _lucifer
        i am not familiar with how it works under the hood to supply pyspark.
      • 2021-04-27 11731, 2021

      • alastairp
        neither am I
      • 2021-04-27 11747, 2021
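
The sys.path question above can be checked empirically from inside the Spark container. A minimal sketch, assuming nothing about how Spark itself injects pyspark: `locate_module` is a hypothetical helper that reports where Python would import a module from and at which position in `sys.path` that location sits.

```python
import importlib.util
import os
import sys


def locate_module(name):
    """Return (origin, sys_path_index) for a top-level module, or None if it cannot be found.

    sys_path_index is None when the module does not come from a sys.path
    entry (for example built-in or frozen modules).
    """
    spec = importlib.util.find_spec(name)
    if spec is None:
        return None
    if spec.submodule_search_locations:
        # Package: the sys.path entry is the parent of the package directory.
        parent = os.path.dirname(list(spec.submodule_search_locations)[0])
    elif spec.origin and os.path.isabs(spec.origin):
        # Plain module: the sys.path entry is the directory holding the file.
        parent = os.path.dirname(spec.origin)
    else:
        return spec.origin, None
    normalized = [os.path.normcase(os.path.abspath(p)) for p in sys.path]
    target = os.path.normcase(os.path.abspath(parent))
    index = normalized.index(target) if target in normalized else None
    return spec.origin, index


if __name__ == "__main__":
    # A lower index means the location is searched earlier.
    for module in ("pyspark", "sentry_sdk"):
        print(module, locate_module(module))
```

Comparing the two indexes would show whether the Spark-provided pyspark is searched before or after the packaged dependencies.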

      • alastairp
        sorry, I don't have a lot of time today. I'll be at the office tomorrow and would be happy to talk through it with you
      • 2021-04-27 11739, 2021

      • _lucifer
        sure, thanks!
      • 2021-04-27 11744, 2021

      • yvanzo
        reosarevok: thanks, replied!
      • 2021-04-27 11735, 2021

      • sumedh joined the channel
      • 2021-04-27 11737, 2021

      • sumedh has quit
      • 2021-04-27 11709, 2021

      • D4RK has quit
      • 2021-04-27 11705, 2021

      • sumedh joined the channel
      • 2021-04-27 11703, 2021

      • _lucifer
        Mr_Monkey: ping
      • 2021-04-27 11711, 2021

      • Mr_Monkey
        Hai !
      • 2021-04-27 11739, 2021

      • _lucifer
      • 2021-04-27 11745, 2021

      • _lucifer
        I saw you added some mockups here
      • 2021-04-27 11758, 2021

      • _lucifer
        which one should we go ahead with?
      • 2021-04-27 11717, 2021

      • Mr_Monkey
        Which one do you think looks best and clearest?
      • 2021-04-27 11739, 2021

      • Mr_Monkey
        I'm not a fan of option #1, a bit simple.
      • 2021-04-27 11753, 2021

      • _lucifer
        the last one is super nice :D
      • 2021-04-27 11714, 2021

      • Mr_Monkey
        Let's go with that then :)
      • 2021-04-27 11729, 2021

      • _lucifer
        second one is also nice
      • 2021-04-27 11754, 2021

      • Mr_Monkey
        We can always revisit the design after an initial implementation
      • 2021-04-27 11735, 2021

      • _lucifer
        sure, do you have this implemented or is it to be done?
      • 2021-04-27 11758, 2021

      • Mr_Monkey
        No, I don't have any code for it I'm afraid
      • 2021-04-27 11728, 2021

      • Mr_Monkey
        That day I got lazy and did the page designs directly in an open browser page…
      • 2021-04-27 11700, 2021

      • _lucifer
        no worries, i'll modify the spotify page for now and try to put the youtube player and oauth up today for testing.
      • 2021-04-27 11757, 2021

      • _lucifer
        ruaok: i think that the redirect uris in syswiki might not work. for my local setup, i had to add the complete url including the path.
      • 2021-04-27 11705, 2021

      • _lucifer
        regarding the youtube api key
      • 2021-04-27 11739, 2021

      • ruaok
        update syswiki, plz
      • 2021-04-27 11727, 2021

      • _lucifer
        we'll have to update in google api console first.
      • 2021-04-27 11746, 2021

      • _lucifer
        the console will give a new configuration file after that.
      • 2021-04-27 11740, 2021

      • ruaok
        ok, logged in. what changes need to be made?
      • 2021-04-27 11722, 2021

      • _lucifer
        http://localhost/profile/music-services/youtube/c… this is what i am using for my local api key
      • 2021-04-27 11759, 2021

      • _lucifer
        if we are fine with the url structure, then adding `/profile/music-services/youtube/callback/` at the end of all urls should be enough
      • 2021-04-27 11718, 2021

      • ruaok
        including trailing slash?
      • 2021-04-27 11727, 2021

      • _lucifer
        yes
      • 2021-04-27 11752, 2021
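
The redirect URI change discussed above can be sketched as follows. The callback path and `http://localhost` come from the conversation; `https://beta.example.org` is a placeholder host, and the assumption is that the API console compares redirect URIs as exact strings, which is why the full path and trailing slash matter.

```python
from urllib.parse import urljoin

# Path taken from the conversation, including the trailing slash.
CALLBACK_PATH = "/profile/music-services/youtube/callback/"


def redirect_uri(base_url):
    """Build the full redirect URI to register for a given site root."""
    return urljoin(base_url, CALLBACK_PATH)


if __name__ == "__main__":
    # The second host is a placeholder, not a real deployment.
    for base in ("http://localhost", "https://beta.example.org"):
        print(redirect_uri(base))
```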

      • ruaok
      • 2021-04-27 11753, 2021

      • _lucifer
        perfect, thanks!
      • 2021-04-27 11703, 2021

      • _lucifer
        we'll need to add some redirect uris for spotify as well to test on beta, but it's not needed urgently; that can be done tomorrow.
      • 2021-04-27 11710, 2021

      • kyledecot joined the channel
      • 2021-04-27 11744, 2021

      • kyledecot
        Cross-posting this from #musicbrainz
      • 2021-04-27 11744, 2021

      • kyledecot
        Hello everyone! I'm building a website that will allow guitarists to upload Guitar Pro files–as part of this I want to use MusicBrainz to lookup additional metadata about the artist such as their canonical name, aliases, genres, etc.

I've looked over the schema for the DB but I'm still a bit unsure how to traverse it to get the information I
      • 2021-04-27 11745, 2021

      • kyledecot
        want. My thinking was that I would search for the "Work" by name and then get to the Artist from that. Using "Toxicity" as an example however you can see that the relationships do not include "System of a Down" 

https://beta.musicbrainz.org/work/ab2a78d4-b0ce-3…

It looks as though I would have to inspect the recordings
      • 2021-04-27 11745, 2021

      • kyledecot
        to somehow derive the "canonical" recording/artist but I'm unsure if my assumptions are correct or if there is another way to go about this.

Any help in regards to this would be greatly appreciated!
      • 2021-04-27 11709, 2021

      • ruaok
        _lucifer: syswiki updated.
      • 2021-04-27 11720, 2021

      • _lucifer
        thanks!
      • 2021-04-27 11726, 2021

      • ruaok
        reosarevok: can you please help kyledecot?
      • 2021-04-27 11720, 2021

      • _lucifer
        Mr_Monkey: in the mockups I see we do not show import details there. that's expected.
      • 2021-04-27 11737, 2021

      • _lucifer
        s/expected/intended?
      • 2021-04-27 11736, 2021

      • nelgin
        How can I determine if live indexing is turned on with the musicbrainz vm?
      • 2021-04-27 11738, 2021

      • nelgin
        and working
      • 2021-04-27 11749, 2021

      • _lucifer
        if you followed https://github.com/metabrainz/musicbrainz-docker#… live-indexing should probably be working, otherwise there
      • 2021-04-27 11752, 2021

      • _lucifer
        is a bug
      • 2021-04-27 11740, 2021

      • ruaok
        I got the MBID matcher running live against incoming listens:
      • 2021-04-27 11743, 2021

      • ruaok
        listens: exact 1785 high 20 med 154 low 55 no 873 err 0
      • 2021-04-27 11748, 2021

      • ruaok
        68% match rate. :)
      • 2021-04-27 11700, 2021
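
The 68% figure can be checked against the counts in the previous line. Which confidence levels count as a "match" is not stated in the log; the sketch below only shows that counting exact, high and med reproduces 68%, while also counting low gives about 70%.

```python
# Counts copied from the matcher output quoted above.
counts = {"exact": 1785, "high": 20, "med": 154, "low": 55, "no": 873, "err": 0}


def match_rate(counts, matched_levels):
    """Fraction of all listens whose match level is in matched_levels."""
    total = sum(counts.values())
    matched = sum(counts[level] for level in matched_levels)
    return matched / total


if __name__ == "__main__":
    # Assumption: "low" matches are not counted as matches.
    print(f"{match_rate(counts, ('exact', 'high', 'med')):.1%}")
    print(f"{match_rate(counts, ('exact', 'high', 'med', 'low')):.1%}")
```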

      • ruaok
        no trouble keeping up with incoming listens.
      • 2021-04-27 11702, 2021

      • reosarevok
        kyledecot: hi! Give me a moment if that's ok? I'm making dinner :)
      • 2021-04-27 11714, 2021

      • ruaok
        on thursday I'll add support for working through old listens.
      • 2021-04-27 11738, 2021

      • nelgin
        Well, my system spent from 2:17am to 4:06pm UTC loading the indexes and it's still not finding any results.
      • 2021-04-27 11755, 2021

      • reosarevok
        kyledecot: oh, I see you're getting help in #musicbrainz, will check for a mo
      • 2021-04-27 11701, 2021

      • reosarevok
        yvanzo: can you help nelgin ?
      • 2021-04-27 11733, 2021

      • reosarevok
        kyledecot: oh, no, I'm being dumb. Sorry. Ok, I'll quickly look, food is in the oven anyway
      • 2021-04-27 11734, 2021

      • reosarevok
        So
      • 2021-04-27 11758, 2021

      • reosarevok
        Yes, you can't trivially get from the work to a "canonical artist", because we don't really have those
      • 2021-04-27 11733, 2021

      • reosarevok
        (for example, many works don't really have those, such as classical, folk and even some jazz music)
      • 2021-04-27 11719, 2021

      • reosarevok
        For what you want, the most common case is going to be "find a recording, and pick the artist". But do you have any artist name to begin with?
      • 2021-04-27 11744, 2021

      • reosarevok
        Because if all you have is, say, the word "Dumb", there's no good way to know if you should be picking Nirvana or Garbage (or someone else)
      • 2021-04-27 11752, 2021

      • kyledecot
        reosarevok Yeah I have the artist name to begin with (it's encoded into the Guitar Pro file). I thought I should start at the "work" to also get the canonical title (in case they submit something like "toXiciTY" or something).
      • 2021-04-27 11719, 2021

      • reosarevok
        Well, the problem is sometimes we won't even have works, especially for popular music
      • 2021-04-27 11743, 2021

      • reosarevok
        Those are not added automatically, they're added by users mostly when they need to either link covers or add a composer / lyricist
      • 2021-04-27 11720, 2021

      • reosarevok
      • 2021-04-27 11726, 2021

      • kyledecot
        Oh I see–so would it be safer to just try and look up the artist directly and then attempt to get the work separately (basically split this into two problems)?
      • 2021-04-27 11751, 2021

      • reosarevok
        I would suggest trying to find a recording using title + artist
      • 2021-04-27 11707, 2021

      • reosarevok
        And then you can check whether the recording is linked to any works if you want info from them :)
      • 2021-04-27 11702, 2021

      • reosarevok
        (if it is, you could use the work title as the standard title, while if not, you could just default to the recording title, which might already be an improvement - for example, a search for toXiciTY would probably give you a recording named Toxicity anyway :) )
      • 2021-04-27 11736, 2021
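
The approach suggested above (search for a recording by title + artist, then fall back from work title to recording title) can be sketched against the MusicBrainz web service. This is a rough illustration, not an official client: the helper names are hypothetical, no request is actually sent, and the shape of the `relations` list in the lookup response is an assumption to verify against a real response. Real requests should also send a descriptive User-Agent.

```python
from urllib.parse import urlencode

WS_ROOT = "https://musicbrainz.org/ws/2"


def _escape(text):
    """Escape backslashes and double quotes for a quoted search phrase."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def recording_search_url(title, artist, limit=5):
    """URL for a recording search restricted by title and artist name."""
    query = f'recording:"{_escape(title)}" AND artist:"{_escape(artist)}"'
    params = {"query": query, "fmt": "json", "limit": limit}
    return f"{WS_ROOT}/recording?" + urlencode(params)


def recording_lookup_url(mbid):
    """URL for a recording lookup that also returns linked works."""
    return f"{WS_ROOT}/recording/{mbid}?" + urlencode({"inc": "work-rels", "fmt": "json"})


def standard_title(recording):
    """Prefer the title of a linked work, else fall back to the recording title."""
    for relation in recording.get("relations", []):
        if relation.get("target-type") == "work" and "work" in relation:
            return relation["work"]["title"]
    return recording["title"]


if __name__ == "__main__":
    print(recording_search_url("toXiciTY", "System of a Down"))
```

Starting from the recording avoids depending on a work existing at all, which matters because works are only added when users need them.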

      • nelgin
        I rebooted my vm and restarted the docker and now it appears to be working.
      • 2021-04-27 11730, 2021

      • kyledecot
        reosarevok that all makes sense–my main blocker was just knowing which resource / entity to start with. I'll probably have additional questions as I continue to develop this but you've done a great job in giving me a direction to head in. Thanks for the quick response / building such an awesome / open platform!
      • 2021-04-27 11740, 2021

      • reosarevok
        kyledecot: neat! Don't hesitate to come back and ask more
      • 2021-04-27 11754, 2021

      • Mr_Monkey
        _lucifer: missing import details is not intended, no. We'll have to find some space for it
      • 2021-04-27 11719, 2021

      • _lucifer
        Mr_Monkey: should we put it on another page?
      • 2021-04-27 11716, 2021

      • Mr_Monkey
        I think it makes sense to keep it with each music service in that page. Maybe an "Import details" accordion hidden by default?
      • 2021-04-27 11730, 2021

      • _lucifer
        yeah that could work
      • 2021-04-27 11735, 2021

      • yvanzo
        nelgin: Solr works asynchronously. It takes extra time for search indexes to be readily available.
      • 2021-04-27 11749, 2021

      • yvanzo
        I’m working on improving stuff around BASE_FTP_URL variable you mentioned yesterday.
      • 2021-04-27 11747, 2021

      • yvanzo
        When live indexing is working, reindex messages are queued. So you can check either logs of 'mq' or 'search' services.
      • 2021-04-27 11705, 2021

      • kyledecot has quit
      • 2021-04-27 11709, 2021

      • yvanzo
        'indexer' logs can also be useful: it indicates when reindex messages are processed with timestamp.
      • 2021-04-27 11750, 2021

      • yvanzo
        sudo docker-compose logs --tail 10 --timestamps indexer
      • 2021-04-27 11731, 2021

      • nelgin
        yvanzo ah, did I screw things up?
      • 2021-04-27 11732, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:25:02.607307633Z This probably means the server terminated abnormally
      • 2021-04-27 11732, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:25:02.607310064Z before or while processing the request.
      • 2021-04-27 11732, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:25:02.607315104Z [SQL: 'SELECT recording_1.id AS recording_1_id \nFROM musicbrainz.recording AS recording_1 JOIN musicbrainz.artist_credit ON musicbrainz.artist_credit.id = recording_1.artist_credit \nWHERE musicbrainz.artist_credit.id = %(id_1)s'] [parameters: {'id_1': 15856}]
      • 2021-04-27 11720, 2021

      • nelgin
        Hm, if I go back further
      • 2021-04-27 11722, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:24:40.431919098Z 2021-04-27 16:24:40,431: Requeuing 100 pending messages.
      • 2021-04-27 11722, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:24:40.441542857Z 2021-04-27 16:24:40,441: 100 messages requeued.
      • 2021-04-27 11722, 2021

      • nelgin
        indexer_1 | 2021-04-27T16:24:57.994325142Z 2021-04-27 16:24:57,993: Error encountered while processing messages: Post to Solr failed. Requeueing all pending messages for retry.
      • 2021-04-27 11742, 2021
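
Long indexer logs like the one quoted above are easier to read as totals. A minimal sketch, assuming the message wording stays as it appears in this log; `summarize` is a hypothetical helper, not part of the indexer or musicbrainz-docker.

```python
import re

# Patterns based only on the log lines quoted in this conversation.
PROCESSED = re.compile(r"Successfully processed (\d+) messages")
REQUEUED = re.compile(r"(\d+) messages requeued")
ERROR = re.compile(r"Error encountered while processing messages")


def summarize(lines):
    """Count processed and requeued messages and error events in indexer log lines."""
    summary = {"processed": 0, "requeued": 0, "errors": 0}
    for line in lines:
        match = PROCESSED.search(line)
        if match:
            summary["processed"] += int(match.group(1))
        match = REQUEUED.search(line)
        if match:
            summary["requeued"] += int(match.group(1))
        if ERROR.search(line):
            summary["errors"] += 1
    return summary


if __name__ == "__main__":
    import sys

    # Example: sudo docker-compose logs --timestamps indexer | python summarize.py
    print(summarize(sys.stdin))
```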

      • nelgin
        I'm going to pastebin this entire log - there's all sorts of stuff in here.
      • 2021-04-27 11751, 2021

      • yvanzo
        Please pastebin the output of this command too: sudo docker-compose exec mq rabbitmqadmin -u sir -p sir -V /search-index-rebuilder list queues
      • 2021-04-27 11745, 2021

      • nelgin
        delete, failed, index, and retry all 0
      • 2021-04-27 11737, 2021

      • okno79 joined the channel
      • 2021-04-27 11710, 2021

      • nelgin
        Too big to pastebin so it's here on my server https://wibble.sysadmininc.com/log.txt
      • 2021-04-27 11707, 2021

      • nelgin
        I tried to download and import the indexes twice but had no joy, per previous comments, so ended up just rebuilding them.
      • 2021-04-27 11710, 2021

      • yvanzo
        Never seen "this IndexWriter is closed" message before.
      • 2021-04-27 11749, 2021

      • yvanzo
        It seems Solr might have run out of resources.
      • 2021-04-27 11757, 2021

      • nelgin shrugs
      • 2021-04-27 11725, 2021

      • yvanzo
        Next time your replication cron task is running, there should be new messages in 'indexer' logs again.
      • 2021-04-27 11754, 2021

      • nelgin
        I can run it now if you like, it won't be for another 14 hours otherwise
      • 2021-04-27 11702, 2021

      • nelgin
        Well, I should say I can run it manually
      • 2021-04-27 11743, 2021

      • yvanzo
        Okay
      • 2021-04-27 11752, 2021

      • nelgin
        Ok, replication running
      • 2021-04-27 11726, 2021

      • nelgin
        search.index | 1722
      • 2021-04-27 11751, 2021

      • nelgin
        indexer_1 | 2021-04-27T17:37:50.642120375Z 2021-04-27 17:37:50,641: Successfully processed 100 messages
      • 2021-04-27 11755, 2021

      • nelgin
        So it looks like it's working
      • 2021-04-27 11725, 2021

      • nelgin
        Though I just got this in my replication log
      • 2021-04-27 11725, 2021

      • nelgin
        WARNING: amqp could not commit tx mode on broker 1, reply_type=2, library_errno=4
      • 2021-04-27 11739, 2021

      • CatQuest
        :|
      • 2021-04-27 11732, 2021

      • MRiddickW has quit
      • 2021-04-27 11738, 2021

      • yvanzo
        Messages are queued and processed, so it’s working indeed.
      • 2021-04-27 11754, 2021

      • yvanzo
        The warning is about https://rabbitmq-c.docsforge.com/master/api/amqp_… but that requires more investigation.
      • 2021-04-27 11715, 2021

      • nelgin
        It seems to happen a few seconds after I run sudo docker-compose exec mq rabbitmqadmin -u sir -p sir -V /search-index-rebuilder list queues
      • 2021-04-27 11723, 2021

      • nelgin
        I just tried it again and got the same thing.
      • 2021-04-27 11756, 2021

      • Etua joined the channel
      • 2021-04-27 11752, 2021

      • Etua has quit
      • 2021-04-27 11712, 2021

      • yvanzo
        Ok, just ignore this warning then.