#metabrainz

/

      • yvanzo
        Epic won't disappear from other projects though, it is a known thing.
      • 2020-01-27 02706, 2020

      • CatQuest
        sure!
      • 2020-01-27 02716, 2020

      • CatQuest
        I was talking solely about inst
      • 2020-01-27 02734, 2020

      • CatQuest
        I am *curious* about how the epic ticket type functions
      • 2020-01-27 02746, 2020

      • CatQuest
        that was it
      • 2020-01-27 02748, 2020

      • yvanzo
        So, instead of "Epic" or "Mini-version", you do want "Research" (or some other name?) with labels field?
      • 2020-01-27 02755, 2020

      • CatQuest
        ... no
      • 2020-01-27 02757, 2020

      • CatQuest
        i want
      • 2020-01-27 02719, 2020

      • CatQuest
        I'd like "Mini-version" but perhaps with a labels field for it?
      • 2020-01-27 02724, 2020

      • CatQuest
        liek the "epic" ting has
      • 2020-01-27 02731, 2020

      • shivam-kapila
        iliekcomputers: I was thinking the favourite artists and releases can also be shown graphically like a circular graph. What do you think about it
      • 2020-01-27 02702, 2020

      • yvanzo
        Epic is what iliekcomputers told you: an epic goal that usual depends on several other tickets.
      • 2020-01-27 02722, 2020

      • CatQuest
        yea yea yea sure but it needs ot be worked with to be understood
      • 2020-01-27 02738, 2020

      • yvanzo
        But what “Mini-version” means?
      • 2020-01-27 02741, 2020

      • CatQuest
        I mean. "Mini-version" is something I made reo make for me. idk why it ended up no other projects too
      • 2020-01-27 02759, 2020

      • iliekcomputers
        shivam-kapila: circular graph?
      • 2020-01-27 02721, 2020

      • CatQuest
        it was just a "min-version" because (at the time) I didn't know what epic did. nore did i have any editing for it
      • 2020-01-27 02722, 2020

      • yvanzo
        It ended up no other project because nobody understand what it is, there is no description attached to it.
      • 2020-01-27 02730, 2020

      • CatQuest
        ¯\_(ツ)_/¯
      • 2020-01-27 02735, 2020

      • CatQuest
        i'm sorry abotu that
      • 2020-01-27 02751, 2020

      • yvanzo
        No problem, so mini-version = epic?
      • 2020-01-27 02700, 2020

      • CatQuest
        basically "Mini-version" is a ticket that collects loads of related tickets
      • 2020-01-27 02717, 2020

      • CatQuest
        that's waht I was wodnerin if maybe was a good idea :D when i heard tis epic description
      • 2020-01-27 02718, 2020

      • CatQuest
        !
      • 2020-01-27 02705, 2020

      • CatQuest
        so if "Mini-version" could have the *abilities* that epic has.. lables
      • 2020-01-27 02710, 2020

      • CatQuest
        maybe
      • 2020-01-27 02756, 2020

      • CatQuest
        anyway i cant really do anything epic on isnt right now. i don't have that label field
      • 2020-01-27 02701, 2020

      • CatQuest
        inst*
      • 2020-01-27 02712, 2020

      • CatQuest
      • 2020-01-27 02713, 2020

      • BrainzBot
        INST-392: Veena
      • 2020-01-27 02731, 2020

      • CatQuest
        I mean if https://tickets.metabrainz.org/browse/INST-392 is a problematic name
      • 2020-01-27 02735, 2020

      • CatQuest
        ehr
      • 2020-01-27 02745, 2020

      • CatQuest
        if "Mini-version" is a problematic name..
      • 2020-01-27 02712, 2020

      • CatQuest needs breakfast first
      • 2020-01-27 02718, 2020

      • shivam-kapila
        iliekcomputers: something like this. I am not sure can it be called a pie chart. https://usercontent.irccloud-cdn.com/file/SAqvBFD…
      • 2020-01-27 02708, 2020

      • prabal joined the channel
      • 2020-01-27 02737, 2020

      • sarthak_jain joined the channel
      • 2020-01-27 02743, 2020

      • Nyanko-sensei has quit
      • 2020-01-27 02718, 2020

      • BrainzGit
        [listenbrainz-server] dependabot-preview[bot] opened pull request #711 (master…dependabot/pip/py4j-0.10.9): Bump py4j from 0.10.8.1 to 0.10.9 https://github.com/metabrainz/listenbrainz-server…
      • 2020-01-27 02714, 2020

      • MajorLurker has quit
      • 2020-01-27 02700, 2020

      • Nyanko-sensei joined the channel
      • 2020-01-27 02702, 2020

      • sarthak_jain
        Hey pristine__. For this issue https://tickets.metabrainz.org/projects/LB/issues…, what do we exactly want to change in this file https://github.com/metabrainz/listenbrainz-server…
      • 2020-01-27 02703, 2020

      • BrainzBot
        LB-473: ListenBrainz-Labs: Use PySpark SQL Module in place of SQL queries.
      • 2020-01-27 02754, 2020

      • MajorLurker joined the channel
      • 2020-01-27 02706, 2020

      • sarthak_jain
        We wish to return a dictionary right, from each function?
      • 2020-01-27 02755, 2020

      • pristine__
        Let us say we have a dataframe = df
      • 2020-01-27 02714, 2020

      • pristine__
        We want to select col = user_id from this
      • 2020-01-27 02727, 2020

      • pristine__
        There are two ways for that
      • 2020-01-27 02749, 2020

      • pristine__
        Register the dataframe and then write sql query
      • 2020-01-27 02705, 2020

      • pristine__
        `select user_id from df`
      • 2020-01-27 02724, 2020

      • pristine__
        We are following this scheme in utils.py
      • 2020-01-27 02751, 2020

      • CatQuest
        Mr_Monkey: why exactly does all entities have 2 "language" selectors?
      • 2020-01-27 02751, 2020

      • pristine__
        Now the other way out is don't register the dataframe but use pyspark sql module
      • 2020-01-27 02741, 2020

      • pristine__
        Like this : `df.select("user_id")`
      • 2020-01-27 02706, 2020

      • pristine__
        We want to update all the sql queries in utils.py to use pyspark module.
      • 2020-01-27 02740, 2020

      • pristine__
        You will have a read the pyspark sql doc a bit. I have provided the link in the task.
      • 2020-01-27 02757, 2020

      • sarthak_jain
        Have we used this pyspark.sql module in any file in the listenbrainz_spark?
      • 2020-01-27 02743, 2020

      • pristine__
        listenbrainz_spark/recommendations/create_dataframe.py
      • 2020-01-27 02751, 2020

      • pristine__
        But I don't think referring to the file is the best way. You should read the doc once, it is really good :)
      • 2020-01-27 02703, 2020

      • sarthak_jain
        Yes sure. I will use the doc.
      • 2020-01-27 02725, 2020

      • sarthak_jain
        One more doubt!
      • 2020-01-27 02751, 2020

      • sarthak_jain
        Since we are passing a table to the function, do I need to now pass dataframe as a parameter?
      • 2020-01-27 02722, 2020

      • sarthak_jain
        or maybe just use this `sqlContext.table("table")` to convert it to a dataframe ?
      • 2020-01-27 02747, 2020

      • Guest84633 is now known as c1e0
      • 2020-01-27 02710, 2020

      • sarthak_jain
        Also pristine__, I dont see any queries listenbrainz_spark/recommendations/create_dataframe.py in this file. '=D
      • 2020-01-27 02708, 2020

      • pristine__
        This script is calling other modules where queries are actually written
      • 2020-01-27 02718, 2020

      • pristine__
        You have a lot of time. Please read the code.
      • 2020-01-27 02736, 2020

      • pristine__
        Wait. I will give you the line no. A sec
      • 2020-01-27 02747, 2020

      • pristine__
      • 2020-01-27 02713, 2020

      • pristine__
        > or maybe just use this `sqlContext.table("table")` to convert it to a dataframe ?
      • 2020-01-27 02729, 2020

      • pristine__
        I have no idea what this line of code is?
      • 2020-01-27 02742, 2020

      • pristine__
        > Since we are passing a table to the function, do I need to now pass dataframe as a parameter?
      • 2020-01-27 02745, 2020

      • pristine__
        Which function? Take your time. I will review when you make a patch :)
      • 2020-01-27 02705, 2020

      • sarthak_jain
        This piece will register the table as a df so that I can use df operations
      • 2020-01-27 02724, 2020

      • sarthak_jain
        Okay sure
      • 2020-01-27 02751, 2020

      • pristine__
        No
      • 2020-01-27 02718, 2020

      • pristine__
        That's the whole point. We don't want to register tables, that is why we are directly using dataframe
      • 2020-01-27 02728, 2020

      • pristine__
        Did you follow the above examples?
      • 2020-01-27 02708, 2020

      • pristine__
        Tables are not registered as dataframe, we register a dataframe as a view/table to query it.
      • 2020-01-27 02738, 2020

      • sarthak_jain has quit
      • 2020-01-27 02741, 2020

      • Coder44 has quit
      • 2020-01-27 02715, 2020

      • sarthak_jain joined the channel
      • 2020-01-27 02724, 2020

      • sarthak_jain
      • 2020-01-27 02732, 2020

      • sarthak_jain
        yes exactly
      • 2020-01-27 02720, 2020

      • sarthak_jain
        What I am saying is, to perform df operations, I need to paas df as a parameter here, right?
      • 2020-01-27 02744, 2020

      • sarthak_jain has quit
      • 2020-01-27 02703, 2020

      • pristine__
        Obviously!
      • 2020-01-27 02735, 2020

      • sarthak_jain joined the channel
      • 2020-01-27 02719, 2020

      • sarthak_jain
        Yup cool 😅
      • 2020-01-27 02737, 2020

      • sarthak_jain
        I will get started then
      • 2020-01-27 02744, 2020

      • sarthak_jain has quit
      • 2020-01-27 02755, 2020

      • pristine__
        Purpose of utils.py is to hold all the queries together at one place.
      • 2020-01-27 02700, 2020

      • pristine__
        Yeah. Nice
      • 2020-01-27 02724, 2020

      • shivam-kapila
        pristine__: iliekcomputers: For testing the stats module how much data should I download from the ftp server dumps?
      • 2020-01-27 02748, 2020

      • pristine__
        Hey
      • 2020-01-27 02723, 2020

      • pristine__
        So we are in that process but I can suggest you a workaround
      • 2020-01-27 02707, 2020

      • pristine__
        Checkout to `all-changes-mapping` branch
      • 2020-01-27 02725, 2020

      • pristine__
      • 2020-01-27 02702, 2020

      • pristine__
        replace this path to `http://ftp.musicbrainz.org/pub/musicbrainz/listenbrainz/incremental/`
      • 2020-01-27 02743, 2020

      • pristine__
      • 2020-01-27 02710, 2020

      • pristine__
        replace this to `listenbrainz-listens-dump-99-20191107-000002-spark-incremental.tar.xz`
      • 2020-01-27 02732, 2020

      • yvanzo
        CatQuest: From what I can see on Veena INST-392, you should be able to transition, link, and label it. Am I missing something?
      • 2020-01-27 02733, 2020

      • BrainzBot
      • 2020-01-27 02709, 2020

      • pristine__
      • 2020-01-27 02715, 2020

      • pristine__
        and run `/usr/local/spark/bin/spark-submit spark_manage.py upload_listens`
      • 2020-01-27 02732, 2020

      • CatQuest
        yvanzo: sure..
      • 2020-01-27 02745, 2020

      • CatQuest
        but I can't use a "epic link"
      • 2020-01-27 02719, 2020

      • pristine__
        The configs are for full dumps which are not ideal for local env. We are in process to make small dumps for local env which will be done soon. Till then you can use incremental dumps to for stats
      • 2020-01-27 02720, 2020

      • shivam-kapila
        pristine: Got it. Thanks :D
      • 2020-01-27 02703, 2020

      • pristine__
        > and run `/usr/local/spark/bin/spark-submit spark_manage.py upload_listens
      • 2020-01-27 02734, 2020

      • pristine__
        this should be run inside listenbrainz_playground_1 container.
      • 2020-01-27 02736, 2020

      • pristine__
        shivam-kapila:
      • 2020-01-27 02734, 2020

      • shivam-kapila
        Okay.
      • 2020-01-27 02711, 2020

      • pristine__
        `docker exec -it listenbrainzspark_playground_1 bash`
      • 2020-01-27 02708, 2020

      • shivam-kapila
        This is nice. No download required. Wow :p
      • 2020-01-27 02703, 2020

      • pristine__
        We are trying to make it better. In time people will have to write just one command to fill in data on their local machine, and in case you want to test something on big machines, a single command to fill in big data.
      • 2020-01-27 02708, 2020

      • pristine__
        thank you :)
      • 2020-01-27 02753, 2020

      • antlarr has quit
      • 2020-01-27 02724, 2020

      • yvanzo
        CatQuest: thanks, got it :)
      • 2020-01-27 02711, 2020

      • antlarr joined the channel
      • 2020-01-27 02734, 2020

      • amCap1712
        iliekcomputers: worked on the feature. it is working well on the linux build
      • 2020-01-27 02700, 2020

      • c1e0_ joined the channel
      • 2020-01-27 02737, 2020

      • c1e0 has quit
      • 2020-01-27 02733, 2020

      • iliekcomputers
        pristine__: can we document the steps you listed somewhere (for now)
      • 2020-01-27 02700, 2020

      • pristine__
        Readme?
      • 2020-01-27 02755, 2020

      • pristine__
        shivam-kapila: worked for you?
      • 2020-01-27 02742, 2020

      • iliekcomputers
        pristine__: we have a HACKING.md file that seems perfect for stuff like this
      • 2020-01-27 02757, 2020

      • iliekcomputers
      • 2020-01-27 02711, 2020

      • pristine__
        Cool. Thanks.
      • 2020-01-27 02750, 2020

      • prabal has quit
      • 2020-01-27 02733, 2020

      • travis-ci joined the channel
      • 2020-01-27 02733, 2020

      • travis-ci
        Project bookbrainz-data-js build #1366: passed in 1 min 49 sec: https://travis-ci.org/bookbrainz/bookbrainz-data-…
      • 2020-01-27 02733, 2020

      • travis-ci has left the channel
      • 2020-01-27 02743, 2020

      • Lotheric has quit
      • 2020-01-27 02712, 2020

      • iliekcomputers
        Dumps are getting created again, BTW.
      • 2020-01-27 02730, 2020

      • sarthak_jain joined the channel
      • 2020-01-27 02706, 2020

      • Lotheric joined the channel
      • 2020-01-27 02729, 2020

      • shivam-kapila has quit
      • 2020-01-27 02757, 2020

      • yvanzo
        CatQuest: should work now, but the migration is going to be cumbersome.
      • 2020-01-27 02754, 2020

      • rahul24 joined the channel
      • 2020-01-27 02748, 2020

      • rahul24 has quit
      • 2020-01-27 02756, 2020

      • rahul24 joined the channel
      • 2020-01-27 02725, 2020

      • rahul24 has quit
      • 2020-01-27 02703, 2020

      • rahul24 joined the channel
      • 2020-01-27 02723, 2020

      • rahul24 has quit
      • 2020-01-27 02729, 2020

      • rahul24 joined the channel
      • 2020-01-27 02721, 2020

      • prabal joined the channel
      • 2020-01-27 02714, 2020

      • sarthak_jain has quit
      • 2020-01-27 02710, 2020

      • rahul24 has quit
      • 2020-01-27 02727, 2020

      • reosarevok
        yvanzo: can you take a look at https://github.com/metabrainz/musicbrainz-server/… when you have some time?