#metabrainz

      • Toasty has quit
      • 2021-06-03 15458, 2021

      • akashgp09_ joined the channel
      • 2021-06-03 15418, 2021

      • lucifer
      • 2021-06-03 15412, 2021

      • shivam-kapila
        lucifer: SO got acquired recently so who knows
      • 2021-06-03 15438, 2021

      • lucifer
        yup, i know. that was the context of the tweet :)
      • 2021-06-03 15423, 2021

      • shivam-kapila
        Ah. Didn't see that coming
      • 2021-06-03 15443, 2021

      • lucifer
        alastairp: i worked ahead on the download dataset feature and added chunking. worked nice for ~5K recordings dataset. now trying it on a ~55K recording dataset.
      • 2021-06-03 15452, 2021

      • ruaok
        moooin!
      • 2021-06-03 15404, 2021

      • ruaok chuckles at expertsexchange
      • 2021-06-03 15406, 2021

      • lucifer
        it downloaded 2 GB and then my internet connection gave up. but this probably means that the feature is working because we load the entire zip first and then send it for download.
      • 2021-06-03 15406, 2021

      • ruaok
        that goof was made by my former comp sci professor. and then one of my first bosses was the guy who leaked the DeCSS key. I've had an interesting professional life. :)
      • 2021-06-03 15430, 2021

      • lucifer
        lol 😆
      • 2021-06-03 15449, 2021

      • ruaok
        zas: question about grafana graphs/influx queries for when you have a moment.
      • 2021-06-03 15421, 2021

      • zas
        I'm only at first coffee, but please ask ;)
      • 2021-06-03 15417, 2021

      • ruaok
        oy. #dangerzone
      • 2021-06-03 15426, 2021

      • ruaok
        my first coffee is still too hot to drink.
      • 2021-06-03 15444, 2021

      • ruaok
      • 2021-06-03 15417, 2021

      • ruaok
        I want to turn that into a "per minute" graph, not just a sum()
      • 2021-06-03 15444, 2021

      • ruaok
        I looked at other graphs you made and copied the query, but never got it to work.
      • 2021-06-03 15454, 2021

      • ruaok
        and then never saved any of those changes, either.
      • 2021-06-03 15449, 2021

      • zas
        on top graph, that's mean not sum btw
      • 2021-06-03 15407, 2021

      • ruaok
        yes, whatever it is right now, I don't want it. :)
      • 2021-06-03 15415, 2021

      • ruaok
        it's useless overall.
      • 2021-06-03 15426, 2021

      • texke has quit
      • 2021-06-03 15447, 2021

      • zas
        ok, the metric is an absolute number of listens, right?
      • 2021-06-03 15409, 2021

      • zas
        try to add non_negative_derivative (1m) from Transformations
      • 2021-06-03 15420, 2021

      • ruaok
        yes
      • 2021-06-03 15432, 2021

      • ruaok
        well, listens added in interval.
      • 2021-06-03 15443, 2021

      • lucifer
        where are the queries located? i wanted to look at them out of curiosity.
      • 2021-06-03 15436, 2021

      • ruaok
        not sure you can see them.
      • 2021-06-03 15400, 2021

      • lucifer
        oh! ok.
      • 2021-06-03 15445, 2021

      • ruaok
        zas: transformations is the tab to the right of query, right?
      • 2021-06-03 15402, 2021

      • zas
        nope, in the query itself
      • 2021-06-03 15410, 2021
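
Note: for readers unfamiliar with the transformation zas mentions, here is a minimal Python sketch of what a non-negative derivative with a one-minute unit computes. It assumes the metric is a cumulative counter sampled roughly once a minute; the function name, the sample data, and the counter reset are illustrative assumptions, not the actual dashboard query.

```python
from datetime import datetime, timedelta


def non_negative_derivative(points, unit=timedelta(minutes=1)):
    """Rate of change between consecutive (timestamp, value) points, per `unit`.

    Negative rates (for example after a counter reset) are dropped; zero and
    positive rates are kept.
    """
    rates = []
    for (t0, v0), (t1, v1) in zip(points, points[1:]):
        elapsed_units = (t1 - t0) / unit  # timedelta / timedelta gives a float
        if elapsed_units <= 0:
            continue  # skip duplicate or out-of-order samples
        rate = (v1 - v0) / elapsed_units
        if rate >= 0:
            rates.append((t1, rate))
    return rates


# Hypothetical samples: one missing minute and one counter reset.
start = datetime(2021, 6, 3, 9, 0)
samples = [
    (start, 1000),
    (start + timedelta(minutes=1), 1030),
    (start + timedelta(minutes=3), 1090),  # the 09:02 sample is missing
    (start + timedelta(minutes=4), 20),    # counter was reset
    (start + timedelta(minutes=5), 45),
]
per_minute = non_negative_derivative(samples)
```

Because the rate is normalised by elapsed time, a gap of two minutes still yields a per-minute figure rather than a spike, and the reset is ignored instead of showing up as a large negative value.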

      • ruaok
        ah, no wonder I can't find it.
      • 2021-06-03 15423, 2021

      • ruaok
        do you have a dashboard that shows that in action?
      • 2021-06-03 15449, 2021

      • zas
      • 2021-06-03 15410, 2021

      • ruaok
        got it.
      • 2021-06-03 15418, 2021

      • ruaok
        it's all a bit black magic, isn't it?
      • 2021-06-03 15423, 2021

      • zas
        ofc ;)
      • 2021-06-03 15436, 2021

      • ruaok
        ok, works now.
      • 2021-06-03 15438, 2021

      • ruaok
      • 2021-06-03 15442, 2021

      • ruaok
        shivam-kapila: ^^
      • 2021-06-03 15455, 2021

      • param
        that's very cool
      • 2021-06-03 15409, 2021

      • zas
        I see you have missing data, are they submitted every minute?
      • 2021-06-03 15416, 2021

      • lucifer
        yup, very nice :D
      • 2021-06-03 15451, 2021

      • ruaok
        should be. all graphs have the same missing data points. not sure why.
      • 2021-06-03 15450, 2021

      • ruaok
        for smoothing do you normally use moving average, zas? how many seconds?
      • 2021-06-03 15432, 2021

      • lucifer
        the interval drop down on the top right is not working for me. does it work for you?
      • 2021-06-03 15432, 2021

      • ruaok
        lucifer: the one with the arrows is how frequently it should update. the one with the clock is the range to see. which one are you referring to?
      • 2021-06-03 15444, 2021

      • lucifer
        the one with the arrow.
      • 2021-06-03 15404, 2021

      • ruaok
        does it not auto-update for you then?
      • 2021-06-03 15425, 2021

      • lucifer
        no, it's not auto-updating either. looks like another missing minute.
      • 2021-06-03 15432, 2021

      • zas
        mov avg uses a number of values, so if you do mov avg(5) and you have data every min, it will be over 5 mins
      • 2021-06-03 15414, 2021
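
Note: zas's point that a moving average is defined over a number of values, not a duration, can be sketched as follows. This is an illustrative Python version with made-up per-minute figures, not the query used on the dashboard.

```python
from collections import deque


def moving_average(values, window):
    """Average of the last `window` values, emitted once the window is full."""
    if window < 1:
        raise ValueError("window must be at least 1")
    recent = deque(maxlen=window)  # old values fall out automatically
    averages = []
    for value in values:
        recent.append(value)
        if len(recent) == window:
            averages.append(sum(recent) / window)
    return averages


# Hypothetical listens-per-minute values with a single spike of 90.
listens_per_minute = [30, 32, 28, 90, 31, 29, 30]
smoothed = moving_average(listens_per_minute, 5)
```

With one value per minute, a window of 5 spans five minutes. The spike of 90 is flattened to roughly 42 in every window that contains it, which is the over-smoothing risk raised later in the conversation.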

      • ruaok
        ah, values. I see.
      • 2021-06-03 15430, 2021

      • ruaok
        lucifer: sometimes it helps to close the tab and start over in a new one.
      • 2021-06-03 15448, 2021

      • alastairp
        lucifer: cool, that's great that bulk download seems to work. Can I try it myself (maybe I have a faster connection)
      • 2021-06-03 15403, 2021

      • zas
        don't over smooth, as it will hide problems ;)
      • 2021-06-03 15414, 2021

      • lucifer
        ruaok: right sorry, was network issue on my end. works now.
      • 2021-06-03 15415, 2021

      • ruaok
        what value do you normally use?
      • 2021-06-03 15422, 2021

      • alastairp
        but also, perhaps we could put a low limit (5k? 10k?) above which you can't download
      • 2021-06-03 15441, 2021

      • lucifer
        alastairp: yes. you can try it out, it's up on similarity.ab
      • 2021-06-03 15453, 2021

      • zas
        I don't smooth most of the time, or just a little, so 5 or 10 max
      • 2021-06-03 15445, 2021

      • ruaok moves it back to 5
      • 2021-06-03 15446, 2021

      • zas
        another approach
      • 2021-06-03 15446, 2021

      • lucifer
        alastairp: i think it makes sense to put a limit, currently i have increased nginx timeout limits to avoid gateway timeouts but not sure we want to do it in production.
      • 2021-06-03 15452, 2021

      • zas
        use group by time
      • 2021-06-03 15400, 2021

      • zas
        let me find an example
      • 2021-06-03 15435, 2021

      • ruaok
        i see it.
      • 2021-06-03 15440, 2021

      • zas
      • 2021-06-03 15448, 2021

      • zas
        check the select box interval
      • 2021-06-03 15411, 2021

      • alastairp
        lucifer: yeah, exactly. for now we want to try to avoid that timeout. what did you increase it to?
      • 2021-06-03 15415, 2021

      • zas
        see variables definition in dashboard config
      • 2021-06-03 15431, 2021

      • zas
        and look at group by time() in queries
      • 2021-06-03 15445, 2021

      • zas
        imho that's a better way to smooth out the data
      • 2021-06-03 15447, 2021

      • alastairp
        and I guess it's also related to user download speed, not just how quickly we can select and compress it
      • 2021-06-03 15414, 2021

      • lucifer
        alastairp: 1d :p, i think but a ~10 min timeout would work as well.
      • 2021-06-03 15424, 2021

      • alastairp
        most datasets have very few items anyway (less than 1000, maybe less than 100 too).
      • 2021-06-03 15429, 2021

      • alastairp
        ah, I was thinking 60s
      • 2021-06-03 15403, 2021

      • zas
        by default it is on "auto", which basically uses an interval based on current time window displayed
      • 2021-06-03 15406, 2021
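
Note: the "group by time" approach zas prefers for smoothing can be illustrated with a short Python sketch that averages points into fixed-width buckets. The function name and sample data are assumptions for illustration; on the dashboard the bucket width comes from the interval selector rather than a function argument.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone


def group_by_time(points, interval):
    """Average (timestamp, value) points into fixed-width time buckets."""
    width = interval.total_seconds()
    buckets = defaultdict(list)
    for ts, value in points:
        # Round the timestamp down to the start of its bucket.
        bucket_start = ts.timestamp() // width * width
        buckets[bucket_start].append(value)
    return [
        (datetime.fromtimestamp(bucket, tz=timezone.utc), sum(vals) / len(vals))
        for bucket, vals in sorted(buckets.items())
    ]


# Hypothetical per-minute values starting at 09:00 UTC.
first = datetime(2021, 6, 3, 9, 0, tzinfo=timezone.utc)
values = [30, 32, 28, 90, 31, 29, 30, 31, 30, 30]
series = [(first + timedelta(minutes=i), v) for i, v in enumerate(values)]
five_minute_means = group_by_time(series, timedelta(minutes=5))
```

Unlike a moving average, each input point contributes to exactly one bucket, so widening the interval reduces the number of plotted points instead of smearing a spike across neighbouring ones.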

      • alastairp
        zas: what is connection timeout on openresty gateways?
      • 2021-06-03 15415, 2021

      • lucifer
        60s is the default, it timed out on the 5k recordings test.
      • 2021-06-03 15436, 2021

      • lucifer
        so i increased it arbitrarily to be able to test the download
      • 2021-06-03 15445, 2021

      • ruaok
        zas: the name in the example you showed is "interva" not "interval". intentional?
      • 2021-06-03 15401, 2021

      • zas
        yes
      • 2021-06-03 15409, 2021

      • ruaok
        dark. magic.
      • 2021-06-03 15416, 2021

      • alastairp
        as we discussed this idea of background processing, we should just release this current feature with a low limit, and use background processing in the future
      • 2021-06-03 15432, 2021

      • lucifer
        yup agreed.
      • 2021-06-03 15416, 2021

      • lucifer
      • 2021-06-03 15426, 2021

      • lucifer
        ruaok: ^ that's why data is missing.
      • 2021-06-03 15427, 2021

      • zas
        alastairp: connection timeout is rather vague, we have millions of settings. what do you want to know exactly?
      • 2021-06-03 15445, 2021

      • zas
        which query? which error?
      • 2021-06-03 15446, 2021

      • alastairp
        lucifer: what setting did you change?
      • 2021-06-03 15419, 2021

      • lucifer
        uwsgi_read_timeout on nginx.
      • 2021-06-03 15406, 2021

      • ruaok
        zas: and then group by time and select a larger interval? you don't have that in the example...
      • 2021-06-03 15416, 2021

      • lucifer
        proxy_read_timeout, proxy_connect_timeout and proxy_send_timeout on nginx-proxy.
      • 2021-06-03 15429, 2021

      • zas
        ruaok: what do you mean?
      • 2021-06-03 15446, 2021

      • alastairp
        lucifer: ah, right. do you know how those items interact with each other? are they all needed?
      • 2021-06-03 15452, 2021

      • alastairp
        zas: we have a view in acousticbrainz that may take a long time to finish (it reads data from the database and generates a zip). a value for uwsgi_read_timeout of 60s is too short
      • 2021-06-03 15404, 2021

      • ruaok
        oh, wait, i see it now.
      • 2021-06-03 15412, 2021

      • ruaok
        $interva not $interval
      • 2021-06-03 15418, 2021

      • zas
        yup^^
      • 2021-06-03 15442, 2021

      • ruaok
        hmm $interva doesn't show in the drop down.
      • 2021-06-03 15445, 2021

      • ruaok
        reload?
      • 2021-06-03 15400, 2021

      • lucifer
        alastairp: no, i do not know how this works. just followed what this said https://easycloudsupport.zendesk.com/hc/en-us/art… for proxy and another SO answer for nginx
      • 2021-06-03 15414, 2021

      • zas
        ruaok: sometimes you need to save/reload the dashboard
      • 2021-06-03 15424, 2021

      • ruaok
        just did that, no change.
      • 2021-06-03 15425, 2021

      • alastairp
        lucifer: ok, no problem. let's wait until zas and ruaok finish and see if we can look into this more
      • 2021-06-03 15431, 2021

      • lucifer
        sure, thanks!
      • 2021-06-03 15449, 2021

      • ruaok
        shan't be long.
      • 2021-06-03 15455, 2021

      • ruaok
        not showing up, zas. :(
      • 2021-06-03 15425, 2021

      • zas
        alastairp: I'm not sure that's the correct approach to fix this issue, what if your zip takes 2 hours to generate, we keep connections open, and gateways will soon be out of resources
      • 2021-06-03 15423, 2021

      • alastairp
        zas: yes, understood. to minimise this issue we're going to set a maximum number of items that you can compress. we need to do some experiments to decide what the good limit is
      • 2021-06-03 15437, 2021
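
Note: a minimal Python sketch of the idea discussed here, a hard cap on dataset size combined with the chunked loading lucifer mentioned earlier. Every name (`MAX_RECORDINGS`, `CHUNK_SIZE`, `build_dataset_zip`, `load_chunk`) and both numeric values are hypothetical; the log only floats "5k? 10k?" and says the right limit still needs experiments. This is not the AcousticBrainz implementation.

```python
import io
import json
import zipfile

MAX_RECORDINGS = 5000  # hypothetical cap; the real value is undecided in the log
CHUNK_SIZE = 1000      # hypothetical number of recordings loaded per query


class DatasetTooLargeError(Exception):
    """Raised when a dataset exceeds the download limit."""


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_dataset_zip(recording_ids, load_chunk):
    """Build an in-memory zip, refusing datasets above MAX_RECORDINGS.

    `load_chunk` is a hypothetical callable that takes a list of ids and
    returns (recording_id, data) pairs for them.
    """
    if len(recording_ids) > MAX_RECORDINGS:
        raise DatasetTooLargeError(
            f"{len(recording_ids)} recordings exceeds limit of {MAX_RECORDINGS}"
        )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for chunk in chunked(recording_ids, CHUNK_SIZE):
            for recording_id, data in load_chunk(chunk):
                archive.writestr(f"{recording_id}.json", json.dumps(data))
    return buffer.getvalue()


def fake_load_chunk(ids):
    """Stand-in for a database query, for demonstration only."""
    return [(i, {"id": i}) for i in ids]


payload = build_dataset_zip(list(range(2500)), fake_load_chunk)
```

Checking the count before any data is loaded keeps oversized requests cheap to reject. As ruaok notes further down, a fixed item count does not guarantee a fixed duration when the server is under load.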

      • zas
        alastairp: really, I don't think that's a good idea to increase timeouts (60s to answer an HTTP query is already very looong)
      • 2021-06-03 15450, 2021

      • alastairp
        we have a future plan that we will have a background process to generate the zip - you will send a request and come back when it's been generated and download it
      • 2021-06-03 15455, 2021

      • alastairp
        but that's not ready yet
      • 2021-06-03 15423, 2021
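
Note: the future plan alastairp describes (request the zip, come back later, download it) follows a submit-and-poll pattern. The Python sketch below shows that pattern with a thread pool; the class and method names are hypothetical, and the log does not say how the feature will actually be built.

```python
import uuid
from concurrent.futures import ThreadPoolExecutor


class ExportJobs:
    """Run archive builds in the background and let callers poll for them."""

    def __init__(self, build_archive, max_workers=2):
        self._build_archive = build_archive  # hypothetical zip-building callable
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._jobs = {}

    def submit(self, dataset_id):
        """Start a build and return a job id immediately."""
        job_id = uuid.uuid4().hex
        self._jobs[job_id] = self._pool.submit(self._build_archive, dataset_id)
        return job_id

    def status(self, job_id):
        """Return 'pending', 'ready' or 'failed' without blocking."""
        future = self._jobs[job_id]
        if not future.done():
            return "pending"
        return "failed" if future.exception() else "ready"

    def result(self, job_id):
        """Return the finished archive, waiting for it if necessary."""
        return self._jobs[job_id].result()


jobs = ExportJobs(lambda dataset_id: b"zip-for-" + dataset_id.encode())
job_id = jobs.submit("abc")
archive_bytes = jobs.result(job_id)
```

The HTTP request that starts the job returns as soon as `submit` does, so no gateway connection stays open while the zip is generated. A production version would more likely use a task queue outside the web process and store finished archives on disk.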

      • ruaok
        alastairp: yes, indeed, increasing the timeouts will lead to disaster.
      • 2021-06-03 15451, 2021

      • alastairp
        so can I get an answer to my initial question - what are the timeouts currently?
      • 2021-06-03 15457, 2021

      • zas
        60s
      • 2021-06-03 15418, 2021

      • alastairp
        ok great. I'm happy to work within 60s for now
      • 2021-06-03 15418, 2021

      • lucifer
        cool, so let's set a limit on recordings that can be zipped in 60s?
      • 2021-06-03 15442, 2021

      • ruaok
        that's also not a good idea. if the server is loaded that goal post moves.
      • 2021-06-03 15401, 2021

      • ruaok
        sadly, this is a really tricky problem to solve.
      • 2021-06-03 15432, 2021

      • lucifer
        also we need to solve this for LB as well, currently we export user listens in the same way.
      • 2021-06-03 15433, 2021

      • alastairp
        yeah, there's not much we can do at the moment here. We have a PR open which is useful in a small number of situations so I'd like to get it merged
      • 2021-06-03 15452, 2021

      • alastairp
        we've already talked about adding in background processing - as lucifer says it's useful for LB as well
      • 2021-06-03 15414, 2021

      • zas
        ruaok: which dashboard? same one?
      • 2021-06-03 15420, 2021

      • zas
        I don't see any variable set
      • 2021-06-03 15411, 2021

      • ruaok
        same one yes.
      • 2021-06-03 15432, 2021

      • zas
      • 2021-06-03 15453, 2021

      • ruaok
        that is exactly what is shown on my screen.
      • 2021-06-03 15405, 2021

      • zas
        did you update and save?
      • 2021-06-03 15406, 2021

      • ruaok
        all the values for interva are exactly the same as on your stat.
      • 2021-06-03 15435, 2021

      • zas
        because it doesn't show when I load this dashboard, so I don't think you saved it
      • 2021-06-03 15407, 2021

      • ruaok
        I didn't save, spin around three times and wave the chicken in the air, you're right.
      • 2021-06-03 15412, 2021

      • ruaok
        better now.
      • 2021-06-03 15413, 2021

      • zas
        now it shows ;)
      • 2021-06-03 15425, 2021

      • ruaok
        yes. now it looks as I would like it to. thanks zas!
      • 2021-06-03 15449, 2021

      • ruaok
        lucifer: alastairp: param: now the graph is about right. https://stats.metabrainz.org/d/OGg5QUCGz/listenbr…
      • 2021-06-03 15421, 2021

      • ruaok
        lucifer: and you can see that the mbid writer is missing writing data -- the timescale writer is not missing data points, so the metric system is working as we expect.
      • 2021-06-03 15402, 2021

      • ruaok
      • 2021-06-03 15409, 2021

      • ruaok
        yeah. :)
      • 2021-06-03 15412, 2021

      • zas
        ruaok: time for 2nd coffee ;) bb soon
      • 2021-06-03 15418, 2021

      • ruaok
        good call.
      • 2021-06-03 15401, 2021

      • lucifer
        ruaok: there is some issue in the metric writer as well, https://sentry.metabrainz.org/metabrainz/metrics-… . but there is some issue in error handling so we are not seeing the actual errors currently.
      • 2021-06-03 15418, 2021

      • ruaok
        I thought I had fixed that. maybe not deployed it.
      • 2021-06-03 15420, 2021

      • lucifer
      • 2021-06-03 15416, 2021

      • ruaok
        no, doesn't look it.
      • 2021-06-03 15426, 2021

      • ruaok won't be able to look at it until next week
      • 2021-06-03 15424, 2021

      • ruaok
        looks to be missing an `exc_error = None` outside the while
      • 2021-06-03 15454, 2021
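
Note: the metric writer code is not shown in the log, so the following is only a generic Python illustration of the kind of bug ruaok describes: a variable assigned inside a `while` retry loop but read after it. Everything except the name `exc_error` is hypothetical.

```python
def run_with_retries(action, max_attempts=3):
    """Call `action` until it succeeds or `max_attempts` is reached.

    Returns (succeeded, last_error).
    """
    # Initialised before the loop so the name is bound even when the first
    # attempt succeeds; without this line the return statements below would
    # raise UnboundLocalError in that case.
    exc_error = None
    attempt = 0
    while attempt < max_attempts:
        attempt += 1
        try:
            action()
            return True, exc_error
        except Exception as err:
            exc_error = err  # remember the most recent failure
    return False, exc_error


# Hypothetical action that fails once, then succeeds.
calls = {"count": 0}


def flaky_action():
    calls["count"] += 1
    if calls["count"] == 1:
        raise ConnectionError("temporary failure")


outcome = run_with_retries(flaky_action)
```

A missing initialisation like this can also mask the original exception behind an unrelated one, which would be consistent with lucifer's remark that the actual errors are not visible.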

      • lucifer
        I can take a look at it if you want :)