ruaok: the dump was successful but for some reason spark didn't import it. `request_import_incremental` without supplying `--id` seems to be borked on the spark side.
I see the issue: the current code assumes that all dump ids are sequential. it starts searching for incremental dumps with ids after the last full dump and checks whether each exists; if one doesn't, it stops looking further. in this case 431 is the id of the full dump, but the incremental dumps with ids 435 and 436 are missing, so it believes there are no more dumps to import and returns.
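A minimal sketch of that failure mode in Python; the function name and the set of available dump ids are illustrative stand-ins, not the actual ListenBrainz code:

```python
# Hypothetical set of available incremental dump ids; 435 and 436 are missing.
AVAILABLE_DUMPS = {432, 433, 434, 437, 438}

def find_dumps_to_import(last_full_dump_id: int) -> list[int]:
    """Mimic the current logic: walk ids upward from the last full dump
    and stop at the first id that does not exist."""
    to_import = []
    dump_id = last_full_dump_id + 1
    while dump_id in AVAILABLE_DUMPS:  # the first gap ends the search
        to_import.append(dump_id)
        dump_id += 1
    return to_import

print(find_dumps_to_import(431))  # [432, 433, 434] -- 437 and 438 are never imported
```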
2021-05-25 14517, 2021
shivam-kapila
maybe try the next 5 IDs and then gracefully stop.
2021-05-25 14557, 2021
lucifer
that sounds brittle. but it also depends on why those particular ids are missing.
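For reference, the lookahead idea would look roughly like this sketch (the window size and the `AVAILABLE_DUMPS` set are assumptions); as lucifer says, it only helps when gaps never exceed the window:

```python
AVAILABLE_DUMPS = {432, 433, 434, 437, 438}  # hypothetical; 435 and 436 missing
LOOKAHEAD = 5  # give up only after 5 consecutive missing ids

def find_dumps_with_lookahead(last_full_dump_id: int) -> list[int]:
    to_import = []
    dump_id = last_full_dump_id + 1
    misses = 0
    while misses < LOOKAHEAD:
        if dump_id in AVAILABLE_DUMPS:
            to_import.append(dump_id)
            misses = 0   # reset the window on every hit
        else:
            misses += 1  # a gap, but keep probing
        dump_id += 1
    return to_import

print(find_dumps_with_lookahead(431))  # [432, 433, 434, 437, 438]
```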
2021-05-25 14525, 2021
ruaok
> I see the issue: the current code assumes that all dump ids are sequential. it starts searching for incremental dumps with ids after the last full dump and checks whether each exists; if one doesn't, it stops looking further. in this case 431 is the id of the full dump, but the incremental dumps with ids 435 and 436 are missing, so it believes there are no more dumps to import and returns.
2021-05-25 14517, 2021
ruaok
lucifer: this is what I was saying last week -- this requirement is not really needed for us -- the referential integrity of the dumps is not compromised by skipping a dump. there will just be a gap in the data.
2021-05-25 14541, 2021
ruaok
we should do two things: 1) go ahead and import the dumps anyway 2) complain to high heaven that a dump is missing.
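A sketch of those two behaviours together, assuming the latest dump id is already known from somewhere (which is exactly what the next messages turn to); the helpers here are hypothetical:

```python
import logging

logger = logging.getLogger(__name__)
AVAILABLE_DUMPS = {432, 433, 434, 437, 438}  # hypothetical

def import_all_available(last_full_dump_id: int, latest_dump_id: int) -> None:
    for dump_id in range(last_full_dump_id + 1, latest_dump_id + 1):
        if dump_id in AVAILABLE_DUMPS:
            print(f"importing dump {dump_id}")  # stand-in for the real import
        else:
            # skipping a dump doesn't break referential integrity, it just
            # leaves a gap in the data -- so scream, but keep going
            logger.error("incremental dump %d is missing!", dump_id)

import_all_available(431, 438)
```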
2021-05-25 14555, 2021
lucifer
ruaok: right, the issue here is that the code treats a missing dump as a signal to stop looking further. if we start ignoring gaps, we need another way to find out which dump is the latest.
2021-05-25 14510, 2021
lucifer
hence, the suggestion about the latest file.
2021-05-25 14547, 2021
ruaok
timestamps.
2021-05-25 14515, 2021
ruaok
but, if you prefer, a LATEST file. I hate implementing those. dunno why
2021-05-25 14512, 2021
lucifer
i see, like the check_ftp_age command.
2021-05-25 14515, 2021
lucifer
that's doable. i suggested a LATEST file because MB does it that way, and they seem to have way less trouble with dumps than us.
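The MB-style approach could be as small as the sketch below, assuming the dump producer writes the newest dump id to a well-known file (the URL is made up for illustration):

```python
from urllib.request import urlopen

# Assumed location; the real dump server layout may differ.
LATEST_URL = "http://ftp.example.org/pub/listenbrainz/incremental/LATEST"

def get_latest_dump_id() -> int:
    """Read back the id of the newest incremental dump."""
    with urlopen(LATEST_URL) as response:
        return int(response.read().decode("utf-8").strip())
```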
2021-05-25 14550, 2021
ruaok
lucifer: that's because I had all this trouble and more with them over 15 years ago.
2021-05-25 14532, 2021
ruaok
lucifer: the import command can be a lot simpler really. our dump file names sort nicely, by design.
2021-05-25 14551, 2021
ruaok
just import to the end of the list and be done.
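Something along these lines; the file names are invented for illustration, the point being that a plain lexicographic sort already puts them in dump order:

```python
# Hypothetical directory listing of incremental dumps.
files = [
    "listenbrainz-dump-433-20210519-incremental.tar.xz",
    "listenbrainz-dump-432-20210518-incremental.tar.xz",
    "listenbrainz-dump-437-20210523-incremental.tar.xz",
]
last_imported = "listenbrainz-dump-432-20210518-incremental.tar.xz"

# Sort, then import everything that comes after the last imported name.
for name in sorted(files):
    if name > last_imported:
        print(f"importing {name}")  # stand-in for the real import
```

(This relies on the ids and dates in the names being zero-padded to a fixed width, so string order matches numeric order.)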
2021-05-25 14500, 2021
lucifer
oh! makes sense now why MB dumps are more stable.
2021-05-25 14513, 2021
lucifer
indeed, it can be a lot simpler.
2021-05-25 14529, 2021
lucifer
we have some additional complexity on the spark side. we store the id and import timestamp of all the imported dumps in hdfs. then while importing we do some non-trivial stuff to figure out which dumps to import.
2021-05-25 14553, 2021
ruaok
nuke it all.
2021-05-25 14510, 2021
ruaok
that logic is fitting for some contexts, but not really for our own spark context.
2021-05-25 14511, 2021
lucifer
storing metadata is probably helpful for debugging but isn't needed at all during import.
2021-05-25 14537, 2021
ruaok
debugging and process monitoring.
2021-05-25 14551, 2021
ruaok
if we miss a dump or dumps are out of date, we need to scream.
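The screaming half could be a simple freshness check in the monitoring path; a sketch under an assumed daily cadence, with a plain exception standing in for real alerting:

```python
from datetime import datetime, timedelta, timezone

MAX_DUMP_AGE = timedelta(days=2)  # assumed: incremental dumps normally arrive daily

def check_dump_freshness(last_import_time: datetime) -> None:
    """Raise loudly if the newest imported dump is older than expected."""
    age = datetime.now(timezone.utc) - last_import_time
    if age > MAX_DUMP_AGE:
        # stand-in for real alerting (email, IRC bot, pager, ...)
        raise RuntimeError(f"dumps are out of date: last import was {age} ago")
```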
2021-05-25 14527, 2021
lucifer
indeed.
2021-05-25 14551, 2021
ruaok
monkey: can you please work on determining why our LB reports are broken? I'd like to deploy a hotfix ASAP.
2021-05-25 14516, 2021
monkey
Sure, let me look
2021-05-25 14533, 2021
ruaok
thx
2021-05-25 14543, 2021
monkey
Yeah, I see a `TypeError: Cannot read property 'from_ts' of undefined`
Yeah, the code in static/js/src/stats/UserListeningActivity.tsx is quite fragile. We didn't see any of the errors until a month ago when I set it up to catch and report errors.
2021-05-25 14523, 2021
monkey
I'll see if I can make the whole component more robust
2021-05-25 14549, 2021
monkey
I'm not very familiar with that component at all, but I suppose if we don't have all the data needed for the current week (which seems to be what currently breaks the page), I should just show an error message for that range?
2021-05-25 14542, 2021
lucifer
monkey: in that case i think we should just display 0 for the days for which data is missing.
2021-05-25 14504, 2021
monkey
Shouldn't the API return 0 for days without listens, in normal cases?
2021-05-25 14522, 2021
monkey
Doesn't the complete absence of data indicate that there's an error?
2021-05-25 14557, 2021
lucifer
i am not sure how it works, i.e. whether we coalesce the missing data on the frontend or in the api.
missing data is indeed an error but we used to display outdated reports instead of erroring.
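Whichever side ends up doing it, the coalescing itself is just densifying a sparse series; a sketch with an invented data shape:

```python
from datetime import date, timedelta

def zero_fill(counts: dict[date, int], start: date, end: date) -> list[dict]:
    """Expand sparse per-day listen counts so missing days come back as 0."""
    series = []
    day = start
    while day <= end:
        series.append({"date": day.isoformat(), "listen_count": counts.get(day, 0)})
        day += timedelta(days=1)
    return series

sparse = {date(2021, 5, 24): 12}  # only one day has listens
print(zero_fill(sparse, date(2021, 5, 24), date(2021, 5, 26)))
# [{'date': '2021-05-24', 'listen_count': 12},
#  {'date': '2021-05-25', 'listen_count': 0},
#  {'date': '2021-05-26', 'listen_count': 0}]
```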
2021-05-25 14530, 2021
monkey
The code sort of relies on there being *some* data so that it can display the start and end dates of the range to the user, but if there's no data at all there's no point anyway
2021-05-25 14535, 2021
monkey
Ah?
2021-05-25 14514, 2021
lucifer
i think there is some bug in week calculation somewhere.
2021-05-25 14518, 2021
monkey
Outdated reports… Well, the code hasn't changed recently; we're now just catching JS errors. I'm not sure how the mechanism of showing older reports worked
2021-05-25 14530, 2021
lucifer
yeah, me neither.
2021-05-25 14550, 2021
lucifer
imo, ideally the api should send the data of the last two weeks.
2021-05-25 14511, 2021
monkey
That would work, and display things correctly.
2021-05-25 14542, 2021
lucifer
whichever last two weeks it has, doesn't matter whether they're current or not.
2021-05-25 14532, 2021
lucifer
i think we might be able to fix it for now by generating new stats.
2021-05-25 14534, 2021
lucifer
i imported yesterday's data into spark this morning; if i request new stats now there will be data for the current week.
2021-05-25 14551, 2021
lucifer
that'll give us some time to think about how to solve this correctly.
2021-05-25 14510, 2021
monkey
Wait before you do that please
2021-05-25 14523, 2021
monkey
It's helpful for me for testing to have it in broken state
2021-05-25 14546, 2021
lucifer
sure
2021-05-25 14535, 2021
monkey
So in summary there are two separate issues:
2021-05-25 14535, 2021
monkey
1. The API isn't returning any data for the current week
2021-05-25 14535, 2021
monkey
2. As a result, some parts of the JS page break (we weren't previously aware of this as we didn't have Sentry reporting it)
2021-05-25 14502, 2021
lucifer
yes, right.
2021-05-25 14523, 2021
monkey
As we discussed, I'll make sure the front-end recovers when data it expects is missing; that'll be a start.
2021-05-25 14523, 2021
monkey
Which means maybe there's nothing special to do for error #1 other than actually calculating the data (meaning we don't need to ensure we return a provisional 0 listens for each day of the current week up until then)
2021-05-25 14557, 2021
monkey
Or do we want to ensure the API calls always return some form of data for the current week?
2021-05-25 14525, 2021
monkey
(Or returns an error if we somehow don't have the data?)
2021-05-25 14516, 2021
lucifer
i see the API does return a 204 if there is no data.
2021-05-25 14554, 2021
monkey
But it isn't currently returning a 204 as there is data from last week, even though there is no data for this week
2021-05-25 14504, 2021
monkey
(I'm talking about the 'week' range here)
2021-05-25 14525, 2021
lucifer
right, according to my understanding it sends data for both weeks in a single response.
2021-05-25 14556, 2021
monkey
So it should probably return a 204 if there's only data for the last week but not for the current week
2021-05-25 14521, 2021
lucifer
i think a single week's data should still display a report. we use two weeks of data so that we can compare the last two weeks, but if that data is not there then displaying just one week's chart is fine.
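So the handler shape being described is roughly the Flask-style sketch below (the route, payload shape, and `load_weeks` helper are assumptions, not the actual ListenBrainz API): 204 only when there is nothing at all, otherwise return whatever weeks exist and let the frontend draw one or two.

```python
from flask import Flask, jsonify

app = Flask(__name__)

def load_weeks(user_name: str) -> list[dict]:
    # Stand-in for the real stats lookup; may yield 0, 1 or 2 weeks of data.
    return [{"week": "last", "total_listens": 123}]

@app.route("/1/stats/user/<user_name>/listening-activity")
def listening_activity(user_name: str):
    weeks = load_weeks(user_name)
    if not weeks:
        return "", 204  # no data at all
    # One week is enough to draw a chart; two enables the week-over-week comparison.
    return jsonify({"payload": {"listening_activity": weeks}})
```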
2021-05-25 14519, 2021
monkey
I see. Let me try that then
2021-05-25 14559, 2021
monkey
Yes, that makes a lot more sense, thanks for the help
2021-05-25 14557, 2021
monkey
Yeah, it's not perfect but it sure is better than a broken page (note the presence of the orange square in the caption, but without dates next to it)
lucifer
right, sorry i meant the other reports on this page, heatmap and world map etc.
2021-05-25 14529, 2021
monkey
Yes, they work fine. It's really just the way we tried to access the timestamps that was throwing an error
2021-05-25 14546, 2021
lucifer
ah! makes sense.
2021-05-25 14558, 2021
atj
I like the bold/coloured text support, makes those messages much more readable
2021-05-25 14503, 2021
atj
is that an ircd thing?
2021-05-25 14546, 2021
monkey
lucifer: Oh, and we can ignore the failing front-end test, it's unrelated. I'm working on a separate PR for that
2021-05-25 14518, 2021
lucifer
sure, i'll merge and deploy then?
2021-05-25 14537, 2021
monkey
(another one of those "your snapshot contains a date string and each time you run the tests your date string will change, failing the test" kind of deal)
2021-05-25 14554, 2021
shivam-kapila
atj: I get the option in the android app but not on the desktop client