iliekcomputers: About the cron job for the incremental dump: if we are creating an incremental dump every day, then we should import it on the same day too
2020-09-10
ishaanshah
Otherwise we will never import that dump at all, because before the next day's import a newer incremental dump will be created, which will then get imported.
sumedh joined the channel
thomasross has quit
sumedh has quit
BrainzGit
[listenbrainz-server] ishaanshah opened pull request #1083 (master…ishaan/listening-activity-range-update): LB-690: Minor improvements to Listening Activity graph https://github.com/metabrainz/listenbrainz-server…
My lecturer dropped a bomb saying college starts on the 21st 😬😬😬
CatQuest
sep?
CatQuest
well school has already been in session here for 2 weeks :D
CatQuest
man am I glad I no longer have it
SomalRudra
my college started a week back
zas
It seems the changes that significantly reduced traffic between the gateways and the MB backend servers also had an impact on the web service (that wasn't obvious at the start)
bitmap, reosarevok: updated the blog post with yesterday's hotfixes, pushed a git tag and fixed the previous git tag message. Make sure to delete your local tag: git tag -d v-2020-09-07 # then fetch to get the new tag
nelgin
yvanzo, well... there has to be a better way to optimize the indexing. Maybe it's time to think of a different type of database? Ever thought about NoSQL?
yvanzo
nelgin: live indexing works perfectly in production; the issue is more about the setup for mirrors.
jesus2099 joined the channel
jesus2099
Rotab yvanzo bitmap CatQuest: Indeed I don't use Opera 12 any more. ;)
jesus2099
reosarevok > "Sigh. I wish GitHub was a bit better at showing where the only change is space"
yvanzo
nelgin: for example, there is only one Solr instance in musicbrainz-docker, whereas we have many nodes in production.
Yes, once you know it exists, you just have to find it back on the page. :)
jesus2099
(the diff settings button)
jesus2099
bitmap: I recently added a CAA ticket because it seems the CORS headers are missing when you use the release-group API. The release API CORS headers are OK, but if you use the release-group API, which forwards to the release API, then you get the error: has been blocked by CORS policy: No 'Access-Control-Allow-Origin' header is present on the
If I understood correctly. I'm not saying I'm 100% sure something should be fixed.
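The failure jesus2099 describes is the standard browser CORS check. A minimal pure-Python sketch of that check, with a hypothetical `cors_allows` helper (not part of any MetaBrainz codebase), to show why a dropped header blocks the response:

```python
# Hypothetical helper illustrating the check a browser performs before
# exposing a cross-origin response: the Access-Control-Allow-Origin
# header must be present and match the requesting origin (or be "*").
def cors_allows(headers: dict, origin: str) -> bool:
    # Header names are case-insensitive, so normalise them first.
    normalised = {k.lower(): v for k, v in headers.items()}
    allow = normalised.get("access-control-allow-origin")
    return allow is not None and allow in ("*", origin)

# A response that sends the header passes; a redirect target that
# drops it (as described for the release-group path) is blocked.
print(cors_allows({"Access-Control-Allow-Origin": "*"}, "https://example.org"))  # True
print(cors_allows({"Content-Type": "image/jpeg"}, "https://example.org"))        # False
```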
shivam-kapila
pristine___: self.temporary_login
pristine___
shivam-kapila: tried. Still not working
iliekcomputers
ishaanshah: it'll import the full dump with the same id as the incremental dump, so we should be good.
iliekcomputers
The full dump command creates a full dump with the ID of the last incremental dump
iliekcomputers
What I am worried about is this: suppose incremental dump creation fails and we request an import of the newest incremental dump; we might end up importing the same incremental dump twice
ishaanshah
> it'll import the full dump with the same id as the incremental dump, so we should be good.
ishaanshah
I didn't get you
ishaanshah
rn, suppose we trigger a full dump on 1st
iliekcomputers
1st of month - incremental dump with Id x is created.
iliekcomputers
Later on the 1st - full dump with Id x is created
iliekcomputers
2 - full dump with Id x is imported
ishaanshah
2 - another incremental dump is created right
iliekcomputers
We don't need to import the incremental dump with Id x because the full dump will contain the same data
ishaanshah
but we aren't importing this one
iliekcomputers
Oh
iliekcomputers
Oof
iliekcomputers
My bad
iliekcomputers
Yeah, we should be importing that
ishaanshah
either we should skip generating that dump or import every day
iliekcomputers
Yeah, I guess we need to import every day.
jesus2099 has left the channel
iliekcomputers
We need some Id validation on the spark side as well
iliekcomputers
Right now it's all dependent on the cron job and brittle
ishaanshah
hmm, so the last imported id for incremental
ishaanshah
otherwise we might end up importing it twice
iliekcomputers
Spark should store the current Id somewhere, check if the dump it's importing is greater than the ID and then import
iliekcomputers
Yeah
ishaanshah
the id can be stored in HDFS only, I guess, 'cause we don't have Redis in the spark cluster
iliekcomputers
Hdfs makes sense to me.
iliekcomputers
Maybe make it a dataframe with history. (id, imported_timestamp, dump_type, dump_timestamp)
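The history table iliekcomputers sketches could back a simple dedup check on the Spark side. A minimal pure-Python sketch under those assumptions: the `ImportRecord` shape and the `should_import` helper are hypothetical (in the real cluster this would be a dataframe stored in HDFS), and the IDs are made up:

```python
from collections import namedtuple
from datetime import datetime

# One row of the hypothetical import-history table described above;
# in the real cluster this would live as a dataframe in HDFS.
ImportRecord = namedtuple("ImportRecord", "id imported_timestamp dump_type dump_timestamp")

def should_import(history, dump_id, dump_type="incremental"):
    """Import only dumps with an ID greater than the newest one already
    imported for this dump type, so a re-requested dump is skipped."""
    imported = [r.id for r in history if r.dump_type == dump_type]
    return dump_id > max(imported, default=0)

history = [
    ImportRecord(41, datetime(2020, 9, 9), "incremental", datetime(2020, 9, 9)),
    ImportRecord(42, datetime(2020, 9, 10), "incremental", datetime(2020, 9, 10)),
]
print(should_import(history, 42))  # False: already imported, don't import twice
print(should_import(history, 43))  # True: newer dump, safe to import
```

This removes the dependence on the cron job ordering: even if the same dump is requested twice, the ID check makes the second import a no-op.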
ishaanshah
cool, cool, I'll make a PR for it over this weekend...
iliekcomputers
Sounds good, thanks!
ishaanshah
btw, how much time did the import take?
iliekcomputers
Let's see how it runs in the meanwhile; it should still be stable this week
ishaanshah
like the copy part
iliekcomputers
ishaanshah: the copy command took ~20 min I think
iliekcomputers
It's not very scalable, but that's because of the way we store data in HDFS
ishaanshah
yep, I saw the ticket
iliekcomputers
I figure we'll have to partition the parquet files based on the listen submission timestamps
iliekcomputers
That way we can just add a new file to hdfs and we're done
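One way to picture that layout: derive a partition directory from each listen's submission timestamp, so importing a new dump only adds files under fresh partitions instead of rewriting existing parquet data. A hedged sketch with a hypothetical `partition_path` helper and an invented root path:

```python
from datetime import datetime, timezone

# Hypothetical hive-style partitioning scheme keyed on the listen
# submission timestamp; the root path is invented for illustration.
def partition_path(submitted_at: datetime, root: str = "/data/listens") -> str:
    return f"{root}/year={submitted_at.year}/month={submitted_at.month:02d}"

ts = datetime(2020, 9, 10, tzinfo=timezone.utc)
print(partition_path(ts))  # /data/listens/year=2020/month=09
```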
ruaok: hey. I have raised InternalServerError if labs.api.listenbrainz.org does not return 200 as the status code, or if there is any other problem with it. What do you think?
ruaok
hmmm.
ruaok
if labs returns a 400 error, then the problem is on the caller (your) side of things. Should that be an ISE?
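ruaok's distinction can be sketched as a small status-code classifier; the exception names below are hypothetical illustrations, not the actual ListenBrainz error classes:

```python
# Hypothetical mapping of the labs API status code to an error category,
# reflecting the point above: a 4xx means our request was malformed
# (caller bug), while only 5xx responses warrant a server-side error.
class BadUpstreamRequest(Exception):
    """Our request to labs.api.listenbrainz.org was malformed (caller bug)."""

class UpstreamServerError(Exception):
    """labs.api.listenbrainz.org itself failed; safe to surface as a 5xx."""

def classify(status_code: int):
    if 200 <= status_code < 300:
        return None  # success, nothing to raise
    if 400 <= status_code < 500:
        return BadUpstreamRequest
    return UpstreamServerError

print(classify(200))                           # None
print(classify(400) is BadUpstreamRequest)     # True
print(classify(503) is UpstreamServerError)    # True
```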
reosarevok
"External Server Error" :D
BrainzGit
[listenbrainz-server] mayhem merged pull request #1080 (master…distinct-similar-top-artist): [LB-703] Similar and top artist should be distinct for a user https://github.com/metabrainz/listenbrainz-server…