in #metabrainz

0:03 AM
supersandro2000 has quit
0:03 AM
supersandro20000 joined the channel
0:22 AM
d4rkie has quit
0:23 AM
D4RK-PH0ENiX joined the channel
0:42 AM
ephemer0l_ joined the channel
0:51 AM
KindOne has quit
0:53 AM
KindOne joined the channel
0:57 AM
KindTwo joined the channel
0:58 AM
KindOne has quit
1:02 AM
KindTwo has quit
1:04 AM
KindOne joined the channel
2:31 AM
Gore has quit
2:32 AM
Gore joined the channel
3:24 AM
Chinmay3199 joined the channel
5:41 AM
bitmap

zas: not urgent but you can remove musicbrainz-redis-store-beta from the redis stats whenever you get a chance, it's been merged into musicbrainz-redis-store
5:50 AM
btw, if anyone was logged out of beta let me know, out of curiosity (my session was preserved but there might've been race conditions during the move)
5:55 AM
yvanzo

bitmap: can I help you with beta or redis?
5:57 AM
bitmap

yvanzo: I finished merging the beta redis store into the prod one, so we can deploy #1529 to beta now if you're up for that
5:58 AM
off to sleep for now
6:14 AM
yvanzo

'night!
6:25 AM
BrainzGit

[musicbrainz-server] yvanzo merged pull request #1529 (master…mbs-10845): MBS-10845: User lookup tools for account admins https://github.com/metabrainz/musicbrainz-serve...
6:25 AM
BrainzBot

MBS-10845: Tool to allow account admins to look up accounts by e-mail https://tickets.metabrainz.org/browse/MBS-10845
6:26 AM
Rotab has quit
6:33 AM
Rotab joined the channel
7:05 AM
v6lur_ joined the channel
7:18 AM
Sophist-UK has quit
7:47 AM
BrainzGit

[musicbrainz-server] yvanzo merged pull request #1507 (master…edit-label): MBS-10817: Convert Edit Label edit to React https://github.com/metabrainz/musicbrainz-serve...
7:47 AM
BrainzBot

MBS-10817: Convert Edit Label edit to React https://tickets.metabrainz.org/browse/MBS-10817
8:04 AM
Mr_Monkey

Mornin'!
8:05 AM
Wizzup has quit
8:07 AM
yokel has quit
8:08 AM
yokel joined the channel
8:17 AM
Wizzup joined the channel
8:37 AM
diru1100

Mooin'!!
8:42 AM
supersandro20000 is now known as supersandro2000
8:57 AM
_lucifer

SomalRudra: you added to the wrong .gitignore file. add the contents to the gitignore file in the root directory.
8:58 AM
after that follow the steps here to enforce the changes https://stackoverflow.com/questions/7075923/res...
9:05 AM
Gazooo has quit
9:06 AM
Gazooo joined the channel
9:07 AM
jmp_music joined the channel
9:07 AM
jmp_music

moooin
9:21 AM
ruaok

moin!
9:22 AM
iliekcomputers: ping me when you have a minute.
9:25 AM
shivam-kapila

Morning
9:33 AM
iliekcomputers

ruaok: hey
9:35 AM
yvanzo

updating beta.mb.o
9:39 AM
ruaok

iliekcomputers: so, I've gotten very little to no feedback on the dups. everyone seems to be fixated on listen counts, rather than doing some digging to find dups.
9:39 AM
I think the listen counts in production are wrong too and I dont want to spend time fixing them.
9:40 AM
iliekcomputers

The listen counts in production as in the user listen counts or the global listen count?
9:40 AM
ruaok

I think the think to do is actually start a migration that could make its way to production.
9:40 AM
both probably
9:40 AM
iliekcomputers

The global listen count is definitely incorrect
9:41 AM
I remember opening an issue about it.
9:41 AM
ruaok

so, by getting a production ready setup, we can create a more realistic comparison, that people can then ignore.
9:42 AM
but I wonder if "this is being ignored" is more like "I can't find problem, but I can't say for sure, so I won't say anything."
9:42 AM
iliekcomputers

I think it's the second.
9:42 AM
ruaok

likely.
9:43 AM
so, starting a production ready conversion.... I suppose first step might be for you to review the import script.
9:43 AM
then to do an actual migration -- which will be a bit tricky.
9:43 AM
iliekcomputers

Yeah, that sounds good. Happy to review.
9:43 AM
Would a doc detailing the steps be helpful?
9:44 AM
ruaok

I would need to start a new exchange and connect it to the incoming stream.
9:44 AM
yes!
9:44 AM
I'll setup for the review and the doc after this chat.
9:45 AM
once the next change starts receiving listens (and the queue will grow significantly) we will need to trigger a new dump.
9:45 AM
and then once the dump is done, then it will take some 12-24 hours to prepare and import the dump.
9:45 AM
then we can connect it live and in theory we should have a clean and consistent database.
9:46 AM
I'm somewhat concerned about the number of listens growing, but I think we should be ok.
9:46 AM
so, right now my migration code is in a separate repo.
9:46 AM
iliekcomputers

Could we just keep writing the listens in the queue and import simultaneously?
9:47 AM
ruaok

that is exactly what I want to do.
9:47 AM
iliekcomputers

Writing the listens to timescale
9:47 AM
I meant to say
9:47 AM
ruaok

ah, no, not ideal.
9:48 AM
the insert will be most performant when you have insert in sequential time order from oldest to newest.
9:48 AM
iliekcomputers

Okay, that makes sense.
9:48 AM
ruaok

having listens stream in will break that.
9:48 AM
what I suppose can try is to stream the listens to timescale *until* the import. then stop it, let the queue grow, do the import, then catch up.
9:49 AM
I *think* that wont impact the import so badly.
9:49 AM
iliekcomputers

If the rabbitmq queue falls over, we'll just start the process over again with something else I guess. Sounds reasonable to me.
9:50 AM
ruaok

given that the import code is going to run once, by me, I'm not too keen on importing the code into lb-server and tidying it up to our strict standards.
9:50 AM
can you review two scripts on their own and mainly look for logical errors?
9:50 AM
iliekcomputers

OK, sure.
9:51 AM
ruaok

https://github.com/mayhem/timescale-testing
9:51 AM
https://github.com/mayhem/timescale-testing/blo...
9:51 AM
that is the first script. it reads in the listens dump and creates a single file that gnu sort can sort on the command line.
9:51 AM
https://github.com/mayhem/timescale-testing/blo...
9:52 AM
is the second script that does the actual importing.
9:52 AM
iliekcomputers

Okay, will read through when I get the time.
9:52 AM
ruaok

ok, great.
9:52 AM
I'll work up a migration doc next.
9:52 AM
iliekcomputers

Great, thanks!
9:52 AM
ruaok

the importer is a bit tricky, since it uses threads to run 5 insertions at the same time.
9:53 AM
tuned to make the import move at a reasonable speed.
9:53 AM
the two most critical functions are check_for_duplicates and import_dump_file
9:57 AM
iliekcomputers

Thanks, I'll take a look.
9:57 AM
zas

yvanzo: we can stop search-server (old one) containers right?
10:00 AM
yvanzo

zas: it is still available for mirrors using LUCENE search, not sure it is still used.
10:04 AM
zas

we need to sort this out, because those are using quite a lot resources we may use for something else. What about stopping them for a while and see if anyone complains (if they do, we'll see what to do, but unlikely starting those again, rather pointing them at solr), I remember we had some hacks in preparation to ease the move, but they were never finished, and lead to a lot of complexity.
10:06 AM
yvanzo

is there any stats about search.musicbrainz.org ?
10:06 AM
zas

the story of the death of old search servers is too long, I want to put an end to it, we have better things to do with those resources
10:07 AM
ruaok

make the death of those services worth it, ok?
10:07 AM
pick some good bullets.
10:07 AM
yvanzo

zas: we defintely have to point search.mb.o at solr, that is MBH-502.
10:07 AM
BrainzBot

MBH-502: Point search.musicbrainz.org to the new SOLR cloud https://tickets.metabrainz.org/browse/MBH-502
10:08 AM
zas

We have some web access stats
10:09 AM
43k hits yesterday
10:09 AM
30k were 200s
10:09 AM
ruaok

no way. that's incredible? sad?
10:10 AM
yvanzo

The MBS PR for that ticket seems to be mostly ready, it just gathered a bit of dust.
10:11 AM
zas

can we take some time on this this week, and sort it out?
10:11 AM
yvanzo

yup!
10:11 AM
BrainzGit

[bookbrainz-site] prabalsingh24 opened pull request #428 (UserCollection…master): Merge master into UserCollection https://github.com/bookbrainz/bookbrainz-site/p...
10:11 AM
zas

ok, tell me when
10:12 AM
yvanzo

we need a roadmap at least
10:14 AM
pointing at solr could be made on Monday along with MBS release.
10:16 AM
zas

bitmap disables search indexer already: https://github.com/metabrainz/docker-server-con...
10:16 AM
so I guess current instances aren't updated since a while
10:22 AM
yvanzo

bitmap: I changed staticbrainz project on jenkins to remove previous temporary container, but the actual issue is that this container is not run and thus doesn't have a build/ directory anyway. Doesn’t seem to be in use currently anyway.
10:34 AM
ishaanshah

iliekcomputers: Hi, please ping me when you get time
10:36 AM
jmp_music has quit
10:46 AM
iliekcomputers

ishaanshah: hey
10:47 AM
ruaok: thanks, I'll take a look today, after work
10:47 AM
ishaanshah

hi
10:47 AM
ruaok

no rush.
10:47 AM
ishaanshah

I was working on fixing the db tests
10:48 AM
However I am running into some weird issue
10:48 AM
https://github.com/metabrainz/listenbrainz-serv...
10:48 AM
This line should insert the release statistics in the db
10:49 AM
However it does not
10:49 AM
https://travis-ci.org/github/metabrainz/listenb...
10:49 AM
I tried out inserting manually and the function seems to be working fine
10:49 AM
artist data is getting inserted, but not release
10:51 AM
iliekcomputers

That sounds weird.
10:52 AM
ishaanshah

maybe I am making some silly error
10:53 AM
iliekcomputers

what does this return: https://github.com/metabrainz/listenbrainz-serv...
10:53 AM
jmp_music joined the channel
10:53 AM
ishaanshah

https://www.irccloud.com/pastebin/zUOd2KfB/
10:55 AM
iliekcomputers

it's probably something in the query.
10:55 AM
i'll have to look into it, can't think of anything off the top of my head.
10:56 AM
ishaanshah

I tried the query separately in docker container
10:57 AM
It works correctly
10:58 AM
I will take another look at it
10:59 AM
Is there any way to make sqlalchemy log if there is any error in the query
11:00 AM
iliekcomputers

sqlalchemy would raise an exception if the query was erroneous (ideally)
11:00 AM
i'm suspicious of the query.format
11:00 AM
maybe you could write a specific function for inserting release and see if that works
11:00 AM
if it does, see what the difference between the two is
11:01 AM
ishaanshah

Hmm, I will do that
11:05 AM
iliekcomputers

and check that the release data is being passed around into the query correctly