i see there's a difference in buffer contents as well, prod had to read entire recording table from disk where it was in cache on wolf.
2022-10-04 27708, 2022
alastairp
I think they might both be hard drives
2022-10-04 27718, 2022
alastairp
zas: md3 on aretha is rebuilding, not sure if that's expected or known
2022-10-04 27746, 2022
lucifer
you mean both aretha disks are HDD or both wolf and aretha are HDDs?
2022-10-04 27755, 2022
alastairp
lucifer: both servers have hdds
2022-10-04 27716, 2022
lucifer
ah i see.
2022-10-04 27727, 2022
lucifer
i'll dump the settings from both instances and do a compare to see if i can figure out some difference.
2022-10-04 27736, 2022
mayhem
aaargh.
2022-10-04 27713, 2022
mayhem
the signup process for the airbnb has become decisively spanish. type on lots of data, in triplicate and sign it.
2022-10-04 27717, 2022
zas
alastairp: that's not a rebuild, but a check. Those are done on regular basis. That's still a resync.
2022-10-04 27733, 2022
atj_mb
Will there be an iris scan on arrival?
2022-10-04 27737, 2022
lucifer
alastairp: its the cache difference. mapping.canonical_release_redirect needs to read entirely from disk and takes about 15mins to read. rerunning the query after cache is built take 43s.
2022-10-04 27741, 2022
alastairp
zas: right thanks, I didn't look into it in too much detail, only the .....>
2022-10-04 27755, 2022
alastairp
lucifer: right, makes sense
2022-10-04 27708, 2022
alastairp
atj_mb: only for people who left the EU
2022-10-04 27732, 2022
mayhem
reosarevok: is you want to eat at this restaurant at 10pm, you'll wait until 1am before you get in.
2022-10-04 27749, 2022
atj_mb
alastairp: as ever, thanks for the reminder
2022-10-04 27737, 2022
lucifer
alastairp: i guess the 3.5hours runtime makes sense then. not sure anything can be done about this.
[critiquebrainz] 14dependabot[bot] opened pull request #470 (03master…dependabot/npm_and_yarn/glob-parent-5.1.2): build(deps): bump glob-parent from 3.1.0 to 5.1.2 https://github.com/metabrainz/critiquebrainz/pull…
2022-10-04 27706, 2022
BrainzGit
[critiquebrainz] 14dependabot[bot] opened pull request #471 (03master…dependabot/npm_and_yarn/json-schema-and-jsprim-0.4.0): build(deps): bump json-schema and jsprim https://github.com/metabrainz/critiquebrainz/pull…
2022-10-04 27711, 2022
lucifer
also +1 on adding run_sql_script and the non transaction version to BU.
2022-10-04 27731, 2022
BrainzGit
[critiquebrainz] 14dependabot[bot] opened pull request #472 (03master…dependabot/npm_and_yarn/ajv-6.12.6): build(deps): bump ajv from 6.10.0 to 6.12.6 https://github.com/metabrainz/critiquebrainz/pull…
2022-10-04 27729, 2022
alastairp
lucifer: hi
2022-10-04 27740, 2022
lucifer
hey!
2022-10-04 27746, 2022
alastairp
consul config for LB has ` "websockets_server_url": "https://sockets.listenbrainz.org",`
2022-10-04 27756, 2022
alastairp
but this seems obviously wrong
2022-10-04 27706, 2022
lucifer
yes that config url is unused.
2022-10-04 27750, 2022
alastairp
I'm just talking to monkey now about it
2022-10-04 27759, 2022
alastairp
we see that in js we use window.location.path and append socket.io to the end
alastairp: apparently the cleanup process is taking ~24 seconds for 105k rows 🗿
2022-10-04 27718, 2022
Pratha-Fish
And was without factoring in the r/w time
2022-10-04 27751, 2022
Pratha-Fish
I suspect that's because the functions I am using (pandas.series.map / pandas.series.apply) are just glorified for loops. I am currently trying out ways to vectorize the operations somehow
2022-10-04 27746, 2022
lucifer
alastairp: i don't think we can do ws in same containers in local development. flask dev server serves only 1 request at a time iirc so incompatible with having a ws server running. unless we want to replace to flask dev server with something else.
2022-10-04 27709, 2022
lucifer
(regarding the last point in that ticket)
2022-10-04 27709, 2022
alastairp
lucifer: new flask actually uses threads I think
2022-10-04 27719, 2022
lucifer
oh, nice.
2022-10-04 27752, 2022
alastairp
you're right that we'd need to test is
2022-10-04 27716, 2022
alastairp
doing this would mean that our ws server was on the same host as the flask server, allowing us to continue using window.location
2022-10-04 27741, 2022
lucifer
yes makes sense. also 1 less container is better
2022-10-04 27746, 2022
alastairp
yep, right
2022-10-04 27708, 2022
alastairp
but there is the magic in the ws container with the extra thread for rmq connections
2022-10-04 27720, 2022
alastairp
which makes me a bit worried about how we do that
2022-10-04 27757, 2022
lucifer
yeah, i wonder if we should add celery and move the magic to it.
2022-10-04 27722, 2022
lucifer
there are probably better resources available on how celery interacts with flask/threads etc.
2022-10-04 27736, 2022
alastairp
I think I had another thought about how we could do it
2022-10-04 27708, 2022
alastairp
OK, for now let's delete all of the websocket data in the config file (I'll do that now, to assist zas with some cleanup he's doing). This isn't going to make anything _more_ broken than it is now
2022-10-04 27719, 2022
alastairp
and if we need to add something back in when we try and fix local dev, we can do that
I would put money on more than 50% being duplicates and other other unlistenable shit.
2022-10-04 27733, 2022
lucifer
MB has 27 M recordings and 40 M tracks. it would be odd if 60M tracks were missing from MB but in apple music. so a lot more than 50% in dupes i think.
2022-10-04 27751, 2022
mayhem
agreed.
2022-10-04 27706, 2022
mayhem
40M tracks. wow.
2022-10-04 27719, 2022
mayhem
seems it was just yesterday we celebrated 1M tracks.