aerozol: atj_mb akshaaatt : these are the human tower builders I was talking about
2022-10-05 27845, 2022
reosarevok
mayhem: can you change the name for a donation? (see support) Alternatively, if I have access to do that, can you show me how so I don't have to pester you? :)
2022-10-05 27854, 2022
bitmap
hola I’m aboard the aerobus, I’ll head to the office
2022-10-05 27804, 2022
mayhem
moin bitmap
2022-10-05 27805, 2022
mayhem
not sure if anyone is at the office yet. you could head to the other airbnb and see if aerozol akshaaatt and atj_mb will let you take a shower there until people head to the office.
2022-10-05 27838, 2022
bitmap
Oh sounds good, then I’ll ping the three of them when I’m outside the Airbnb
2022-10-05 27827, 2022
mayhem
good plan. I'll be heading to the office within the hour.
2022-10-05 27826, 2022
reosarevok
mayhem: I understand the plan is to have zoom streaming for the 14:00 - 19:30 section each day?
2022-10-05 27836, 2022
reosarevok
(just figuring out when I should check in)
2022-10-05 27809, 2022
mayhem
I've not really be part of the streaming/zoom discussion. all I know is that I am lending my webcam to the effort.
2022-10-05 27830, 2022
reosarevok
Ok, sure, but I mean that that's the actual "meeting" part, and the rest is hacking and socialising, from the schedule, right? :)
2022-10-05 27811, 2022
reosarevok
Should be doable, if so :)
2022-10-05 27852, 2022
mayhem
as for tomorrow, we're going to have a sooper special project for the day. one we all need to get through, whether we like it or not. OAuth.
2022-10-05 27802, 2022
reosarevok
AOuch
2022-10-05 27807, 2022
mayhem
if would be good if you could be around for that and help the MB team
2022-10-05 27820, 2022
reosarevok
Let me know the estimated time and I'll do my best :)
2022-10-05 27845, 2022
mayhem
BCN summit-like daytime. you know what that means. :)
2022-10-05 27826, 2022
mayhem
reosarevok: that donation has been updated
2022-10-05 27850, 2022
reosarevok
Thanks!
2022-10-05 27819, 2022
reosarevok
Mailed them back then
2022-10-05 27803, 2022
aerozol
Hey bitmap, sounds good, see you soon!
2022-10-05 27855, 2022
aerozol
mayhem: those human tower photos are insane!!
2022-10-05 27809, 2022
mayhem
right?
2022-10-05 27837, 2022
mayhem
and on university campuses you can see them practicing over lunch.
2022-10-05 27856, 2022
aerozol
The photos of one collapsing are terrifying. Would definitely watch though!
2022-10-05 27807, 2022
mayhem
which is far differente from the computer sci geeks watching the aggies rope wooden bulls in the parking lot with a real rope.
2022-10-05 27819, 2022
mayhem
which is what kept happening at my uni, lol
2022-10-05 27804, 2022
aerozol
Hah, a bit less exciting. Real rope though, ooh
2022-10-05 27825, 2022
bitmap
aerozol: akshaaatt: atj_mb: I’m outside Carrer de Nàpols, 98, I think
2022-10-05 27805, 2022
bitmap
place with a bunch of scaffolding?
2022-10-05 27814, 2022
mayhem
yep
2022-10-05 27826, 2022
mayhem
Press the Atico 3 button to wake them up. :)
2022-10-05 27831, 2022
bitmap
Button pressed!
2022-10-05 27830, 2022
yvanzo
Is there a universal power adapter (19V, 2.1A) I could borrow from BCNers? Or I will buy one for my laptop when I arrive in 6h from now (I know the best shops already).
2022-10-05 27819, 2022
mayhem
yvanzo: I dont have one of those, everything I have is now USB powered. :(
2022-10-05 27804, 2022
q3lont joined the channel
2022-10-05 27849, 2022
agatzk has quit
2022-10-05 27855, 2022
agatzk joined the channel
2022-10-05 27846, 2022
alastairp
yvanzo: what kind of plug does your laptop take?
2022-10-05 27807, 2022
alastairp
I have a 20v 1.5A old thinkpad round connector
2022-10-05 27836, 2022
alastairp
and a 20V 1.3A lenovo new-style thinkpad square connector
2022-10-05 27800, 2022
mayhem has arrived at the office
2022-10-05 27825, 2022
alastairp
slow start here, but I'll turn up soonish
2022-10-05 27831, 2022
q3lont has quit
2022-10-05 27801, 2022
yvanzo
alastairp: round dc jack, tip pin size: 5.5mm * 2.5mm
2022-10-05 27853, 2022
yvanzo
(40w)
2022-10-05 27859, 2022
alastairp
sorry, looks like mine isn't that size. I'll bring it anyway in case you want to try
2022-10-05 27802, 2022
mayhem
lucifer: regarding with what we want to store, I wonder if we should store the key things we really care about as columns, but still have a JSONB column for all the other fields.
2022-10-05 27830, 2022
mayhem
and the question about markets is tricky: do we know that the cross linking they mentioned works as expected?
2022-10-05 27846, 2022
q3lont joined the channel
2022-10-05 27859, 2022
mayhem
I think storing external links is also useful. stuff that we could mine...
2022-10-05 27829, 2022
lucifer
mayhem: i see, how about keep everything as jsonb in the existing cache table but build normalized tables of only the stuff we need. so those normalized tables won't have any jsonb columns but the original table we have currently will.
2022-10-05 27800, 2022
lucifer
uh let me write down a schema to clear it up.
2022-10-05 27816, 2022
alastairp
I'm on my way. will stop and pick up some tea. does anyone drink it/have preferences? akshaaatt aerozol bitmap?
2022-10-05 27851, 2022
lucifer
oh and about those external links, the album tracks endpoint only returns isrc/ean/upc for albums but not for individual tracks. the tracks lookup endpoint does.
2022-10-05 27811, 2022
lucifer
so we'll probably have to do some more lookups to get all the external ids.
2022-10-05 27845, 2022
lucifer
same for genres. those are not returned in all endpoints only some.
2022-10-05 27808, 2022
petitminion joined the channel
2022-10-05 27816, 2022
agatzk has quit
2022-10-05 27822, 2022
agatzk joined the channel
2022-10-05 27831, 2022
q3lont has quit
2022-10-05 27831, 2022
petitminion has quit
2022-10-05 27833, 2022
petitminion_ joined the channel
2022-10-05 27814, 2022
lucifer
mayhem: actually thinking more, yes makes sense to have just one JSONB column for extra things there.
2022-10-05 27815, 2022
lucifer
i think we can columns for the data we need in building the index, rest all goes to jsonb column for now. we can add more columns as need them in future. usecases like reading external ids only need a read of the jsonb column without joins so shuold be fast regardless.
2022-10-05 27829, 2022
lucifer
and we won't need to do them on a very regular basis.
there is the tmp_sp_metadata table on gaga that you could try it on.
2022-10-05 27845, 2022
lucifer
i am not sure how spotify handles artist credits for now, i have put a unique index on spotify_id, name. we'll be able to detect issues but will have to rebuild in case we find some. i guess that's fine.
do you mean storing an array of artist name in the artist table with each id ? or an array of artist name in track or album table?
2022-10-05 27852, 2022
yvanzo
alastairp, mayhem: Thanks, issue resolved, I bought an adapter on the way.
2022-10-05 27842, 2022
reosarevok
mayhem: since you're apparently not on #musicbrainz, someone posted "helloooo, I just tested the recommendation endpoint and wanted to let you know I think results are greeeeat o/ its amazing :D thx a lot :) " :)
2022-10-05 27855, 2022
reosarevok
Oh. That'd be petitminion_
2022-10-05 27815, 2022
mayhem
Heh, lol.
2022-10-05 27802, 2022
lucifer
alastairp: hi! any progress on CB PRs?
2022-10-05 27808, 2022
alastairp
lucifer: hi! we've just finished lunch, lol
2022-10-05 27819, 2022
alastairp
so that's a no
2022-10-05 27826, 2022
lucifer
ah nice! :D
2022-10-05 27829, 2022
lucifer
np
2022-10-05 27844, 2022
Pratha-Fish
alastairp: h e n l o
2022-10-05 27808, 2022
Pratha-Fish
I have a little update.
2022-10-05 27808, 2022
Pratha-Fish
The script will be completed in no time, but it might be a lot slower than expected. To counter that, I am trying to rack my brain to optimize the cleanup functions. But you could recommend any options, it would be pretty helpful.
2022-10-05 27808, 2022
Pratha-Fish
Here's what I've tried:
2022-10-05 27808, 2022
Pratha-Fish
- numpy.vectorize (slower than pandas for some reason lol)
2022-10-05 27808, 2022
Pratha-Fish
- numba.vectorize, numba.jit -> apparently, can't even process a simple dictionary checking function :|
2022-10-05 27808, 2022
Pratha-Fish
- Currently trying out Dask, modin for speedups
2022-10-05 27809, 2022
Pratha-Fish
CC: lucifer
2022-10-05 27844, 2022
lucifer
what does the cleanup function do?
2022-10-05 27850, 2022
lucifer
can share its code?
2022-10-05 27809, 2022
alastairp
Pratha-Fish: yeah, let's take a look at the code first
2022-10-05 27814, 2022
mayhem
lucifer: I guess an array of (artist name, artist id) would be best.
2022-10-05 27848, 2022
Pratha-Fish
lucifer, alastairp: pushing the code to the repo..
2022-10-05 27849, 2022
alastairp
Pratha-Fish: my initial feedback on this is that it's almost certainly not as slow as you think
2022-10-05 27806, 2022
alastairp
and if it does have a problem, it's probably a really simple fix
2022-10-05 27856, 2022
Pratha-Fish
alastairp: I really hope so
2022-10-05 27827, 2022
Pratha-Fish
Currently, it's taking ~18 - 24s to process 105k rows (excluding r/w times)
2022-10-05 27808, 2022
alastairp
I bet we can get that 10x faster
2022-10-05 27812, 2022
Pratha-Fish
note that it's 105k non-unique rows. Maybe making it unique could help, but mapping the results back to the dataframe could be another bottleneck
2022-10-05 27822, 2022
Pratha-Fish
alastairp: epic
2022-10-05 27858, 2022
lucifer
mayhem: that can be built during the join query to fetch the data imo.