in #metabrainz

8:00 AM
_lucifer

how many listens did you have?
8:00 AM
ishaanshah

But there was a bug in the query that I wrote, maybe because of that
8:01 AM
One incremental dump
8:01 AM
I'll try again using matchable text now
8:01 AM
_lucifer

Listen count from 2020-04-01 17:59:43 to 2020-09-28 17:59:43: 1876719
8:01 AM
Number of distinct rows in the mapping: 4510905
8:01 AM
Listen count after mapping: 991921
8:02 AM
these are my stats, yeah i am using the full matchable mapping
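
(Editor's note: the counts quoted above imply that roughly half of the listens survived the join with the mapping. A minimal check, using only the numbers from the log; the variable names are illustrative and not taken from the ListenBrainz code.)

```python
# Counts quoted in the log above; variable names are illustrative only.
listens_total = 1_876_719   # listens from 2020-04-01 to 2020-09-28
listens_mapped = 991_921    # listens remaining after joining with the mapping

# Fraction of listens that matched a row in the mapping.
match_rate = listens_mapped / listens_total
print(f"{match_rate:.1%} of listens matched the mapping")
```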
8:04 AM
pristine___

Two hours, too much.
8:04 AM
I think when you are testing, you should use less data to make the process faster.
8:05 AM
_lucifer

pristine___: yeah right, for once i want to see how much time it takes but for future i would sure like it to be quicker
8:06 AM
pristine___

Yes please!
8:06 AM
_lucifer

let's talk with ruaok on this to get a sample mapping
8:06 AM
pristine___

In prod it takes 15 min to create df :p
8:06 AM
_lucifer

my pc is not that bad by those standards then :P ;)
8:07 AM
shivam-kapila

Yeah
8:07 AM
Only 8 times slower :p
8:07 AM
pristine___

_lucifer: I think we should first make a small dump of listens, join it with the 11GB mapping to get a subset of the mapping, and do the same with the artist relation. This is like the basic idea.
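
(Editor's note: the idea above amounts to a semi-join: keep only the mapping rows whose key appears in a small sample of listens. The sketch below uses plain Python lists of dicts purely for illustration. The actual pipeline works on Spark dataframes, and the `matchable_text` key, the row layout, and the `subset_mapping` helper are all hypothetical.)

```python
# Hypothetical, simplified rows; the real pipeline uses Spark dataframes
# and the real column names may differ.
def subset_mapping(listens, mapping, key="matchable_text"):
    """Keep only the mapping rows whose key appears in the listens sample."""
    wanted = {listen[key] for listen in listens}
    return [row for row in mapping if row[key] in wanted]

sample_listens = [
    {"user": "a", "matchable_text": "artist1 track1"},
    {"user": "b", "matchable_text": "artist2 track2"},
]
full_mapping = [
    {"matchable_text": "artist1 track1", "recording_id": 1},
    {"matchable_text": "artist2 track2", "recording_id": 2},
    {"matchable_text": "artist3 track3", "recording_id": 3},
]

small_mapping = subset_mapping(sample_listens, full_mapping)
print(len(small_mapping), "of", len(full_mapping), "mapping rows kept")
```

The same filter could then be applied to the artist relation, keyed on whichever artist identifier the sampled rows carry.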
8:08 AM
> my pc is not that bad by those standards then :P ;)
8:08 AM
Mine toooo.
8:08 AM
So I never try with full dumps :p
8:09 AM
_lucifer

pristine___: what is the benefit of --html flag? it says it'll generate html files but what is their content and use?
8:09 AM
shivam-kapila

Observability
8:09 AM
Of results
8:09 AM
_lucifer

yeah so what will those files contain?
8:10 AM
pristine___

_lucifer: we have two rn
8:10 AM
One for model, one for candidate bets
8:10 AM
Sets*
8:10 AM
_lucifer

candidate set one
8:10 AM
pristine___

Yeah
8:11 AM
I use it to debug discrepancy in candidate set data
8:11 AM
Try generating one for yourself, it will give you and idea
8:11 AM
An*
8:11 AM
_lucifer

train model completed successfully as well in 5 min
8:16 AM
adhawkins

Running the mbserver docker containers, I'm seeing entries like this in the logs:
8:16 AM
musicbrainz_1 | [error] 08006 DBI connect('dbname=musicbrainz_db;host=db;port=5432','musicbrainz',...) failed: FATAL: the database system is starting up
8:17 AM
According to the output of 'docker ps', the database container has only been up for about 6 hours.
8:17 AM
'docker-compose logs db' shows that the database did indeed start up around 2am.
8:18 AM
Any suggestions for working out what's going on? No OOM errors since making the adjustments to shared_buffers and stopping replication during my overnight music 'scan'
8:27 AM
alastairp

"Your package will be delivered within the next 24 hours"
8:27 AM
thanks, courier company, that's super helpful
8:28 AM
btw: I got about 20 spam PRs in Freesound today, it seems likely that people are trying to pad commits to get credit in hacktoberfest
8:28 AM
it'll be interesting to see if MB gets any
8:30 AM
reosarevok

That's sad :D
8:33 AM
adhawkins

Oh, seems like I was wrong about the OOM. It fired around midnight this morning. This time the culprit was 'java'.
8:33 AM
I'm just running a scan over my music at the moment. When this completes I'll reboot the VM running it all.
9:17 AM
_lucifer

https://www.irccloud.com/pastebin/0lirukSn/
9:17 AM
pristine___: ishaanshah: ^
9:17 AM
it fails during html file generation in candidate sets
9:17 AM
the candidate sets are saved by that point so i guess it should work if i do not pass the --html flag
9:18 AM
pristine___

Yeah, you can disable html file generation
9:19 AM
I think it's not able to write/read/open a huge file.
9:20 AM
_lucifer

so that is a listens issue then because the mapping is not involved there
9:20 AM
yeah right
9:21 AM
pristine___

_lucifer: if you really want to generate the html file, try with a subset of listens. Just do `dataframe --days=30` (or even less)
9:30 AM
_lucifer

👍
9:31 AM
ruaok

alastairp: https://blog.domenic.me/hacktoberfest/
9:32 AM
alastairp

ruaok: yep, just found that
9:32 AM
the PRs we got are exactly like the ones shown there
9:33 AM
I've restricted access to the repo, but it only applies for 24h :/
9:33 AM
maybe they'll give up after the first day
9:43 AM
reosarevok

None yet on our repos it seems? Maybe we're not cool enough :p
9:45 AM
yvanzo

or they considered it's too much spammed already ;)
9:45 AM
reosarevok hides
9:45 AM
reosarevok

Argh, MBS-486 is such a pain in the ass
9:45 AM
BrainzBot

MBS-486: Add support for years BC https://tickets.metabrainz.org/browse/MBS-486
9:45 AM
reosarevok

But I really want to get it done, so let's keep trying :D
9:47 AM
_lucifer

pristine___: recs generated :D, any way to export and share them ?
9:53 AM
shivam-kapila

troi?
9:54 AM
_lucifer

let me check it out!
10:32 AM
chaban

reosarevok: https://tickets.metabrainz.org/browse/MBS-11107...
10:32 AM
BrainzBot

MBS-11107: span.name-variation class is missing on some relationship credits
10:33 AM
reosarevok

Goddamit :D
10:33 AM
I can check in a bi
10:33 AM
*bit
10:38 AM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #1721 (master…MBS-2418): MBS-2418: Show Edit URL edits in entity edit histories https://github.com/metabrainz/musicbrainz-serve...
10:38 AM
BrainzBot

MBS-2418: Edit URL edits not shown in entity edit histories https://tickets.metabrainz.org/browse/MBS-2418
10:39 AM
reosarevok

hmm. chaban: I'm not seeing that :/ Do you have an example?
10:40 AM
Oh wait
10:40 AM
Sorry, I was being dumb
10:40 AM
I'm seeing it :)
10:48 AM
BrainzGit

[musicbrainz-server] reosarevok opened pull request #1722 (beta…MBS-10536-redux): MBS-10536 (redux): Remove span.name-variation around "see all" releases link https://github.com/metabrainz/musicbrainz-serve...
10:48 AM
BrainzBot

MBS-10536: Release group link "see all versions of this release" has span.name-variation https://tickets.metabrainz.org/browse/MBS-10536
10:56 AM
CatQuest

:| check out the link in this popular link section https://community.metabrainz.org/t/alias-name-v... there is a mojibake link
10:57 AM
i thought at first that it was a topic asking about this mojibake, but it turns out it's fine on bookbrainz - is it possible to fix this on the community side?
11:02 AM
reosarevok

Freso: do you know? ^ :)
11:23 AM
BrainzGit

[listenbrainz-server] dependabot-preview[bot] opened pull request #1116 (master…dependabot/pip/ujson-3.2.0): Bump ujson from 1.35 to 3.2.0 https://github.com/metabrainz/listenbrainz-serv...
11:24 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1117 (master…dependabot/pip/pytest-6.1.0): Bump pytest from 6.0.1 to 6.1.0 https://github.com/metabrainz/listenbrainz-serv...
11:24 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1118 (master…dependabot/pip/pyspark-3.0.1): Bump pyspark from 2.4.5 to 3.0.1 https://github.com/metabrainz/listenbrainz-serv...
11:25 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1119 (master…dependabot/pip/py4j-0.10.9.1): Bump py4j from 0.10.9 to 0.10.9.1 https://github.com/metabrainz/listenbrainz-serv...
11:25 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1120 (master…dependabot/pip/numpy-1.19.2): Bump numpy from 1.19.1 to 1.19.2 https://github.com/metabrainz/listenbrainz-serv...
11:25 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1121 (master…dependabot/pip/eventlet-0.28.0): Bump eventlet from 0.26.1 to 0.28.0 https://github.com/metabrainz/listenbrainz-serv...
11:25 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1122 (master…dependabot/pip/psycopg2-binary-2.8.6): Bump psycopg2-binary from 2.8.5 to 2.8.6 https://github.com/metabrainz/listenbrainz-serv...
11:26 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1123 (master…dependabot/pip/spotipy-2.16.0): Bump spotipy from 2.14.0 to 2.16.0 https://github.com/metabrainz/listenbrainz-serv...
11:26 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1124 (master…dependabot/pip/coverage-5.3): Bump coverage from 5.2.1 to 5.3 https://github.com/metabrainz/listenbrainz-serv...
11:26 AM
[listenbrainz-server] dependabot-preview[bot] opened pull request #1125 (master…dependabot/pip/pygments-2.7.1): Bump pygments from 2.6.1 to 2.7.1 https://github.com/metabrainz/listenbrainz-serv...
11:26 AM
_lucifer

lol is dependabot also participating in hacktoberfest
12:55 PM
iliekcomputers

that's me delegating my hacktoberfest spam to a bot
12:55 PM
this year's t-shirt actually looks pretty good tbh
13:15 PM
_lucifer

lol, yeah that's true
13:16 PM
pristine___: i am getting prediction scores in the range -10 to 10 now, so the prediction scores in themselves probably do not make much sense
13:17 PM
earlier i was thinking that this is just due to input being not scaled but that turns out to be wrong.
13:17 PM
pristine___

_lucifer: now? After normalization
13:17 PM
alastairp

I'm guessing that we're not going to have a meeting on Monday due to tradition?
13:18 PM
_lucifer

pristine___: yes
13:18 PM
pristine___

> earlier i was thinking that this is just due to input being not scaled but that turns out to be wrong.
13:18 PM
Yeah.
13:18 PM
I read something on CF scores (pyspark)
13:18 PM
Lemme see if I have a link!
13:19 PM
TOPIC: MetaBrainz Community and Development channel | MusicBrainz non-development: #musicbrainz | Channel is logged; see https://musicbrainz.org/doc/IRC for details | Summit 20: https://wiki.musicbrainz.org/MusicBrainz_Summit/20
13:19 PM
_lucifer: if the scores don't have a meaning, then the candidate set itself a set of user recommendations :(
13:19 PM
Is a *
13:20 PM
_lucifer

pristine___: no, i mean the scores do have a meaning, but only a relative one
13:21 PM
9.0 > 8.0 for this time but it may or may not be for the next run
13:21 PM
pristine___

Right
13:22 PM
What I think is we shouldn't show scores to users?
13:22 PM
Just the rec and inputs for feedback
13:23 PM
_lucifer

yeah makes sense, just sort based on the score but do not show it to the user
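
(Editor's note: a minimal sketch of the conclusion above, assuming each recommendation is a `(recording, score)` pair. The score is used only to order the list and is dropped before anything is shown to the user. The function name and the sample data are hypothetical.)

```python
def rank_without_scores(scored_recs):
    """Order recommendations by score, highest first, then drop the scores."""
    ranked = sorted(scored_recs, key=lambda rec: rec[1], reverse=True)
    return [recording for recording, _score in ranked]

# Hypothetical scores from a single run; only their relative order matters.
run = [("rec_a", 8.0), ("rec_b", 9.0), ("rec_c", -3.5)]
ordered = rank_without_scores(run)
print(ordered)
```

Because only the ordering reaches the user, a change in the score range between runs does not affect what is displayed unless the relative order itself changes.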