Hm. Okay okay. I was able to access the url so I asked. Thanks
2020-05-14 13516, 2020
alastairp
sure, I guess our webserver accepts requests to all domains
2020-05-14 13525, 2020
shivam-kapila
Got it
2020-05-14 13541, 2020
ruaok
yvanzo: I'm reading through the updated docs for musicbrainz-docker and I've got two questions:
2020-05-14 13516, 2020
ruaok
1. how do I expose the port for PG only? Can I set an alternate port number other than 5432?
2020-05-14 13544, 2020
ruaok
2. How do I tune PG? I need to set more shared_buffers.
2020-05-14 13505, 2020
yvanzo
ruaok: for 1. create a file local/compose/publish-db-port.yml (see compose/publishing-all-ports.yml for example) and run admin/configure add local/compose/publish-db-port.yml; docker-compose up -d
2020-05-14 13533, 2020
yvanzo
2. cannot be done without building your own image for db service, should be added.
2020-05-14 13541, 2020
ruaok
thanks for #1.
2020-05-14 13523, 2020
ruaok
#2 could a drastic issue we need to address. stock PG is dog slow. :(
2020-05-14 13545, 2020
ruaok
shared_buffers = 512MB
2020-05-14 13507, 2020
ruaok
if we're recommending a machine of 16GB, should we set that to 4096MB as default?
2020-05-14 13536, 2020
yvanzo
Maybe but 16GB was mainly required for live indexing.
2020-05-14 13508, 2020
ruaok
well, we can argue over defaults and what-not, but we need a way to tweak the settings.
2020-05-14 13536, 2020
ruaok
because a noob will not know that they need to tune DB and will get the impression that our shit is dog slow.
2020-05-14 13539, 2020
yvanzo
Working on a patch, should be quick.
2020-05-14 13545, 2020
ruaok
<3
2020-05-14 13502, 2020
ruaok
no rush. I've fixed my setup.
2020-05-14 13509, 2020
ruaok
(but I know how)
2020-05-14 13522, 2020
ruaok
the admin/configure stuff is really nice too.
2020-05-14 13519, 2020
yvanzo
the idea is have a file like default/indexer.ini that could be appended to db configuration.
oh man. Mr_Monkey userscript that can pre-fill language twice on https://bookbrainz.org/work/create?author=23f213e… pls (I need to add 28+ works for small short-stries and it means I have to type "norw" and select from a drop down 56 times )
2020-05-14 13509, 2020
ishaanshah[m]
If we remove artist_msid from group_by I think we can fix a part of LB-547
i'm not completely sure we want to do that calculation on names
2020-05-14 13552, 2020
iliekcomputers
different artists can have same names
2020-05-14 13558, 2020
ishaanshah[m]
Another part is duplicates because of different names
2020-05-14 13510, 2020
iliekcomputers
ideally the bug would be fixed by messybrainz
2020-05-14 13520, 2020
iliekcomputers
i'd not worry about it tbh.
2020-05-14 13547, 2020
ishaanshah[m]
Ok
2020-05-14 13517, 2020
ishaanshah[m]
For range queries we should compare the timestamp right
2020-05-14 13544, 2020
iliekcomputers
so if you look at hdfs
2020-05-14 13546, 2020
ishaanshah[m]
Like, WHERE listen.timestamp > min_ts AND listen.timestamp < max_ts
2020-05-14 13549, 2020
pristine__
ishaanshah[m]: that bug will be better handled once we have the mapping right. Artist with two different msids can have same mbid and it leads to weird results. The ideal way is to use mbid everywhere once we have the mapping :)
2020-05-14 13502, 2020
iliekcomputers
the strcuture of the data is data/year/month.parquet
2020-05-14 13513, 2020
iliekcomputers
where month.parquet contains that month's listens
2020-05-14 13526, 2020
iliekcomputers
for a month, you'd just load the month's data and query on that.
2020-05-14 13531, 2020
iliekcomputers
won't need a where
2020-05-14 13541, 2020
iliekcomputers
but for week, yes, the where would make sense.
2020-05-14 13544, 2020
ruaok
pristine__: I'm working on the aa relations right now. once that is built up, I'm going to do the same for the msb mapping
2020-05-14 13557, 2020
ishaanshah[m]
But that won't work for week
2020-05-14 13504, 2020
iliekcomputers
even for week
2020-05-14 13512, 2020
pristine__
ruaok: no hurry :)
2020-05-14 13513, 2020
iliekcomputers
you should load just the month's data and put the where on that.
2020-05-14 13549, 2020
ishaanshah[m]
Ohk
2020-05-14 13501, 2020
ishaanshah[m]
Now suppose I import a months data
2020-05-14 13535, 2020
ishaanshah[m]
and create a temporary view
2020-05-14 13524, 2020
ishaanshah[m]
Then I should first filter out the the weeks listens
2020-05-14 13530, 2020
pristine__
ruaok: I see that we want to send recordings to lemmy that means we don't need a tar.
2020-05-14 13540, 2020
ruaok
iliekcomputers: thoughts on where this code should live? it uses the MB database to calculate relationship info that will be used in LB recommendation stuff. I'm inclined to make a new top-level dir in listenbrainz-server called `relations` and stuff put the code there.