in #metabrainz

12:47 PM
alastairp

I just built my own copy of it
12:48 PM
first thing that I see is that the Dockerfile copies both cpanfile and cpanfile.snapshot
12:48 PM
I don't know if they both have to be configured
12:49 PM
at the top of cpanfile.snapshot it says "# carton snapshot format: version 1.0"
12:49 PM
ferbncode

In a PR, it was suggested that I build the cpanfile.snapshot... not sure how to do this here
12:49 PM
alastairp

given that we install dependencies with the command `sudo -E -H -u musicbrainz carton install --deployment`
12:50 PM
I suspect that this might be the cause
12:50 PM
I don't know either
12:50 PM
maybe zas or Gentlecat or yvanzo know
12:51 PM
I'm reading this page which seems to say that carton will make this file automatically by reading data from cpanfile
12:52 PM
how much do you know about docker? what I would do at this point is split this RUN command into many commands
12:52 PM
so that I can log into the docker container after the required packages have been installed with apt and play with carton
12:52 PM
ferbncode

okay, I can do that
13:02 PM
alastairp

ferbncode: from http://search.cpan.org/~miyagawa/Carton-v1.0.28...
13:02 PM
The --deployment flag makes sure that carton will only install modules and versions available in your snapshot, and won't fallback to query for CPAN Meta DB for missing modules.
13:03 PM
so it looks like you need to add things to cpanfile; use carton install; copy the generated cpanfile.snapshot; do the install with carton install --deployment
13:05 PM
ferbncode

alastairp: makes sense, thanks :D
13:08 PM
samj1912 has quit
13:10 PM
m0n0g0n joined the channel
13:12 PM
to81 has quit
13:16 PM
to81 joined the channel
13:23 PM
hibiscuskazeneko has quit
13:24 PM
agentsim joined the channel
13:28 PM
agentsim has quit
13:29 PM
alastairp: works the way you suggested. thanks :)
13:30 PM
ferbncode goes add more modules to cpanfile :P
13:30 PM
alastairp

excellent!
13:51 PM
Gore|woerk joined the channel
14:04 PM
to81 has quit
14:04 PM
to81 joined the channel
14:09 PM
to81 has quit
14:28 PM
agentsim joined the channel
14:44 PM
arbenina_ has quit
14:45 PM
d4rkie has quit
14:45 PM
D4RK-PH0ENiX joined the channel
14:45 PM
github joined the channel
14:45 PM
github

[listenbrainz-server] paramsingh opened pull request #186: Remove .kitchen.yml (master...kitchen) https://git.io/vH0bi
14:45 PM
github has left the channel
14:47 PM
github joined the channel
14:47 PM
[listenbrainz-server] mayhem closed pull request #186: Remove .kitchen.yml (master...kitchen) https://git.io/vH0bi
14:47 PM
github has left the channel
14:47 PM
ruaok

iliekcomputers: for simple things like that, just ask me to remove the file. no need for a PR.
14:47 PM
iliekcomputers

ruaok: okay, noted.
14:50 PM
github joined the channel
14:50 PM
github

[messybrainz-server] mayhem closed pull request #20: LB-93: Migrate to Python 3 (master...2to3) https://git.io/vH452
14:50 PM
github has left the channel
14:50 PM
D4RK-PH0ENiX has quit
14:54 PM
github joined the channel
14:54 PM
[listenbrainz-server] mayhem closed pull request #181: Add .travis.yml for continuous integration. (master...travis) https://git.io/vHcrU
14:54 PM
github has left the channel
14:57 PM
D4RK-PH0ENiX joined the channel
14:58 PM
to81 joined the channel
15:02 PM
D4RK-PH0ENiX has quit
15:02 PM
hibiscuskazeneko joined the channel
15:06 PM
to81 has quit
15:07 PM
to81 joined the channel
15:08 PM
D4RK-PH0ENiX joined the channel
15:12 PM
to81 has quit
15:40 PM
github joined the channel
15:40 PM
[listenbrainz-server] paramsingh opened pull request #187: LB-93: Migrate to Python 3 (master...2to3) https://git.io/vHEej
15:40 PM
github has left the channel
15:49 PM
padraic joined the channel
15:51 PM
padraic

I'm new to the xml service and REST queries. Can anyone recommend something useful on linux, or even browser based, to test queries for the xml service on?
15:53 PM
alastairp

the browser is a good start
15:53 PM
Gentlecat

https://www.getpostman.com/
15:56 PM
padraic

@alastairp I can construct a query string and download the file from the browser alright, it's more that I don't know how to do it with a script properly. Thanks, @Gentlecat, I'll take a look at that
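For doing the same thing from a script, here is a minimal Python sketch using only the standard library. It assumes the search endpoint lives under https://musicbrainz.org/ws/2/ and returns XML by default; the function names and the User-Agent string are hypothetical placeholders, not anything from the discussion.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumed root of the web service (version 2).
WS_ROOT = "https://musicbrainz.org/ws/2"


def build_search_url(entity, query, limit=5):
    """Build a search URL for an entity type such as 'artist' or 'release'."""
    params = urlencode({"query": query, "limit": limit})
    return f"{WS_ROOT}/{entity}/?{params}"


def fetch(url, user_agent="example-script/0.1 (you@example.org)"):
    """Download the response body as text.

    The User-Agent value is a placeholder; replace it with something that
    identifies your own script.
    """
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request, timeout=10) as response:
        return response.read().decode("utf-8")
```

Calling `fetch(build_search_url("artist", "nirvana"))` would then return the raw XML as a string, which can be handed to `xml.etree.ElementTree.fromstring` for parsing.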
15:56 PM
github joined the channel
15:56 PM
github

[metabrainz.org] gentlecat opened pull request #282: MEB-93: Add an API endpoint for serving MB JSON dumps (master...json-api) https://git.io/vHEJZ
15:56 PM
github has left the channel
15:57 PM
github joined the channel
15:57 PM
[metabrainz.org] gentlecat opened pull request #283: Switch to Python 3.6 (master...py36) https://git.io/vHEJl
15:57 PM
github has left the channel
15:57 PM
padraic has quit
15:58 PM
alastairp

so much python3 love <3
16:08 PM
antgel joined the channel
16:09 PM
antgel

Hi there, I was referred to here from #musicbrainz. Suffering from 503s in Picard when it does lookups. I understand that there has been some discussion on this - would it be possible to get an update?
16:10 PM
I came by a few weeks ago with the same issue, and I was told that throttling had been eased. But I would expect throttling not to return 503 ;)
16:11 PM
lazka joined the channel
16:16 PM
github joined the channel
16:16 PM
github

[metabrainz.org] gentlecat closed pull request #283: Switch to Python 3.6 (master...py36) https://git.io/vHEJl
16:16 PM
github has left the channel
16:31 PM
to81 joined the channel
16:31 PM
to81 has quit
16:31 PM
to81 joined the channel
16:51 PM
ruaok

antgel: still suffering from spammers sucking up capacity and a host of other inefficiencies in our setup.
16:51 PM
Zas: fuck it. Order two more servers?
16:52 PM
Mineo joined the channel
16:54 PM
Zas: back to caching. Is this something we can do without engineering effort?
17:02 PM
zas

ruaok: i don't know enough about current caching in mbs
17:03 PM
bitmap: ?
17:04 PM
ruaok

Would an http level cache help at all? We have gobs of ram spare...
17:05 PM
zas

This is exactly what i think
17:06 PM
ruaok

That might be so.ethi g you can do without a lot of help from bitmap and yvanzo , right?
17:06 PM
Lo. So.ethi g. #covfefe for life!
17:06 PM
zas

I'm about to leave for dinner, but tomorrow morning (or later this evening if i'm motivated enough) i could evaluate the number of unique queries to get an estimate of cacheability.
17:07 PM
also it would require api changes (somehow) since we would need to support the usual cache management features, and there is still the problem of cache invalidation
17:07 PM
ruaok

There are plenty of requests to pages that are not really dynamic... E.g account creation.
17:07 PM
arbenina_ joined the channel
17:07 PM
Not having to render those might help.
17:08 PM
zas

But since i don't know what exactly is cached or not... it would be great to have bitmap and yvanzo in this discussion, to see where we can improve things
17:09 PM
ruaok

Not sure if caching the ws would be a good goal.
17:09 PM
Since our web pages are so slow.... Cache those?
17:09 PM
zas

I vote to use caches everywhere we can ;) plenty of ram / lack of cpu
17:10 PM
ruaok

Still, we should cache for maximum impact first.
17:10 PM
zas

Of course
17:11 PM
But according to my first analysis, i get 200s on pages that didn't change at all between my queries
17:12 PM
Where i would have expected a 304 Not Modified (and therefore client-side caching)
17:13 PM
we never really investigated this, but imho we don't rely enough on the various cache possibilities, which leads to excessive traffic, and excessive cpu usage
17:13 PM
on the website, and on the web service
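The "not modified" behaviour zas describes is the standard HTTP conditional request: the server tags each response with an ETag, and when the client sends that tag back in If-None-Match, the server answers 304 with an empty body instead of a full 200. A minimal sketch, assuming the ETag is simply a hash of the rendered body (the function names are hypothetical, and this is not how MBS works today):

```python
import hashlib


def make_etag(body: bytes) -> str:
    """Derive a quoted ETag from the response body."""
    return '"' + hashlib.sha256(body).hexdigest()[:16] + '"'


def respond(body: bytes, request_headers: dict) -> tuple:
    """Return (status, headers, payload) for a GET request.

    If the client already holds the current version, answer 304 with an
    empty payload so the cached copy is reused.
    """
    etag = make_etag(body)
    if request_headers.get("If-None-Match") == etag:
        return 304, {"ETag": etag}, b""
    return 200, {"ETag": etag}, body
```

Hashing the rendered body saves bandwidth but not the CPU spent rendering; a cheaper validator (for example one derived from the entity's last edit) would be needed to save both.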
17:13 PM
lazka has quit
17:14 PM
our entities aren't changing that much (in fact, our db is quite static if you look at the number of changes vs the number of entries)
17:16 PM
But proper implementation will need to be done with a proper cache invalidation when entities are modified, i think that's the main issue
17:18 PM
Varnish has tons of features for that: http://book.varnish-software.com/4.0/chapters/C...
17:19 PM
Is anyone around with a good experience of Varnish (or similar tools) ?
17:21 PM
bbl
17:21 PM
flamingspinach joined the channel
17:22 PM
kepstin notes that for mbjs.kepstin.ca, he just runs a server side caching reverse proxy which unconditionally stores stuff for 12h :/
17:23 PM
kepstin

(well, not unconditionally, it doesn't store 503 errors)
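The policy kepstin describes (keep every response for 12 hours, except 503 errors) can be sketched as a small in-memory cache. This is an illustration of the policy only, not kepstin's actual proxy setup; the class name and the injectable clock are assumptions made so the expiry can be exercised without waiting.

```python
import time

TWELVE_HOURS = 12 * 60 * 60  # seconds


class TtlCache:
    """Store responses for a fixed time, but never store 503 errors."""

    def __init__(self, ttl=TWELVE_HOURS, clock=time.monotonic):
        self._ttl = ttl
        self._clock = clock
        self._entries = {}  # url -> (stored_at, status, body)

    def get(self, url):
        """Return (status, body) if a fresh entry exists, else None."""
        entry = self._entries.get(url)
        if entry is None:
            return None
        stored_at, status, body = entry
        if self._clock() - stored_at >= self._ttl:
            del self._entries[url]  # expired, drop it
            return None
        return status, body

    def put(self, url, status, body):
        """Remember a response unless the upstream was overloaded."""
        if status == 503:
            return
        self._entries[url] = (self._clock(), status, body)
```

Skipping 503s matters here because otherwise a single throttled request would be replayed to every client for the next 12 hours.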
17:26 PM
to81 has quit
17:41 PM
to81 joined the channel
17:44 PM
ruaok

kepstin: it can handle a few million requests per day, yes? We'd be happy to use it.
17:44 PM
Lol
17:45 PM
kepstin

heh, I honestly have no idea. it's just a single DO droplet, but it is behind cloudflare...
17:45 PM
I'd probably run out of disk space for the cache, at least :)
17:45 PM
ruaok

Tease.
17:46 PM
kepstin

I guess what I'm saying is that not all applications actually need data that's guaranteed to be fresh, I wonder if there's a market for an api that might serve a bit older data, but has the rate limits eased
17:50 PM
of course, our current ws design doesn't really suit that, since there's so many parameters available that tweak the results :/
17:54 PM
CatQuest

a "stable.musicbrainz.org"? which has looser rate limits but also older data?
18:01 PM
hibiscuskazeneko has quit
18:03 PM
to81 has quit
18:04 PM
Sophist_UK

I have some experience with caching. Firstly, there are several different types of cache other than web-cache. You can cache: SQL statements (to avoid the SQL server having to recalculate the access plan every time.) You can cache SQL search results. (I am assuming that moving .js / .css etc. onto a content management platform is already done.)
18:06 PM
The trick is to cache the frequently accessed stuff - because that is where you will make savings - and not cache the vast majority which is accessed once a week or less frequently.
18:06 PM
to81 joined the channel
18:08 PM
arbenina_ has quit
18:09 PM
And to know how to invalidate the cache when the underlying data changes.
18:10 PM
Like kepstin I wonder how suitable MBS is for caching. I suspect it has an ultra-long tail.
18:18 PM
ruaok

ultra long tail is my thought. which is why we need daily updated top-URL reports.
18:18 PM
how are those coming, zas?
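A top-URL report of the kind ruaok asks for can be sketched in a few lines of Python. This assumes access-log lines in the common/combined format, where the request line is the first double-quoted field (for example `"GET /artist/... HTTP/1.1"`); it is not zas's actual parser, and the function name is hypothetical.

```python
from collections import Counter


def top_urls(log_lines, n=10):
    """Count requested paths and return the n most frequent as (path, hits)."""
    counts = Counter()
    for line in log_lines:
        parts = line.split('"')
        if len(parts) < 2:
            continue  # no quoted request line, skip
        request = parts[1].split()  # e.g. ['GET', '/path', 'HTTP/1.1']
        if len(request) >= 2:
            counts[request[1]] += 1
    return counts.most_common(n)
```

Comparing the hits in the top N against the total number of requests gives a rough idea of how long the tail is, and therefore how much a cache could save.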
18:20 PM
drsaunders has quit
18:21 PM
github joined the channel
18:21 PM
github

[picard-plugins] rdswift closed pull request #105: Standardized Album Artist Name(s) (1.0...1.0) https://git.io/vHRxW
18:21 PM
github has left the channel
18:33 PM
yvanzo

ruaok, zas: Only compiled templates are currently cached, not pages. Varnish would be a good fit, but it requires purging the cache from within the MBS editing system, just like zas wrote.
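The invalidation requirement yvanzo and zas mention boils down to tracking which cached pages depend on which entity, and dropping those pages when an edit to that entity is applied. A minimal in-process sketch of that bookkeeping follows; it is a stand-in for illustration only and does not talk to Varnish, and the class, method names, and entity IDs are all hypothetical.

```python
from collections import defaultdict


class PurgeableCache:
    """Page cache that can drop every page depending on a given entity."""

    def __init__(self):
        self._pages = {}                         # url -> rendered body
        self._urls_by_entity = defaultdict(set)  # entity id -> urls

    def store(self, url, body, entity_ids):
        """Cache a page and record the entities it was rendered from."""
        self._pages[url] = body
        for entity_id in entity_ids:
            self._urls_by_entity[entity_id].add(url)

    def get(self, url):
        """Return the cached body, or None on a miss."""
        return self._pages.get(url)

    def purge_entity(self, entity_id):
        """Drop all pages that depend on the edited entity.

        Returns the number of URLs that were tracked for that entity.
        """
        urls = self._urls_by_entity.pop(entity_id, set())
        for url in urls:
            self._pages.pop(url, None)
        return len(urls)
```

The editing system would call something like `purge_entity` whenever an edit is applied; with a real HTTP cache in front, that call would translate into a purge or ban request instead of a dictionary update.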
18:35 PM
ruaok

now the big question: what is better: 1) Add caching 2) move templates to react ?
18:39 PM
yvanzo

both
18:39 PM
drsaunders joined the channel
18:45 PM
zas

Yes, i still need to generate a daily page, but the parser is working
18:48 PM
Quesito

Get Hip Folks: Social Media Banner Contest https://community.metabrainz.org/t/social-media...
18:49 PM
reosarevok

What about a social media banning contest?
18:50 PM
Can we ban ruaok from social media? He posts pics that make me jealous