#metabrainz

/

0:01 AM
D4RK-PH0ENiX has quit

2016-04-30 12122, 2016

0:09 AM
minteria has quit

2016-04-30 12154, 2016

0:10 AM
CallerNo6

I've seen e.g. https://github.com/rembo10/headphones/wiki/Musicb…

2016-04-30 12102, 2016

0:12 AM
CallerNo6

... which doesn't have much to do with the MB community, but presumably it helps /somebody/ :-)

2016-04-30 12142, 2016

0:25 AM
MBJenkins

Project musicbrainz-server_master build #394: ABORTED in 13 sec: https://ci.metabrainz.org/job/musicbrainz-server_…

2016-04-30 12104, 2016

0:26 AM
bitmap

grrr I keep getting logged out of ci so I can't cancel things before they start

2016-04-30 12106, 2016

0:26 AM
D4RK-PH0ENiX joined the channel

2016-04-30 12125, 2016

0:27 AM
bitmap

if I accidentially visit http, then switch to https, I'm logged out

2016-04-30 12117, 2016

0:29 AM
Gentlecat

there's a "Whitelist Target Branches" option

2016-04-30 12128, 2016

0:29 AM
Gentlecat

maybe we can just set it to master?

2016-04-30 12153, 2016

0:29 AM
Gentlecat

ok, done

2016-04-30 12104, 2016

0:30 AM
bitmap

what does it do?

2016-04-30 12121, 2016

0:30 AM
Gentlecat

ideally not build pull requests to schema-change-2016-q2

2016-04-30 12139, 2016

0:30 AM
bitmap

cool

2016-04-30 12148, 2016

0:31 AM
Gentlecat

"Adding branches to this whitelist allows you to selectively test pull requests destined for these branches only. Supports regular expressions (e.g. 'master', 'feature-.*')."

2016-04-30 12100, 2016

0:33 AM
bitmap

ah, so only prs to master will be built, that sounds good

2016-04-30 12155, 2016

0:34 AM
bitmap

or ^(master|beta|production)$ if that actually works

2016-04-30 12131, 2016

0:36 AM
linuxrocks

I read the policy regarding the Internet Archive storage, which lists it for historical, research purposes and fair use - sounds good to me. Then the MB site says use images at your own risk, I'm assuming they are referring to commercial interests?

2016-04-30 12132, 2016

0:40 AM
bitmap

basically any redistribution or non-private use of the images is at your own risk, since nobody owns the copyrights to them

2016-04-30 12116, 2016

0:41 AM
bitmap

nobody meaning MB or the IA

2016-04-30 12132, 2016

0:42 AM
linuxrocks

I just want to upload much of the content I have, encouraging others to as well - to make a more complete archive. But I don't want to cause any issues with MB or IA.

2016-04-30 12122, 2016

0:43 AM
linuxrocks

Maybe I could add a coffee stain to the images and call it my art ;-)

2016-04-30 12135, 2016

0:43 AM
bitmap

ah, I wouldn't worry about that. if the IA gets a complaint, they'll take the images down (it's happened before)

2016-04-30 12109, 2016

0:45 AM
JesseW joined the channel

2016-04-30 12158, 2016

0:45 AM
linuxrocks

ahh so there's no guarantee that if I spend time to do this, it will *stick*. Can images be added to a release through an API? This way I could more easily upload it in a batch sort of way?

2016-04-30 12153, 2016

0:47 AM
bitmap

well, by happened before, I mean 0.something% of all uploads :)

2016-04-30 12106, 2016

0:49 AM
bitmap

there's not an API to do this, unfortunately

2016-04-30 12149, 2016

0:49 AM
CallerNo6

who doesn't want to be archived? that's crazytalk.

2016-04-30 12107, 2016

0:50 AM
linuxrocks

OK, got it. Does the original uploader of an image get notified if it get's deleted? Actually how does this work, doesn't MB just store a link to to the image at IA (CAA). How does MB know that the IA has deleted an image?

2016-04-30 12109, 2016

0:51 AM
bitmap

we don't, actually. there's a way to get a list of deleted images from the IA (in a bug report somewhere), but it hasn't been implemented yet

2016-04-30 12150, 2016

0:51 AM
bitmap

it hasn't happened frequently enough to be a development priority, I guess

2016-04-30 12111, 2016

0:52 AM
linuxrocks

makes sense

2016-04-30 12101, 2016

0:53 AM
Gentlecat

bitmap: added beta and production to that list

2016-04-30 12110, 2016

0:53 AM
bitmap

thx

2016-04-30 12139, 2016

0:54 AM
bitmap

about an API for uploads, the main problem is that the CAA was designed so that images never pass through MB servers

2016-04-30 12131, 2016

0:56 AM
bitmap

but only MB has the keys necessary to sign uploads directly to the IA

2016-04-30 12114, 2016

1:21 AM
bitmap

Gentlecat: seems to be working :)

2016-04-30 12122, 2016

1:21 AM
Gentlecat

\o/

2016-04-30 12109, 2016

1:22 AM
JesseW has quit

2016-04-30 12159, 2016

1:23 AM
JesseW joined the channel

2016-04-30 12139, 2016

1:28 AM
JesseW has quit

2016-04-30 12152, 2016

1:59 AM
bitmap has quit

2016-04-30 12113, 2016

2:07 AM
bitmap joined the channel

2016-04-30 12133, 2016

2:08 AM
QuoraUK has quit

2016-04-30 12147, 2016

2:53 AM
zas

bitmap: ping

2016-04-30 12144, 2016

2:56 AM
zas

ah nvm, astro / was full, i removed >280Gb of nginx logs (only .gz),

2016-04-30 12141, 2016

3:00 AM
zas

http://stats.metabrainz.org/dashboard/db/musicbra…

2016-04-30 12156, 2016

3:01 AM
zas

http://stats.metabrainz.org/dashboard/db/server-o…

2016-04-30 12112, 2016

3:02 AM
zas

How come astro alone was using so high bandwidth ?

2016-04-30 12101, 2016

3:04 AM
zas

And that's not nginx according to http://stats.metabrainz.org/dashboard/db/all-ngin…

2016-04-30 12133, 2016

3:27 AM
Gentlecat

bitmap: can you take a look at https://github.com/metabrainz/pytools/pulls if you have some time?

2016-04-30 12132, 2016

3:41 AM
bitmap

Gentlecat: sure thing

2016-04-30 12147, 2016

3:41 AM
Gentlecat

thanks!

2016-04-30 12112, 2016

3:43 AM
bitmap

zas: maybe a full db export?

2016-04-30 12143, 2016

3:48 AM
bitmap

there was a --with-full-export running, so I guess

2016-04-30 12157, 2016

3:49 AM
bitmap

the 30d range makes that easier to see

2016-04-30 12140, 2016

4:32 AM
yeeeargh joined the channel

2016-04-30 12158, 2016

5:06 AM
neersighted has quit

2016-04-30 12108, 2016

5:09 AM
neersighted joined the channel

2016-04-30 12108, 2016

5:19 AM
neersighted has quit

2016-04-30 12124, 2016

5:39 AM
JesseW joined the channel

2016-04-30 12136, 2016

5:58 AM
kanha has quit

2016-04-30 12124, 2016

6:03 AM
neersighted joined the channel

2016-04-30 12104, 2016

6:04 AM
darwin

super minor formatting issue in notification mail, missing space : "Label "2 Swords"(Copy Paste Soul's personal imprint) - merged by edit #38369836

2016-04-30 12118, 2016

6:04 AM
darwin

there should be a space after " and before (

2016-04-30 12102, 2016

6:12 AM
linuxrocks is now known as linuxrocks_

2016-04-30 12148, 2016

6:20 AM
linuxrocks_ is now known as linuxrocks

2016-04-30 12130, 2016

6:23 AM
linuxrocks has left the channel

2016-04-30 12128, 2016

6:25 AM
linuxrocks joined the channel

2016-04-30 12146, 2016

6:25 AM
linuxrocks has left the channel

2016-04-30 12128, 2016

6:26 AM
linuxrocks joined the channel

2016-04-30 12102, 2016

6:33 AM
JesseW has quit

2016-04-30 12128, 2016

6:36 AM
regagain joined the channel

2016-04-30 12101, 2016

7:34 AM
kanha joined the channel

2016-04-30 12128, 2016

7:54 AM
UmkaDK has quit

2016-04-30 12118, 2016

8:27 AM
zas

bitmap: then this export may have failed, due to lack of disk space, did you verify it ?

2016-04-30 12123, 2016

8:50 AM
ruaok

so, uhm, who is responsible for the local weather. I'd like to file a report. :(

2016-04-30 12150, 2016

8:51 AM
zas

Hey good morning Rob !

2016-04-30 12157, 2016

8:51 AM
ruaok

morning!

2016-04-30 12118, 2016

8:53 AM
zas

Finally at home ?

2016-04-30 12147, 2016

8:53 AM
ruaok

yep, got home last night.

2016-04-30 12110, 2016

8:54 AM
ruaok

and all I want to do is go for a nice long ride in the spanish sun. except there is no sun today. oh well, manaña. :)

2016-04-30 12142, 2016

9:02 AM
zas

ruaok: i thought about something about number of IPs we need at NewHost, having more IPs is good when it comes to the number of sockets restrictions, which are (source ip, source port, dest ip, dest port) tuple (do you remember ernie/bert issues related to this at some point). Since we want to be able to handle a lot of simultaneous connections, better

2016-04-30 12142, 2016

9:02 AM
zas

spread services on different IPs (and hostnames).

2016-04-30 12157, 2016

9:05 AM
zas

Also i think about redis HA setup, we'll need another server for that (one redis master and, at least, one redis slave, basically the setup is based on HAProxy + redis sentinel, and is quite simple. It will solve a reliability issue we currently have (that is losing redis master). bitmap said mbs doesn't handle well (at all) redis master failures.

2016-04-30 12134, 2016

9:07 AM
ruaok

make sense to me. just keep adding these thoughts to the doc.

2016-04-30 12155, 2016

9:07 AM
zas

ok

2016-04-30 12101, 2016

9:08 AM
ruaok

now as far as redis masters... the use case for redis is so bizarre that most machines don't fit really well for that.

2016-04-30 12115, 2016

9:08 AM
ruaok

ideally we'd need 64GB with one or two cores.

2016-04-30 12119, 2016

9:08 AM
ruaok

really weird setup.

2016-04-30 12139, 2016

9:08 AM
zas

i think redis makes good use of more cores, to be verified

2016-04-30 12140, 2016

9:08 AM
ruaok

so, the question in my mind is this: what do we use that is CPU intensive, but not memory intensive?

2016-04-30 12159, 2016

9:08 AM
ruaok

making good use and using them all are two different things no?

2016-04-30 12109, 2016

9:09 AM
zas

Yes ;)

2016-04-30 12114, 2016

9:09 AM
ruaok

a memcached machine is bored most of the time, but with massive memory use.

2016-04-30 12139, 2016

9:09 AM
ruaok

I'd like to find a complementary task, that also needs to be HA, that we can stick on those servers with redis for better utilization.

2016-04-30 12150, 2016

9:09 AM
zas

This is why we can run cpu intensive processes (but not memory intensive processes) on the same machines

2016-04-30 12101, 2016

9:10 AM
ruaok

exactly.

2016-04-30 12113, 2016

9:10 AM
zas

indexer is one (on jiji)

2016-04-30 12125, 2016

9:10 AM
ruaok

building indexes is one, but that will go away hopefully soon.

2016-04-30 12134, 2016

9:10 AM
ruaok

what are our future, long term use cases?

2016-04-30 12138, 2016

9:10 AM
zas

perhaps compression (backups, logs ?)

2016-04-30 12157, 2016

9:10 AM
ruaok

logs, quite possibly.

2016-04-30 12113, 2016

9:11 AM
zas

logs would fit

2016-04-30 12131, 2016

9:11 AM
zas

we need HA centralized logs storage

2016-04-30 12153, 2016

9:12 AM
zas

if logs are written through network, and not stored on most machines, it means we can go for small and fast non-RAID SSD for web servers ie?

2016-04-30 12102, 2016

9:13 AM
ruaok

just as long as we don't impement that before the move. after the move, yes.

2016-04-30 12115, 2016

9:13 AM
zas

Same goes for gateways

2016-04-30 12125, 2016

9:13 AM
ruaok

why SSD?

2016-04-30 12135, 2016

9:13 AM
ruaok

HDD should be sufficient.

2016-04-30 12154, 2016

9:13 AM
zas

Oh yes, i mean HDD

2016-04-30 12158, 2016

9:13 AM
ruaok

ok.

2016-04-30 12105, 2016

9:14 AM
ruaok

astro disk is usually < 5%

2016-04-30 12125, 2016

9:14 AM
zas still under the needed coffee level

2016-04-30 12126, 2016

9:14 AM
ruaok

but yes, non RAID HDD on app servers.

2016-04-30 12131, 2016

9:14 AM
ruaok

:)

2016-04-30 12148, 2016

9:14 AM
zas

We need to think about backups

2016-04-30 12101, 2016

9:15 AM
ruaok

not coffee??

2016-04-30 12125, 2016

9:15 AM
zas

NewHost may not have someone to rotate our USB drives ;)

2016-04-30 12135, 2016

9:15 AM
ruaok

oh, yes that concept is dead.

2016-04-30 12105, 2016

9:16 AM
ruaok

but, given that EACH machine has a massive bandwitdh budget, we take our backups machine and give it a public IP.

2016-04-30 12120, 2016

9:16 AM
ruaok

then blast backups as fast as we can to google cloud or glacier.

2016-04-30 12125, 2016

9:16 AM
ruaok

probably both.

2016-04-30 12134, 2016

9:16 AM
ruaok

google cloud and then once a week to glacier as well

2016-04-30 12140, 2016

9:16 AM
zas

Yup, may be a soluton

2016-04-30 12145, 2016

9:16 AM
zas

solution*

2016-04-30 12155, 2016

9:16 AM
zas

ok coffee ;)

2016-04-30 12155, 2016

9:16 AM
ruaok

for super fast retrieval from google cloud, and archival on glacier.

2016-04-30 12158, 2016

9:16 AM
ruaok

bai!

2016-04-30 12138, 2016

9:17 AM
zas

About number of web servers... i did some rough calculation, i would like to redo with you

2016-04-30 12157, 2016

9:17 AM
ruaok

ok.

2016-04-30 12110, 2016

9:18 AM
ruaok

the good thing about the vrack is that we can start smaller and expand when we want to.

2016-04-30 12125, 2016

9:18 AM
ruaok thinks scalability

2016-04-30 12143, 2016

9:18 AM
ruaok

search, web servers and postgres can scale easily with the new setup.

2016-04-30 12106, 2016

9:19 AM
ruaok

gateways, when done properly will have loads of capacity and will scale nicely on a 2 host setup for quite some time.

2016-04-30 12126, 2016

9:19 AM
ruaok

we also need to think about what traffic we want to serve.

2016-04-30 12138, 2016

9:19 AM
ruaok

I'm not interested in spending a lot of hosting resources on headphones.

2016-04-30 12149, 2016

9:22 AM
ruaok

speaking of headphones, the new rate limiting needs to have per-app limits.

2016-04-30 12112, 2016

9:28 AM
zas

Coffee ready.

2016-04-30 12128, 2016

9:28 AM
zas

Ok, based on current incoming reqs

2016-04-30 12132, 2016

9:32 AM
ruaok needs to pop out before the old-catalan-lady-brigade ddos'es every available veggie shop in the area

2016-04-30 12134, 2016

9:32 AM
zas

we have 1k req/s incoming

2016-04-30 12129, 2016

9:33 AM
zas

around 65% hit our web servers

2016-04-30 12156, 2016

9:33 AM
zas

and 55-60% of those are rate limited (503s)

2016-04-30 12118, 2016

9:34 AM
zas

so we actually return ~270 req/s as 20x

2016-04-30 12106, 2016

9:35 AM
zas

so let's say we want to double that, and convert most 503s in 200s

2016-04-30 12126, 2016

9:36 AM
zas

it will double load on web servers, so if we want to keep same load with same hardware it would mean having x2 web servers (we have 5) -> 10

2016-04-30 12107, 2016

9:37 AM
zas

but new web servers will be more performant, likely at least 30%, -> 7

2016-04-30 12119, 2016

9:37 AM
ruaok

considerably more, I would think.

2016-04-30 12141, 2016

9:37 AM
ruaok

these machines were given to us in 2010 and were several years old.