#metabrainz

/

      • tiago joined the channel
      • tiago has quit
      • tiago joined the channel
      • hibiscuskazeneko joined the channel
      • hibiscuskazeneko has quit
      • dseomn joined the channel
      • gdalfo joined the channel
      • gdalfo has left the channel
      • samj1912 joined the channel
      • hibiscuskazeneko joined the channel
      • CatQuest has quit
      • CatQuest joined the channel
      • drsaunders has quit
      • catblup has quit
      • catblup joined the channel
      • catblup has quit
      • catblup joined the channel
      • ephemer0l has quit
      • kyan has quit
      • d4rkie joined the channel
      • hibiscuskazeneko has quit
      • D4RK-PH0ENiX has quit
      • outsidecontext joined the channel
      • Mineo has quit
      • samj1912 has quit
      • iliekcomputers has quit
      • iliekcomputers joined the channel
      • d4rkie has quit
      • D4RK-PH0ENiX joined the channel
      • rembo10_ has quit
      • rembo10 joined the channel
      • UmkaDK joined the channel
      • rembo10 has quit
      • rembo10 joined the channel
      • drsaunders joined the channel
      • psolanki has quit
      • samj1912_riot[m] has quit
      • suhas2go[m] has quit
      • sagar-kohli[m] has quit
      • Leo_Verto[m] has quit
      • tiago has quit
      • Leo_Verto[m] joined the channel
      • tiago joined the channel
      • tiago has quit
      • tiago joined the channel
      • adhawkins has quit
      • xps2_bo has quit
      • xps2_bo joined the channel
      • drsaunders has quit
      • adhawkins joined the channel
      • sagar-kohli[m] joined the channel
      • samj1912_riot[m] joined the channel
      • suhas2go[m] joined the channel
      • psolanki joined the channel
      • psolanki is now known as Guest23593
      • zas
        Top 50 of requests, excluding ws, search & static for yesterday log file (= 153978184 hits)
      • first number is the number of occurences
      • ruaok
        this list is seriously WTF.
      • line 2, line 46
      • a clearer picture is starting to emerge!
      • alastairp
        I'm a little surprised that the page with the highest number of hits is so "small" (8000/153978184)
      • zas: what if you redo it by removing the last component? (so that we get things like /artist, /recording, /ws/js/edit)
      • I guess some artists take longer to render than others though
      • zas
        Yes, we have a very wide variety of requests, actual number is 8114 over 1017893 (the rest being mostly ws or errors)
      • alastairp
        ah, that sounds more correct
      • why do we have so many http2 requests?!?
      • (and http1.0 requests! are those bots?)
      • zas
        because http2 is much more common now (most browsers support it)
      • 1.0 is very likely scripts
      • yvanzo
        so many spammers…
      • zas
        yvanzo: yes
      • ok let me few minutes to list only GET requests with prefix as asked by alastairp
      • ruaok
        yvanzo: ding! spammers.
      • we've got a ton of spammer traffic hitting us.
      • zas: can you please run a query to check ws vs web traffic?
      • used to be less than 5% for web. I think that figure has gone up.
      • next query: top /user/[name] pages. there is no reason that any user should get more than 10-15 hits in 24 hours.
      • I think a few tweaks according to what we talked about during the meeting are going to have a massive impact.
      • e.g. looking at a user page? robots.txt and you must be logged in.
      • alastairp
        ruaok: yeah, user/bruceluciani4639
      • skeeeetchy
      • ruaok
        given that this is a spam problem, not an inefficiency of our code, we need to act a little harder on spam.
      • yvanzo
        Which percentage of spammers among 355 account/edit?
      • arbenina_ joined the channel
      • ruaok
        yeah, good questions.
      • these are spammers editing their profiles.
      • zas
      • top 20 GET, stripping anything but first element after initial /
      • CatQuest
        [11:38] <ruaok> next query: top /user/[name] pages. there is no reason that any user should get more than 10-15 hits in 24 hours. +1
      • zas
      • top 20, users
      • CatQuest
        221 GET /user/expensiverhythm this also
      • zas
        most to be deleted imho
      • ruaok
        should we start deleting spammers or wait?
      • CatQuest
        how about keeping a list of these "very often GET'd user/* names and if it hits a certain number message people woh can deleteaccounts right away
      • zas
        each profile has to be checked, but most are prolly editors with no edit, or recently created
      • CatQuest
        and keep doing that
      • sothat peopel can check them and delete them accordingly
      • (I kinda don't wnt being able to watching my profile account you have to bel ogged in, i mena i link that all over the place as a n advertisment *for* mb)
      • ruaok
        yep.
      • zas: can you please prepare to run these queries daily, with a report that links back to the user profile?
      • CatQuest
        i cna go trought them if you guys want?
      • erh these zas listed today i mena :)
      • alastairp
        are there things like akismet for user profiles? does it make sense to perform some kind of validation on profile edits as they're made?
      • ruaok
        CatQuest: thanks, but lets leave this to reosarevok, Freso
      • CatQuest
        alright!
      • ruaok
        alastairp: we need one, for sure.
      • zas
        ruaok: ok, i'll look in an efficient way to do it
      • ruaok
        thx
      • alastairp
        I guess it's clear that these are being made manually? since we have captcha
      • CatQuest
        oh no pita about not adding url to my profile tohguh.. i'll be ok with it if "validated" users lie kme and others who literally edit alotds coudl add links and stuff
      • when I say "validated" I mean, autoeditors, all theditors that make loads of editors like hibiscus etc
      • ruaok
        alastairp: or automated captcha solving.
      • zas
        not the same protocol
      • ruaok
        woo, it feels great to finally have some insight.
      • zas: for this query we don't care about protocol, please collapse them.
      • zas
        ok
      • collapsed, top 50
      • ruaok
        great, thanks. Freso, reosarevok. Can you please help out?
      • we need to implement the suggestion from last nights meeting ASfuckingAP.
      • and we need a report that lists all users who have a bio/link, but zero edits.
      • CatQuest
        I remember when.. ian showed me numbers /graps and said that they were spammers
      • and I explaimed we shoudl delete thme
      • and he didn't (and noone esle did either) care sicne they jsut sat there and did nothing just inflating user count
      • alastairp
        thinking out loud here... if captchas reduce the speed at which spammers can make changes, what about putting them on all (or many) forms? profile edit, collection create/edit
      • ruaok
        yup, now that policy is haunting us.
      • CatQuest
        ruaok: they could have critiquebrainz works thoguh
      • alastairp: how aobut NO
      • alastairp
        google has the invisible captcha now. let them trigger it when they think the user is spammy
      • CatQuest
        that is a much better idea
      • ++ for that
      • alastairp
        (can they report to us how many requests triggered it?)
      • ruaok
        I doubt it.
      • outsidecontext has quit
      • making it harder to edit the profile is a good starting point.
      • removing a pile of users would also help a lot
      • alastairp
        it doesn't make sense to add it if we can't see the impact that it has on changes
      • CatQuest
        this is one case where being right all those years ago does not please me :/
      • ruaok
        yeah, agreed.
      • lots of people have shouted me down on spam,but I am going to take a much harder stance now.
      • at least all of this is finally starting to make sense. NONE of the entity pages or otherwise MB centric stuff is showing up in the top.
      • CatQuest
        just! don't make it harder for legitimate users editing things :x
      • ruaok
        CatQuest: I don't plan to. but I don't plan to let people shout me down from taking action.
      • CatQuest
        am I shouting?
      • i'm agreeing with you! infact I say "welcome after!"
      • ruaok
        no.
      • CatQuest
        alright then :)