You are viewing Revision 6 of Spam

WikiSpam is a wikiwide problem. It won't be solved but wikiwide. (see http://c2.com/cgi/wiki?WikiSpam)


I'll be using this page to brainstorm on tactics and techniques to use to combat wiki spam. As you may or may not have noticed, all forms of editing for non-logged-in users has been disabled here on Aneuch Wiki (including the use of discussion pages). This was due to being targeted by various spam bots.

Once I figure out the spam problem, I can re-enable (limited) forms of editing. ~AaronGraves


I'm looking at the possibility of using TextCha for combating wiki spam. ~AaronGraves

In addition to the TextCha, I'd like to implement a banned content feature, similar to something that Oddmuse uses.

Have a look at http://nedbatchelder.com/text/stopbots.html, it's got some interesting tips/ideas to use that I will likely implement into Aneuch 0.30.

07/24/2013 After reading the above linked page and a few comments, I think I've come up with a very good solution. I'll use a honeypot of sorts on all forms in a hope to catch spam bots. This is step one. We'll see how effective that becomes, and go from there.

Proposed Solutions

Here are some solutions I'm proposing for Aneuch. Bold are definites, bold italic I'm still on the fence about.

  • Honeypot
    All forms on the site (except the search form) should have a "honeypot" - that is to say, a field that is invisible to regular users, but any spam bot would see it. If there is data submitted in this field (or modified in a field where data already exists), the edit will be rejected as spam.
  • Link counting
    Count all of the links in a page before edit, and count the number of links after the edit. If more than a certain number have been added, reject the edit as spam. (Note that this should probably only apply to external links)
  • Timestamping
    Introduce an encrypted time stamp into the form, and if it's been too short or too long between the time the form was loaded and submitted, reject the edit as spam.
  • Content filtering
    Implement a content filtering system, whereby a wiki admin can load regular expressions into the filter which, if matched on an edit, the edit is rejected as spam.
  • TextCha
    Implement a text version of the popular captcha, where a user must successfully answer a challenge, and if not correct, the edit is rejected as spam.