Now that I am back home and rested, it is time to share an
amusing story... as
Randy noticed, I got some comment spam on Monday, all
referencing an online gambling site.
32 comments in the course of 65 minutes. The last 9 of
which were not seen by anybody as I had blocked the ip address by
65 minutes to create. Carefully crafted to appear to be on
topic. 10 seconds to wipe out.
Others may disagree, but this strikes me as an important problem that we should meet now rather than later.
I don't think that IP banning will be very effective, unless the information is centralized in a service, and controlled in a reasonable manner. It's just too easy for a spammer to change to a different IP source.
The bayesian approach is going to work better with large chunks of input text, and blog comments are typically fairly small. Plus, the comment spammers are already specifically tailoring spams to appear 'hammy'.
The main target to filter on is going to be the URLs that the spammers are trying to promote. Those can be blacklisted much more effectively. Another idea I'm kicking around is to obfuscate URLs in comments by randomly replacing characters with their numeric entity equivalents. Also, this may push more blogs towards encouraging users to go through some sort of optional registration/verification process. A verified user would bypass the filtering.
The WordPress team is looking at these issues and will probably implement several of these strategies to work in concert. I invite further discussion on my War on Spam site:
As Dougal noted, blocking the spam URLs is the key. MT-Blacklist, new plugin by Jay Allen does just this. It seems to work great. I have created a blog spam database where people can share information about spam URLs and other spam sources.
Grr, ahora está llegando acá también la mafia de los spams en comentarios, algo acerca de lo que había leído, pero que no había visto aún en sitios en español. Hello from the USA. My spanish is not so good......
I've implemente a throttle on comments to prevent runaway spammers. One of the major topics to du Jour is spam comments. There are those who are wildly optimistic and others who are wildly pessimistic... (I tend to the optimistic side myself - the tr...
I'm getting my fair share of comment spam like many other bloggers, but I can't imagine that the cost/time ratio is actually worth it. I think Sam Ruby sums it up best: "65 minutes to create. Carefully crafted to appear......