Just the FAQs
Blog and Website help
for non-geeks

Check out our
Package Deals

Home |Buy eBooks |




A new series of
WordPress books
is here!

Subscribe to the blog feed
for useful tips and to be notified when more new books become available.

 Subscribe in a reader

Subscribe by Email



Bloggers and Web site owners use many different tricks to curtail having their contact information harvested by spam bots while still making it easily available to humans. Spam bots are a spin-off of the same sort of automated programs used by search engines, which are called spiders because they crawl the Web gathering information about site pages. The spiders collect data primarily from meta tags in the page’s code. These include the description and keywords meta tags. Spam bots, on the other hand, look through the page’s code in search of terms like “mailto” or the “@” symbol. These are dead giveaways that an email address is contained in that part of the code.

The simplest way to foil a spam bot is to create a string of characters that will confuse the bot but be intelligible to a human reader. For example, the email address me@mysite.com can be rewritten as me (at) mysite [dot] com. This keeps the bot from simply copying any useful data directly. A special programming language like Javascript can be used to make a similar string inside the code. As the script is processed, it strips out the extra characters and only displays the email address to the viewer. This makes the code unintelligible to the bot, but appears normally to the viewer. One such service that stays a step ahead of the bots is The Enkoder Form from Hivelogic.

Adding extraneous characters slowed down spam for a while, but bots have become smarter too and can sometimes tease out the actual email address from the simple strings. Most blog templates, such as those used on Blogger, allow HTML and Javascript code to be inserted in a widget that will appear in the sidebar. One of the best ways you can protect your email address is to have it hidden in a string that will appear normally to the viewer, as mentioned above. It’s not 100% foolproof, but will certainly diminish your spam.

A new trick to protect contact forms on Web sites was created by folks at IBM and is called CAPTCHA, which is an acronym that stands for “Completely Automated Public Turing test to tell Computers and Humans Apart." It displays a random image of mangled words or numbers that a bot cannot read but a human can. When the viewer types in the correct words, the form is processed and sent to the owner. Advancements in image recognition software are making it possible for bots to now read these words. This means that CAPTCHA will eventually fail as a foolproof method to stop the harvesting of email addresses.

The latest twists on random word image CAPTCHA methods are simple math questions such as “2 + 3 = ?” where the viewer fills in the blank, and a series of dog and cat images where the viewer is asked to select only the cats. These methods are still proving to be problematic for many bots, but eventually the math code will be decipherable. It will take longer for the bots to figure out how to deal with the cats and dogs.

Another way to protect an email address on a contact form is to hire a programmer who is fluent in the PHP and/or the Javascript languages and can create special code that will validate the fields in the form. Usually this method double checks to ensure a viewer includes critical information such as their name or email address before processing the form. Bots don’t tend to leave any fields empty because the form may not process properly without them. A coder can insert a field that is hidden from a human viewer and then check to see if a bot filled it with information. Since the human couldn’t fill in a field they couldn’t see, the form is not processed because a bot likely supplied the information in the hidden field.

For bloggers, even if you protect your email address by using code in a widget, it could still be showing up in ways you might not be aware. For instance, if you offer an RSS feed for your posts, your email address may be displayed in the author section of the feed just below the title. You won’t see this on your actual post, just on the feed. It is being provided from your profile information. If you have a Blogger account, it’s easy to check the information in your profile. Instructions follow below, but there’s something to keep in mind about using this method.

It will remove your email address from your RSS feed and will display the following by line information: “by noreply@blogger.com (your name)”, where “your name” is the display name you chose in your profile. Electing to not display your email address will also remove it from your profile information that is publicly displayed. This means that if your blog page is your only Web presence, you will need to display your contact information in the sidebar. If your blog is a supplement to a Web site, then you can simply include your site URL in your profile as well as a link to the site URL in the sidebar. This will direct folks to the contact form on your site where you have better anti-spam security.

To check your profile information on Blogger, do the following:

  • Log in to your Blogger account.
  • From the Dashboard, in the left corner, click “Edit Profile”.
  • In the “Privacy” section, ensure the box for “show my email address” is unchecked.
  • You do not need to make any changes to the email address you have listed in the “Identity” section.
  • Click the “Save Profile” button at the bottom of the page.

Another important thing to keep in mind is that if your current email address is already being deluged with spam, these methods won’t stop spammers who already have it from sending you more unwanted emails. If you decide to use one of the tricks listed here, you’ll want to do it on a new email address.

No matter what method you use to curtail spam, the bot creators are continually improving their methods to gain access to your email address. It’s likely that you will need to update your method of protection every two years or less.


 

Share/Save/Bookmark

Download a PDF of this article

 
Home |Buy eBooks |

copyright 2010 Just the FAQs