<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>JM&apos;s Blog</title>
      <link>http://w2.syronex.com/jmr/blog/</link>
      <description>Web, Society, Technology, and Innovation</description>
      <language>en</language>
      <copyright>Copyright 2007</copyright>
      <lastBuildDate>Sat, 17 Nov 2007 09:22:37 -0600</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=3.2</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

            <item>
         <title>Email Link Versus Contact Form</title>
         <description><![CDATA[<p>There are today two main ways for a visitor of your website to contact you, the site's owner: email links and <a href="http://formsmarts.com/contact-form"  title="Get a Free Contact Form">contact forms</a>.<br />
 <br />
<strong>Rigidity of forms vs flexibility of email links.</strong><br />
There are situations where it is desirable to make sure the <em>recipient</em> and the <em>subject</em> are set to specific values so the message is routed to the appropriate person, team, or sadly, automated-response system.<br />
Forms allow to <em>structure</em> the message into different fields and <em>constrain</em> the values allowed for each field.</p>

<p><strong>Email links are symmetric</strong><br />
Email links are symmetric in two ways. </p>

<p>First because both the sender and the recipient use the same communication protocol. This implies the possibility of conversations (two-way communication) rather than the broadcast of a message to an unknown and unnamed recipient. <br />
As it is always possible that I mistyped my email address in the form, I'm not even sure that the other party will be able (or willing) to reply.</p>

<p>Second, as a side effect of sending a message I retain an exact copy of it. So I can refer to this copy later to either refresh my mind about what exactly I wrote or to prove that I actually sent the message in the first place. Sending an acknowledgment email after the submission of a form may achieve the same effect, but only if it contains the original message.</p>

<p>Note that some websites allow web users to <a href="http://formsmarts.com/weblog/tag/contact-form"  title="Build a HTML Contact Form">create free contact forms</a>.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2007/11/email_link_versus_contact_form</link>
         <guid>http://w2.syronex.com/jmr/blog/2007/11/email_link_versus_contact_form</guid>
         <category>Email</category>
         <pubDate>Sat, 17 Nov 2007 09:22:37 -0600</pubDate>
      </item>
            <item>
         <title>Using Secure Email &amp; Digital Signatures</title>
         <description><![CDATA[<p>Many businesses have a policy to include a lamentable disclaimer at the bottom of every email they send:</p>

<blockquote>
This e-mail and its attachments are confidential and intended solely for the addressees. If you are not the intended recipient of this message, then please delete it and notify the sender. Since the integrity of this message cannot be guaranteed on the Internet, Example, Inc. cannot be considered responsible for its content. </blockquote>

<p>The reason I say this is lamentable is that ensuring the confidentiality of message and the authentication of its sender requires very little effort. All you need to start sending secure emails (S/MIME) is a regular email client such as <a rel="nofollow" href="http://www.mozilla.com/thunderbird/">Thunderbird</a> and a digital certificate. </p>

<p>Several certification authorities (CA) give personal email certificates for free. Thawte has <a rel="nofollow" href="http://www.thawte.com/secure-email/web-of-trust-wot/">scheme</a> that allows accredited individuals to check your identity so that your full name can be included in your digital certificate. From this point you'll be able to digitally sign all your emails; thus allowing your correspondants to: <ol><li>trust that a message was really sent by you and not someone pretending to be you</li><li>verify that a message hasn't been changed by a third party, and therefore that its content can be trusted</li></ol></p>

<p>In order to ensure the confidentiality of messages, you will additionally need that all recipients also own a digital certificate, so that messages can be encrypted. </p>

<p>Digitally signed emails are accepted as legally binding in the US and many countries in Europe.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/08/digital_certificate_secure_email</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/08/digital_certificate_secure_email</guid>
         <category>Security</category>
         <pubDate>Fri, 18 Aug 2006 10:13:35 -0600</pubDate>
      </item>
            <item>
         <title>Buying a SSL Certificate: What Do You Get?</title>
         <description><![CDATA[<p>There are considerable variations in the cost of SSL certificates. But what you get in the end also varies considerably. So what do you actually buy?</p>

<p>You buy trust and brand image.</p>

<p>A SSL certificate is a guarantee from a certification authority that the server presenting the certificate to a browser is indeed who it claims to be, but this means different things for different SSL certificates:<ul><br />
<li>for low cost certificate that means assuring your browser is indeed connecting to the host www.example.com</li><br />
<li>for expensive certificates that means assuring your browser is connecting to www.example.com, which is operated by Example, inc. ; an organization that the authority has checked existed.</li><br />
</ul></p>

<p>In the first case, all the authority validates is that the person who is delivered a certificate has also some administrative rights on the domain example.com. In the second case, they also manually check that the organization Example, inc. has a legal existence.</p>

<p>Beyond that, big names like Verisign and Thawte (now part of Verisign) give you a  logo to show on your site, using their brand name to help you build confidence with your customers.</p>

<p>How much?</p>

<p>High-assurance certificates cost hundreds of USD.  Comodo sells simple domain validated certificates for $15/year.</p>

<p>Can you get SSL certificates for free?</p>

<p>If you only need a secure SSL connection without the authentication aspect because you are just trying to secure your webmail access and know your users, you can generate a self-signed certificate, or get one from CAcert.org. In either cases, users will be asked to accept your certificate before they can proceed. If this is likely to be a problem, go for the $15 option. Some large institutions use self signed certificates and that's really a shame.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/08/buy_ssl_certificate</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/08/buy_ssl_certificate</guid>
         <category>Security</category>
         <pubDate>Tue, 15 Aug 2006 05:02:51 -0600</pubDate>
      </item>
            <item>
         <title>A simple trick to help prevent form spam (2)</title>
         <description><![CDATA[<p>This is a more sophisticated setup of <a href="http://w2.syronex.com/jmr/blog/2005/12/prevent_comment_spam">this blog comment spam prevention</a> tactic. I've implemented this several weeks ago and it seems to work well.</p>

<ol>
<li>After changing the URL of your blog's comment executable, <a href="http://w2.syronex.com/jmr/safemailto/" target="syronex_main">encode the new URL here</a>.
For this, enter your URL where indicted "Step 1: enter your e-mail
address". Copy the code given to you and extract the strings <em>string1</em> and <em>string2</em>, which look respectively like <tt>FPZEVPKWZXDS2M</tt> and <tt>%27%23%3E%237%23/1%3C7+1S%3F</tt>.</li>
<li>Place the following code after the <tt>&lt;head&gt;</tt> tag of the page, after replacing <em>string1</em> and <em>string2</em> by their actual values:
<pre>&lt;script type='text/javascript'&gt;&lt;!--
function blurl(){var v2="<b>string1</b>";var v7=unescape("<b>string2</b>");
var v5=v2.length;var v1="";for(var v4=0;v4&lt;v5;v4++)
{v1+=String.fromCharCode(v2.charCodeAt(v4)^v7.charCodeAt(v4));}return v1}
//--&gt;&lt;/script&gt;</pre>
</li>

<p><li>Locate the comment form in the template of your blog, and remove the <em>action</em> attribute specifying the URL of the executable used to process the form results.</li></p>

<p><li>Finally, add <tt>this.action=blurl()</tt> to the <em>onsubmit</em> handler of the form:<br />
<pre>&lt;form method="post" onsubmit="this.action=blurl()"&gt; ... &lt;/form&gt;</pre><br />
</li><br />
</ol><br />
<b>Updates</b>: There is an <a href="http:///www.syronex.com/antispam/help/form-spam">updated version</a> of this document, and you can now get <strong><a href="http://formsmarts.com/" target="blank">free comment spam protection with FormSmarts</a></strong>. Plus, an easy form builder, a form handler, form hosting and more. All free.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/03/prevent_comment_spam_2</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/03/prevent_comment_spam_2</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Sun, 19 Mar 2006 09:29:35 -0600</pubDate>
      </item>
            <item>
         <title>Unprotected email addresses on blogs get spammed</title>
         <description><![CDATA[<p>How many of the owners of the <a rel="nofollow" target="_blank" href="http://online.wsj.com/public/article/0,,SB111685593903640572,00.html?mod=2_1125_1">10 millions</a> existing blogs show their email address on their blog, ready for spammers to collect?  </p>

<p>A <a rel="nofollow"  href="http://news.bbc.co.uk/2/hi/technology/2969783.stm">study</a> shows that out of the many ways email addresses are collected online, 97% of spam originates from addresses harvest on websites or blogs. Our <a href="http://www.syronex.com/antispam/efficiency-study">own study</a> shows that it takes as little as two days after an email address is published online before it gets spammed.</p>

<p>It's of course legitimate to publish you email address on a personal blog, but there are few reasons why you should not <a href="http://www.syronex.com/antispam/help/blogger">protect it from spam</a>. This page shows you to generate an encoded link and integrate it with your template.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/03/email_on_blog_gets_you_spam</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/03/email_on_blog_gets_you_spam</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Sun, 19 Mar 2006 09:20:32 -0600</pubDate>
      </item>
            <item>
         <title>Choosing a web hosting</title>
         <description><![CDATA[<p>What should you consider when looking for a web hosting provider?</p>

<p>Obviously, the hosting plan must technically fit your needs, and there is a wide choice of offerings ranging from expensive dedicated server hosting to cheap PHP/MySQL hosting. </p>

<p>What makes the differences between the hosting companies has little to do with the disk space you get or how many free software they allow you to use. In my experience, there are two elements that make the difference: how fast is their network, and how responsive and qualified is their support staff. I have seen in this respect huge variations.<br />
Big names are not a guarantee you'll get a good service. For example, although I've never tried Yahoo hosting, Yahoo domains is notoriously very bad. So, if you consider buying the service of a hosting company, I suggest you contact their support staff to find out what kind of reply you get, and how long it takes. <br />
If you can't find how to contact them: run.</p>

<p>Speed information is available for most hosts from websites like <a rel="nofollow" href="http://www.alexa.com">Alexa</a>  and <a rel="nofollow" href="http://www.netcraft.com">Netcraft</a>, so do your homework and check this as well.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/02/web_hosting</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/02/web_hosting</guid>
         <category>Web</category>
         <pubDate>Fri, 24 Feb 2006 18:30:20 -0600</pubDate>
      </item>
            <item>
         <title>Adsense spam</title>
         <description><![CDATA[<p>There two great things about Google's Adsense contextual ads. Firstly, they are not aggressive and quite repectful of users both in terms of content and bandwidth (compare this with Flash ads or popups). Secondly, they are targeted to be as close a possible to the content of the webpage that hosts them, in the hope that the user will find the ads relevant to himself in his particular situation, and will see the advertised websites as a valuable content, rather than as something that is being imposed on him.</p>

<p>Being mostly an automated system however, Adsense is being abused by both publishers and advertisers. The case of publishers is well know. Some sites automatically generate thousands of pages by aggregating search results, replicating content from the Open Directory, Wikipedia, RSS feeds, etc. and generously including Adsense banners among these. Google is certainly working on ways to reduce this particular type of spam, being doubly involved as a seach provider and an advertising network.</p>

<p>The second type of Adsense spam is the act of some ad publishers, and as Google gets revenues from publishers spam, it seems they are far less concerned about it. Type any keyword of no commercial value in Google and you'll see ads like the one below:</p>

<p><img alt="Adsense Spam by Ebay" src="http://w2.syronex.com/jmr/blog/img/adsense-spam-ebay.png" width="326" height="112" /> <img alt="adsense-spam-3.png" src="http://w2.syronex.com/jmr/blog/home/jmrose/hp/img/adsense-spam-3.png" width="150" height="55" /></p>

<p><br />
Adwords allows Ebay and others to buy hundreds of low-value keywords and automatically generate text ad banners. You'll find these not only along Google's search results, but also on other sites publishing Adsense ads. The problem with these ads, of course, is relevance. I'm not aware of idiots being sold on Ebay, but this is what the ad says. Is it what the publisher wants us to believe? probably not. They just are just using those ads as cheap bates to attract you into their site.</p>

<p>Those ads have a negative impact on the content of the hosting site, so that's bad for content publishers that value more the general quality of their site than the cent they'll get for the click. </p>

<p>Some publishers are buying those cheap keywords to publish unrelated ads. That's how I got ads for a religious group on a foreign languages website, and ads for adult content alongside Google search results.<br />
</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2006/02/adsense_spam</link>
         <guid>http://w2.syronex.com/jmr/blog/2006/02/adsense_spam</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Sat, 11 Feb 2006 08:03:34 -0600</pubDate>
      </item>
            <item>
         <title>A simple trick to help prevent form spam</title>
         <description><![CDATA[<p>If you have a blog or a webpage with a guestbook facility, you will have certainly experienced automated posts known as comment/form spam. </p>

<p>There is a simple thing you can try to prevent spammers from posting to your blog: make it more difficult for them to find how to post to your site using automated means. The HTML code for submitting a form looks as follows: <br />
<tt><br />
&lt;form method="post" action="http://www.example.com/bin/comment"&gt;...&lt;/form&gt;<br />
</tt></p>

<p>The <em>action</em> attribute above specifies the web address (URL) of the executable used to process the form results. Simply remove it and add an <em>onsubmit</em> attribute as shown below. <br />
<tt><br />
&lt;form method="post"<br />
onsubmit="this.action='http://www.example.com/'+'bin/post-a-comment'"&gt;<br />
...&lt;/form&gt;<br />
</tt></p>

<p>I have made two changes. I now use a trivial JavaScript snippet to build the URL of the executable, and I have renamed it. The reason for using JavaScript is because comment-spamming programs are most unlikely to be sophisticated enough to understand it. Since once spammers have found a comment posting URL they will keep it and reuse it every now and then,  I have also renamed the executable to start with a fresh URL.</p>

<p><b>Update</b>:  The code above is shown for demonstration purposes only. if you are thinking of implementing this setting, you should use a <a href="http://www.syronex.com/antispam/help/form-spam">more serious procedure</a>. It doesn't involve much more work.</p>

<p><b>Update</b>: You can now get <strong><a href="http://formsmarts.com/" target="blank">free comment spam protection with FormSmarts</a></strong>. Plus, an easy form builder, a form handler, form hosting and more. All free.</p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2005/12/prevent_comment_spam</link>
         <guid>http://w2.syronex.com/jmr/blog/2005/12/prevent_comment_spam</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Thu, 29 Dec 2005 03:29:02 -0600</pubDate>
      </item>
            <item>
         <title>Yahoo domains: read this before you register a domain name with Yahoo</title>
         <description><![CDATA[<p>Yahoo! has a service called Yahoo! Small Business that sells domain names at a very attractive price. If you think of registering a domain through them, read on. And maybe also have a look at <a rel="nofollow" href="http://www.webmasterworld.com/forum25/685.htm ">what others say</a>. </p>

<p>First Yahoo! is not an accredited domain registrar, but simply a reseller for the Australian domain registrar Melbourne IT. If you ever have a problem with you domain registration however, you'll have to deal with Yahoo!, and their support is <em>appalling</em> to an extent I have never met before. They consistently and repeatedly reply to emails with irrelevant pre-made messages: this did not happen to me once or twice, but about fifteen times. </p>

<p>Given my first experience with their support staff, I rushed to <em>transfer</em> my domain to another registrar as soon as the 60-day no-transfer period after registration was over. The transfer was systematically denied. Whenever I got a human to reply, their answer showed a striking ignorance of .com transfer procedures, replying with transfer requirement only applicable to other types of domains (.info,&hellip;).</p>

<p>After three days, I ended up calling them and I was told that they could not do anything and referred me to Melbourne IT. Emailing and calling Autralia several times, I was told that I had to deal with the reseller (Yahoo!) directly.</p>

<p>I don't know at this point if my transfer will finally go through, but I have already spent <em>much more in international calls</em> than even the most expensive domain registrar around (e.g. Network Solutions sell domains at USD35).</p>

<p>If you want a cheap domain registrar with capable support staff, I recommend, for having tested them for years: <br />
<ul><br />
<li>If you only want the basic registration with no options (e.g. private registration), <a rel="nofollow" href="http://www.godaddy.com">GoDaddy</a> will do the job. But beware, these people are master marketers and they will try every trick to sell you additional products.</li><br />
<li><a href="http://gandi.net/" rel="nofollow">Gandi</a> is a bit more expensive mostly owing to the low rate of USD, but their service <em>includes at no extra cost</em> features sold under "private registration" by some other registrars, allowing e.g. to hide your email address in the WHOIS database to avoid spam. They also include virtually unlimited email and web redirections.</li><br />
</ul></p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2005/10/yahoo_domain_registration</link>
         <guid>http://w2.syronex.com/jmr/blog/2005/10/yahoo_domain_registration</guid>
         <category>Web</category>
         <pubDate>Mon, 24 Oct 2005 14:04:19 -0600</pubDate>
      </item>
            <item>
         <title>More about SMS.ac: how good (bad) is their site?</title>
         <description><![CDATA[<p>SMS.ac is a company that uses social engineering and "invitation" spam to enrol new customers. I won't further comment on their business model/practice, but just how good is their website?<br />
I will <a rel="nofollow" href="http://www.jeffooi.com/archives/2005/03/smsac.php">leave to others</a> to say whether their SMS service is any good, but here are some elements of answer gathered from the web:</p>

<p><b>Speed</b><br />
Alexa <a rel="nofollow" href="http://www.alexa.com/data/details/main?q=&url=http://www.sms.ac">rates</a> SMS.ac as "very slow".<br />
<img style="color:#808080;border:1px solid" src="/jmr/blog/img/sms.ac.alexa.png" alt="sms.ac alexa"  height="69" width="364"/></p>

<p><b>Robustness</b> <br />
When using the site, you should expect to occasionally get crude messages reporting "ASP Exceptions", "SQL Server Errors" and other technical faults. It's amusing that Yahoo crawlers caught one of these while indexing the site&hellip;<br />
<img style="color:#808080;border:1px solid" src="/jmr/blog/img/sms.ac.yahoo.png" alt="sms.ac yahoo"  height="59" width="277" /></p>

<p><b>Useability</b> <br />
They don't seem to be aware that 10% of web users don't use Internet Explorer. With Netscape/Mozilla the site is barely useable, with a tiny font.<br />
<img style="color:#808080;border:1px solid" src="/jmr/blog/img/sms.ac.screen.png" alt="sms.ac homepage" height="297" width="422"/></p>

<p><small>All trademarks, names, and services referenced above belong to their respective owner. </small></p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2005/09/sms_ac_website</link>
         <guid>http://w2.syronex.com/jmr/blog/2005/09/sms_ac_website</guid>
         <category>Web</category>
         <pubDate>Fri, 16 Sep 2005 19:30:56 -0600</pubDate>
      </item>
            <item>
         <title>Beware of SMS.ac invitation emails!</title>
         <description><![CDATA[<p>For many months I have been receiving regular emails inviting me to join a service named "SMS.ac". All messages claimed to be from people I more or less knew of (e.g. former students of mine). After receiving one of these emails pretending to be from a friend, I queried her about this and she denied having sent or instructed SMS.ac to send me any of these emails.<br/>Trying to understand, I went to their registration page and I was amazed: innocently, as part of the registration process, they <em>ask your hotmail login and password!</em> <br />
I assume people that give away their details in that way think SMS.ac are only going to use them to import their contacts in a <em>passive</em> way, as done by some other web services. The site does mention in convoluted ways that they will use the imported email addresses to "invite" people in your address book to SMS.ac. And they do mention the clause that one should obtain his contacts' "implicit consent" (they must agree but you don't need to ask them!) before proceeding&hellip; but everything is phrased carefully to avoid startling candidates. So it is probably fair to say that a large share of the people facing this situation are no way near realizing that SMS.ac are going to repeatedly send spam to all their friends, <em>in their name</em>, until they signup or explicitly opt out. <br />
But just imagine someone <em>knowingly</em> allowing sms.ac to send ads to his contacts.<ul><li>this person has made the effort to read and understand the small prints and believes it's fine</li><li>he finds sms.ac so good&mdash;<em>even before trying it </em> (remember, we are still in the signup process)&mdash;that he wants to tell everyone.</li></ul> I am not sure how realistic this scenario is, but I carry on: is this person <em>authorized</em> to disclose the email addresses contained in his address book to a third party? As far as I am concerned the answer is clearly no: if I give you my email address, I grant <em>you</em> the right to use it, but it is a <em>non-transmissible</em> right. I'm obviously not trying to show here that you should blame your friends for spamming you, but simply that sms.ac makes them do things they are not entitled to do anyway.<br />
Many reputable sites allow Alice to email a link to Bob or to refer Bob to their site. Such legitimate referrals tend to have the following properties:<ul><li>Alice knows Bob, and believes that he&mdash;<em>individually</em>&mdash;would be interested in the information or service referred</li><li>Bob's email address is <em>manually</em> entered in the referral form</li><li>the email is sent <em>once</em> when Alice submits the form</li><li>Alice's email address is <em>not kept</em> by the referral system, since further emails would not be legitimate</li><li>the message makes the <em>circumstances</em> in which the email was sent clear and explicit, so that Bob knows from reading the message that Alice visited a site and genuinely thought he would be interested.</li></ul><br />
In contrast, sms.ac<ul><li>operate a bulk, automated, and <em>non-discriminative</em> collection of email addresses</li><li>they keep Bob's email address and send him recurring emails on behalf of Alice, without Alice being aware of it</li><li>they imply Alice was involved in sending the messages</li><li>Bob has to take explicit action to stop receiving emails.</li></ul><br />
I wrote to SMS.ac to demand they cease sending me these unsolicited emails. I noted in my email that their messages were <em>unsolicited, automated and repeated commercial emails</em>, with no prior contact between the parties; therefore clearly qualifying as spam. I also noted that by using your acquaintance's name in the emails they imply the messages come from him/her, which is a form of <em>identity theft</em>. I got a childish and contemptuous reply, but I have not received any spam from them since. So complaining to them can help your particular case, but it won't make them change their practice: SMS.ac boast a very large customer base, which is certainly largely due to the unethical methods they employ to enroll them. If you don't like the way they are doing business, tell their prospective customers <em>openly</em>.</p>

<p><em>Some actions we can take to oppose and help publicize sms.ac's fraudulent practices are:<ol><li>systematically report their mails as spam ("report spam" button) if you are using a webmail service (Yahoo!,...)</li><li>post an appropriate <a href="http://www.alexa.com/data/details/main?q=&url=http://www.sms.ac" rel="nofollow">review</a> on Alexa and/or blog about it</li><li>make sure the person who gave out your email address to SMS.ac is aware of the scam.</li></ol></em> </p>

<p><em>If you have also been spammed by SMS.ac, please post a comment below.</em></p>

<p><small>All trademarks, names, and services referenced above belong to their respective owner. </small></p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2005/08/sms_ac_invitation</link>
         <guid>http://w2.syronex.com/jmr/blog/2005/08/sms_ac_invitation</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Wed, 03 Aug 2005 09:21:45 -0600</pubDate>
      </item>
            <item>
         <title>Spam: you probably know about email address harvesting, what about email gleaning?</title>
         <description><![CDATA[<p>Most people are aware that e-mail addresses posted on the web are being harvested by spammers using software that crawles the web from page to page following links and retrieves everythings that looks like an email address. This happens on any website or newsgroup, including forums and of course blogs. <br />
The one thing spammers need to decide is where on the web to start collecting addresses. On this matter, they proceed as everyone else looking for information: they either use a <em>directory</em>, or use a <em>search engine</em>. Initiating the crawl from a large web directory like the Open Directory and its derivatives gives them the option to target certain categories of victims: personal sites, small businesses, or universities, thereby giving more value to their email dataset. Using search engines allows them to attempt to shortlist sites containing up-to-date contact information (e.g. searching for &ldquo;contact 2005&rdquo;). The search results can then be used as pre-processed data for further automated email address extraction. </p>

<p>More surprising is the fact that there are some people out there spending their days manually gleaning email addresses on the web. They are mostly connecting from Internet caf&eacute;s in places like Ivory-Coast or Nigeria and use tools such as Google, Yahoo or search engine aggregators to look for email addresses using queries like &ldquo;contact john 2005&rdquo; or &ldquo;email me 2005&rdquo;. Look for &ldquo;2005&rdquo; in your webserver log and chances are you will find evidences of this happening on your site.<br />
There are ways to help avoid automated email harvesting without sacrificing too much web usability (i.e. using <a href="http://w2.syronex.com/jmr/safemailto/">encoded email links</a>). There are also ways to help prevent manual email address collection: a simple thing to do is to remove the year appearing in the copyright notice of your contact page, and replace it with a simple script: <br />
<tt><br />
&lt;script type="text/javascript"&gt;&lt;!--<br />
document.write((new Date()).getFullYear())<br />
//--&gt;&lt;/script&gt;<br />
</tt></p>]]></description>
         <link>http://w2.syronex.com/jmr/blog/2005/06/spam_email_gleaning</link>
         <guid>http://w2.syronex.com/jmr/blog/2005/06/spam_email_gleaning</guid>
         <category>Spam &amp; Scam</category>
         <pubDate>Sat, 04 Jun 2005 09:06:03 -0600</pubDate>
      </item>
      
   </channel>
</rss>
