<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Blocklist</title>
	<atom:link href="http://mjtsai.com/blog/2003/12/18/blocklist/feed/" rel="self" type="application/rss+xml" />
	<link>http://mjtsai.com/blog/2003/12/18/blocklist/</link>
	<description></description>
	<lastBuildDate>Sat, 20 Mar 2010 14:01:58 -0700</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<item>
		<title>By: brian</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-567</link>
		<dc:creator>brian</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-567</guid>
		<description>Thanks, Michael! I was wondering how those have been slipping through.</description>
		<content:encoded><![CDATA[<p>Thanks, Michael! I was wondering how those have been slipping through.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robb Beal</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-568</link>
		<dc:creator>Robb Beal</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-568</guid>
		<description>Why isn&#039;t there a rule for MIME type or URL scheme?</description>
		<content:encoded><![CDATA[<p>Why isn't there a rule for MIME type or URL scheme?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-569</link>
		<dc:creator>Michael</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-569</guid>
		<description>Great idea; thanks, Robb!</description>
		<content:encoded><![CDATA[<p>Great idea; thanks, Robb!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Irons</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-570</link>
		<dc:creator>Nat Irons</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-570</guid>
		<description>I ran that first rule against the last thousand or so legit messages I&#039;d received and came up with what would have been 76 false positives, including book release announcements, bug report replies, and complaints about the lack of coherence of the current season of 24.

Anchoring the expression to the beginning of the email subject reduced the number of false positives to three, and even caught a couple of spams that had snuck in from somewhere.

^(Re: )?[A-Z]{2,8}, [ a-z]*

But even that version matched 44 legit messages in my mailbox over the past year or so. I don&#039;t think that&#039;s safe enough.

The first time I saw &quot;%RND_UC_CHAR&quot; in a spam subject I laughed out loud, but I must have seen dozens of them since. (No knock on SpamSieve, since I see a lot of my mail at the shell before SpamSieve gets ahold of it.)

This bizarre glitch feels like confirmation of the worst fears about spam&#039;s business model. Anyone who can screw up so comprehensively and apparently not notice for weeks isn&#039;t going broke.</description>
		<content:encoded><![CDATA[<p>I ran that first rule against the last thousand or so legit messages I'd received and came up with what would have been 76 false positives, including book release announcements, bug report replies, and complaints about the lack of coherence of the current season of 24.</p>
<p>Anchoring the expression to the beginning of the email subject reduced the number of false positives to three, and even caught a couple of spams that had snuck in from somewhere.</p>
<p>^(Re: )?[A-Z]{2,8}, [ a-z]*</p>
<p>But even that version matched 44 legit messages in my mailbox over the past year or so. I don't think that's safe enough.</p>
<p>The first time I saw "%RND_UC_CHAR" in a spam subject I laughed out loud, but I must have seen dozens of them since. (No knock on SpamSieve, since I see a lot of my mail at the shell before SpamSieve gets ahold of it.)</p>
<p>This bizarre glitch feels like confirmation of the worst fears about spam's business model. Anyone who can screw up so comprehensively and apparently not notice for weeks isn't going broke.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-571</link>
		<dc:creator>Michael</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-571</guid>
		<description>Thanks, Nat. The ^ should definitely be there. I&#039;ve made a few other improvements to the pattern:

(?:(?-i)^(Re: )?[A-Z]{2,8}, [ a-z]*$)

and updated the entry and screenshot. I just used Mailsmith to test it on my last 10,000 good messages and got no false positives.</description>
		<content:encoded><![CDATA[<p>Thanks, Nat. The ^ should definitely be there. I've made a few other improvements to the pattern:</p>
<p>(?:(?-i)^(Re: )?[A-Z]{2,8}, [ a-z]*$)</p>
<p>and updated the entry and screenshot. I just used Mailsmith to test it on my last 10,000 good messages and got no false positives.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-572</link>
		<dc:creator>Michael</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-572</guid>
		<description>I&#039;ve seen some more characters after the comma, so perhaps:

(?:(?-i)^(Re: )?[A-Z]{2,8}, [ a-z0-9&#039;?]*$)</description>
		<content:encoded><![CDATA[<p>I've seen some more characters after the comma, so perhaps:</p>
<p>(?:(?-i)^(Re: )?[A-Z]{2,8}, [ a-z0-9'?]*$)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Irons</title>
		<link>http://mjtsai.com/blog/2003/12/18/blocklist/comment-page-1/#comment-573</link>
		<dc:creator>Nat Irons</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">/?p=742#comment-573</guid>
		<description>That&#039;s much improved. I see only three false positives against my whole personal mail archive with the first revised pattern, and four with the second. They&#039;re all some variation of &quot;OK, declarative statement&quot; or &quot;ACRONYM, pithy comment&quot;, and look enough like the spam pattern to verge on mechanical indistinguishability.</description>
		<content:encoded><![CDATA[<p>That's much improved. I see only three false positives against my whole personal mail archive with the first revised pattern, and four with the second. They're all some variation of "OK, declarative statement" or "ACRONYM, pithy comment", and look enough like the spam pattern to verge on mechanical indistinguishability.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
