Boing Boing has something up about ASCII spam.
The general idea being that if images are going to be filtered out and the Bayesian analysis stuff is getting good at blocking text... then the next step must be ASCII art.
Off the top of my head, I would guess that eventually the Bayesian filtering software would learn to weigh "pre" tags more towards spam, as well as large patches of spaces (which actually are more important than the "pre" tags in this sense, but the pre tag is what allows the spaces to survive on the screen in web browsers).
So while the Boing Boing mentions that it is hard to block - I would bet that the filters that can learn actually do fairly well at blocking them (assuming that they don't compress spaces).
Posted by Eric at December 3, 2004 01:32 AM
| TrackBack