DNA technique to fight spam
The bioinformatics research group at IBM's Thomas J Watson Research Center in New York has adapted a technique originally designed to analyse DNA sequences to catch spam.
The system is based on the Chung-Kwei algorithm, designed to search different DNA and amino acid sequences for recurring patterns. The algorithm was fed 65,000 examples of known spam. Each email was treated as a DNA-like chain of characters, and Chung-Kwei identified six million recurring patterns in this collection, such as "Viagra". Each pattern represented a common sequence of letters and numbers that had appeared in unsolicited messages. The researchers then ran a collection of known non-spam through the same process and removed the patterns that occurred in both groups. Incoming email was given a score based on how many spam patterns it contained. A long email with only a few spammy sentences would get a relatively low score, but one with many patterns would score much higher. Chung-Kwei correctly identified nearly 97% of the test messages as spam.
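To make the train-then-score idea concrete, here is a rough Python sketch. It is not the Chung-Kwei algorithm itself: as a simple stand-in it uses fixed-length character substrings as "patterns", and the function names, the pattern length of 6, and the tiny sample messages are all assumptions made up for illustration. The score is a plain count of matching patterns; the article does not say whether the real system normalises it in any way.

```python
from collections import Counter

def substrings(text, length):
    """Return the set of all lowercase character substrings of a given length."""
    text = text.lower()
    return {text[i:i + length] for i in range(len(text) - length + 1)}

def learn_spam_patterns(spam, ham, length=6, min_support=2):
    """Keep substrings that recur across spam messages and never occur in ham.

    Fixed-length substrings are a simplification (an assumption), not the
    pattern discovery used by the real system.
    """
    counts = Counter()
    for message in spam:
        # Each substring is counted at most once per message.
        counts.update(substrings(message, length))
    recurring = {p for p, n in counts.items() if n >= min_support}

    ham_patterns = set()
    for message in ham:
        ham_patterns |= substrings(message, length)

    # Drop anything that also shows up in legitimate mail.
    return recurring - ham_patterns

def spam_score(message, patterns, length=6):
    """Score a message by how many learned spam patterns it contains."""
    return len(substrings(message, length) & patterns)

# Made-up sample data, for illustration only.
spam = ["Buy cheap Viagra now", "cheap Viagra online today"]
ham = ["Lunch today at noon?", "Buy milk on the way home"]
patterns = learn_spam_patterns(spam, ham)
```

With these samples, a message mentioning "Viagra" scores above zero while an ordinary note about lunch scores zero. A real filter would also need a threshold above which a message is treated as spam.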
It seems very interesting; anyway, I have my own idea.
A new protocol for emails based on:
From -> who is sending the email
To -> who will receive the message
Subject -> the subject of the email
Public -> a short password that the "To" person gives to the "From" person before they start sending email
Body -> normal body
The "To" person could give out different passwords: one for text messages, one for emails with attachments...
What happens if the password in the email is incorrect?
The server refuses to pass the email on to the "To" person.
This is the dream I had last night,
truly Sat_
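The check described in the comment above could look roughly like this in Python. This is only a sketch under assumptions: the `Message` fields mirror the five headers from the comment, while the `ISSUED` table, the `accept` function, the example addresses and the passwords are hypothetical names invented here, not part of any real mail protocol.

```python
import hmac
from dataclasses import dataclass

@dataclass
class Message:
    sender: str      # "From"
    recipient: str   # "To"
    subject: str     # "Subject"
    public: str      # "Public": password the recipient gave the sender beforehand
    body: str        # "Body"
    has_attachment: bool = False

# Hypothetical server-side table: (recipient, sender, kind) -> issued password.
# The recipient can issue one password for plain text and another for attachments.
ISSUED = {
    ("bob@example.org", "alice@example.org", "text"): "tulip",
    ("bob@example.org", "alice@example.org", "attachment"): "orchid",
}

def accept(message):
    """Return True only if the message carries the password issued for its kind."""
    kind = "attachment" if message.has_attachment else "text"
    expected = ISSUED.get((message.recipient, message.sender, kind))
    if expected is None:
        # The recipient never issued a password to this sender for this kind.
        return False
    # Constant-time comparison, so the check does not leak how much matched.
    return hmac.compare_digest(expected.encode(), message.public.encode())
```

A message from alice@example.org to bob@example.org with the password "tulip" and no attachment would be accepted; the same message with a wrong password, or with an attachment but only the text password, would be refused.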