"Approximate" hashes

Jerrold Leichter jerrold.leichter at smarts.com
Wed Sep 1 18:05:24 EDT 2004

| nilsimsa
| Computes nilsimsa codes of messages and compares the codes and finds
| clusters of similar messages so as to trash spam.
| What's a nilsimsa code?
| A nilsimsa code is something like a hash, but unlike hashes, a small change
| in the message results in a small change in the nilsimsa code.
| http://lexx.shinn.net/cmeclax/nilsimsa.html
I had a look at the code (which isn't easy to follow).  This appears to be a
new application of Bloom filters.
							-- Jerry
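
To make the "small change in, small change out" property in the quoted text concrete, here is a minimal sketch in Python. It is not the nilsimsa algorithm itself; the bucket count, the use of SHA-256 to pick a bucket, and the names trigram_counts, approx_digest and hamming are all assumptions made for illustration. Each character trigram is hashed into one of 256 buckets, and a digest bit is set when its bucket is fuller than average. Hashing features into a fixed bit array is also what a Bloom filter does, which may be the resemblance noted above.

```python
import hashlib

BUCKETS = 256  # assumed digest width in bits, chosen for illustration


def trigram_counts(text):
    """Count how many character trigrams of `text` land in each bucket."""
    counts = [0] * BUCKETS
    for i in range(len(text) - 2):
        trigram = text[i:i + 3].encode("utf-8")
        # First byte of SHA-256 selects the bucket (an arbitrary choice).
        bucket = hashlib.sha256(trigram).digest()[0]
        counts[bucket] += 1
    return counts


def approx_digest(text):
    """Return a 256-bit integer; bit b is set if bucket b is above the mean."""
    counts = trigram_counts(text)
    total = sum(counts)
    digest = 0
    for b, count in enumerate(counts):
        if count * BUCKETS > total:  # count > total / BUCKETS, without floats
            digest |= 1 << b
    return digest


def hamming(a, b):
    """Number of bit positions in which two digests differ."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    spam = "Buy cheap watches now, limited offer, click the link below today"
    near = "Buy cheap watches now, limited offer, click the link below todey"
    other = "Minutes of the quarterly budget meeting are attached for review"
    print("near-duplicate distance:", hamming(approx_digest(spam), approx_digest(near)))
    print("unrelated distance:     ", hamming(approx_digest(spam), approx_digest(other)))
```

Substituting one character changes at most three trigrams, so at most six buckets (three losing a trigram, three gaining one) can change, and the two digests differ in at most six bits. An ordinary cryptographic hash of the same two messages would differ in about half its bits.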

