[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Sheflug] Reg exp for V.I.A.G.R.A



> In it's rawest form, it'd look summat like:
>
>         [Vv][_ .]*[Ii][_ .]*[Aa][_ .]*[Gg][_ .]*[Rr][_ .]*[Aa]
>
> Where [Vv] et al match either upper or lower case version of
> character,
> and the other pattern matches all your seprators.

Thanks, that looks fairly easy.  Now I need to build a list of banned words
and a script to parse that, and create a new "rules" file containing the Reg
exp's for my mail client to use to sort the mail.
I suppose I should also write it to use a 'set list' of separators and to do
other character equivalent substitions (1 instead of i/I, 0 for o/O, 5/S
etc) as well so a search for 'viagra' becomes.

[Vv][_-' .]*[1Ii][_-' .]*[Aa@][_-' .]*[9Gg][_-' .]*[Rr][_-' .]*[ [at] Aa]

First I need to check the reg exps my mail app uses
I'm still using Mailwasher on Windows as I've yet to find a Linux one that
will read multi pop accounts and delete from the remote pop server (or
download and delete the mail) without removing the valid mail from the pop
server).  I need this to happen so that I only have to maintain one copy of
the spam filter software but it can check for my spam on both my work, home
and other managed accounts and still let me download the  mail to the
relevant computer from wherever I am.

I'm hoping to look at the spam assassin modules for perl to see what's
possible so I can run something under linux on our mail server at work that
despams all my accounts on a regular basis without doing anything to the
mail it thinks is not spam.

Has anyone had a go at doing anything similar?

ChrisJ

---
Outgoing mail is certified Virus Free.
Checked by AVG anti-virus system (http://www.grisoft.com).
Version: 6.0.516 / Virus Database: 313 - Release Date: 01/09/2003


___________________________________________________________________

Sheffield Linux User's Group -
http://www.sheflug.co.uk/mailfaq.html

  GNU the choice of a complete generation.