This is a discussion on non-phish corpus? - SpamAssassin ; hey -- while working on this rule automation stuff, something occurred to me. I have a little corpus of my own transactional mail, the stuff that phishers love to impersonate. But is anyone collecting samples of this on a larger ...
hey -- while working on this rule automation stuff, something occurred to
me. I have a little corpus of my own transactional mail, the stuff that
phishers love to impersonate. But is anyone collecting samples of this on
a larger scale, that I can get a copy of?
It doesn't need to be complete -- the actual transaction details (items,
usernames, passwords, id numbers etc.) can be xxxx'd out; it's just the
common text like "PayPal is a trademark of eBay Inc" that I'd be after.
--j.