Data Mining, Down Under

Welcome to “Data Mining, Down Under”, a blog by Aussie data miner Shane Butler.

Smart SPAM & Fighting it

May 13th, 2006 · 1 Comment · Research

For any machine-learning-based SPAM filter, such as the popular Bayesian methods, the key to success is the training data: the body of previously identified SPAM and HAM (valid emails). To trick the filter, a spammer must make their messages more HAM-like. The way to beat this is by giving [...]

Got Zeitgeist? Mining Online Trends

March 6th, 2006 · 2 Comments · Research

Each week, Google provides a taste of the top search queries on a site called Google Zeitgeist. At the end of each year, they compile a more comprehensive report of what people have been searching for. The 2005 Zeitgeist has been out since December and provides some interesting insights into online trends over the past year. My [...]
