Data Mining, Down Under

Welcome to “Data Mining, Down Under”, a blog by Aussie data miner Shane Butler.

Data Mining, Down Under header image 4

Entries Tagged as 'Software'

PMML Tree Model to Code Converter

January 30th, 2010 · 5 Comments · Software

Lately I’ve been trying to come up with a generic way to deploy models on any platform.  So I’d like to share some early code that takes a PMML TreeModel and converts it to R code.  The intention is to get the R code generation working right, then extend to support generation for other languages. [...]

[Read more →]

Tags:·

PMML 4.0 Released

June 18th, 2009 · No Comments · Industry, News, Software

The DMG has released a new version of the PMML open format for representing predictive models. The new version includes support for ensembles, new model types and more built in functions to name just a few of the enhancements. For a detailed summary, see the Zementis blog.

[Read more →]

Tags:··

RapidMiner to get dual GUIs

May 14th, 2009 · 1 Comment · Software

A forum post by Ingo Mierswa of Rapid-I indicates the upcoming RapidMiner v5 will feature two GUIs: the existing tree-based designer and a new graph-based designer! I’m quite excited about this because I’ve personally found the existing UI a bit clunky. Details and screenshots over at the user forum.

[Read more →]

Tags:·

SAS hints at future R integration

February 17th, 2009 · No Comments · News, Software

In more R news, it appears SAS isn’t as worried about airplane safety as originally thought, and has indicated they will include R support in an upcoming update to the SAS/IML product.  For details see NYTimes & Adventures in Consulting.

[Read more →]

Tags:·

R in the New York Times

January 8th, 2009 · 3 Comments · Software

The New York Times has an interesting story on the increasing use of R for data analysis within academia and industry.  Several large corporates are cited as having selected R over commercial conterparts such as S and SAS. [via Slashdot] Update: For more R news, see also Ajay Ohri’s interview with Dr Graham Williams, the [...]

[Read more →]

Tags:··

RapidMiner 4.3 Released

November 28th, 2008 · 3 Comments · Software, Tips & Tutorials

Rapid-I has released an new and improved version of the open source data mining suite RapidMiner (formely called YALE).  I’ve been evaluating RapidMiner lately as a possible addition to my data mining toolbox.  I’ve found the biggest hurdle in learning how to use it is probably the GUI.  It is a tree-based GUI which I [...]

[Read more →]

Tags:···

SAS Forum (Australia) presentations available online

September 30th, 2008 · No Comments · Australia, Industry, Software

The SAS Forum (Australia) was held in Sydney back in August.  I was unable to attend but luckily the presentations have been put online.  Here are some that I found interesting: Make Sure Your Insight is Insightful: Analytical Marketing at NAB by Antony Ugoni (National Australia Bank) Model Deployment and Management – The ATO Story [...]

[Read more →]

Tags:·····

Data Mining with Oracle

May 30th, 2006 · No Comments · Software, Tips & Tutorials

If you are interested in data mining and haven’t already seen the Oracle Data Mining and Analytics blog, it is worth checking out. It has some great how to’s, including time series forcasting (parts 1, 2, 3) and real-time scoring & model management (parts 1, 2, 3).

[Read more →]

Tags:·

Getting to know R Graphs

April 7th, 2006 · 1 Comment · Software

Check out the R Graph Gallery which includes not only detailed descriptions of graphs you can produce in R, but also R source! Props to Martin for the link.

[Read more →]

Tags:·

YALE Data Mining Environment

March 24th, 2006 · No Comments · Software

YALE is a data mining and machine learning environment that integrates WEKA and some other SVM related tools into one GUI tool. Looks pretty spiffy – the GUI looks much better than Weka’s, and its Java/cross-platform also. Screenshots here.

[Read more →]

Tags:···