<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Mining, Down Under &#187; Tips &amp; Tutorials</title>
	<atom:link href="http://www.dataminingdownunder.com/category/tips-and-tutorials/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataminingdownunder.com</link>
	<description>Welcome to "Data Mining, Down Under", a blog by Aussie data miner Shane Butler.</description>
	<lastBuildDate>Tue, 23 Feb 2010 09:34:28 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Ten Data Mining Mistakes to Avoid</title>
		<link>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/</link>
		<comments>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/#comments</comments>
		<pubDate>Fri, 15 May 2009 10:19:37 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[john elder]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=258</guid>
		<description><![CDATA[Some really good advice here from John Elder in a series of video tutorials on data mining mistakes to avoid.  Tip #5, regarding contaminating the project with future data is a good one, although sometimes it can be quite tricky (if not impossible) to &#8216;rewind&#8217; the data!  I believe the video series is [...]]]></description>
			<content:encoded><![CDATA[<p>Some really good advice here from John Elder in a <a href="http://www.youtube.com/view_play_list?p=79E8168EA02996A3&#038;sort_field=title">series of video tutorials on data mining mistakes to avoid</a>.  Tip #5, regarding contaminating the project with future data is a good one, although sometimes it can be quite tricky (if not impossible) to &#8216;rewind&#8217; the data!  I believe the video series is a part of the launch of <a href="http://www.elsevierdirect.com/datamining">The Handbook of Statistical Analysis and Data Mining Applications</a>.  You can watch part one below or head over to YouTube for the <a href="http://www.youtube.com/view_play_list?p=79E8168EA02996A3&#038;sort_field=title">entire series</a>.</p>
<p><object width="532" height="323"><param name="movie" value="http://www.youtube.com/v/Rd60vmoMMRY&#038;hl=en&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/Rd60vmoMMRY&#038;hl=en&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="532" height="323"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RapidMiner 4.3 Released</title>
		<link>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/</link>
		<comments>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/#comments</comments>
		<pubDate>Fri, 28 Nov 2008 04:19:56 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[lift chart]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[rapid miner]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=211</guid>
		<description><![CDATA[Rapid-I has released an new and improved version of the open source data mining suite RapidMiner (formely called YALE).  I&#8217;ve been evaluating RapidMiner lately as a possible addition to my data mining toolbox.  I&#8217;ve found the biggest hurdle in learning how to use it is probably the GUI.  It is a tree-based GUI which I [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://rapid-i.com">Rapid-I</a> has released an new and <a href="http://rapid-i.com/content/view/133/1/">improved</a> version of the open source data mining suite <a href="http://rapidminer.com">RapidMiner</a> (formely called YALE).  I&#8217;ve been evaluating RapidMiner lately as a possible addition to my data mining toolbox.  I&#8217;ve found the biggest hurdle in learning how to use it is probably the GUI.  It is a tree-based GUI which I find much harder to understand than the graph-style approach used by <a href="http://www.spss.com/clementine/">many</a> <a href="http://www.sas.com/technologies/analytics/datamining/miner/">others</a>.  However RapidMiner is quite a powerful tool, and the Community Edition is free, so there is probably a lot of benefit in getting used to the strange GUI.</p>
<p>The built in tutorial is a really good way to get a grasp of the system and I highly recommend spending some time on this if you are interested in learning RapidMiner.  I would also recommend a series of <a href="http://www.neuralmarkettrends.com/tutorials/">RapidMiner video turtorials</a> over at <a href="http://www.neuralmarkettrends.com/">Neural Market Trends</a> that are worth checking out too.</p>
<div id="attachment_216" class="wp-caption aligncenter" style="width: 211px"><a href="http://rapid-i.com/images/stories/rapidi/yale/releases/4_3/01_lift.jpg"><img class="size-full wp-image-216" title="RapidMiner 4.3" src="http://www.dataminingdownunder.com/wp-content/uploads/2008/11/rmnewsml.jpg" alt="RapidMiner 4.3 includes a 3d lift chart" width="201" height="150" /></a><p class="wp-caption-text">RapidMiner 4.3 includes a 3D lift chart</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Data Mining the Financial Markets</title>
		<link>http://www.dataminingdownunder.com/2008/04/data-mining-the-financial-markets/</link>
		<comments>http://www.dataminingdownunder.com/2008/04/data-mining-the-financial-markets/#comments</comments>
		<pubDate>Fri, 25 Apr 2008 06:32:10 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[bonds]]></category>
		<category><![CDATA[finance]]></category>

		<guid isPermaLink="false">http://sbutler.com/blog/2008/04/data-mining-the-financial-markets/</guid>
		<description><![CDATA[Thomas A. Rathburn has written a series of three articles on data mining the financial markets.  Rathburn takes a detailed look into the success and failures of his efforts in the markets and with 10 year US bonds in particular.  You can check it out here part 1, part 2, and part 3. [...]]]></description>
			<content:encoded><![CDATA[<p>Thomas A. Rathburn has written a series of three articles on data mining the financial markets.  Rathburn takes a detailed look into the success and failures of his efforts in the markets and with 10 year US bonds in particular.  You can check it out here <a href="http://www.b-eye-network.com/view/6386">part 1</a>, <a href="http://www.b-eye-network.com/view/6655">part 2</a>, and <a href="http://www.b-eye-network.com/view/7189">part 3</a>.  The articles are also available as a podcast here: <a href="http://www.b-eye-network.com/includes/audio/6386.mp3">1</a>, <a href="http://www.b-eye-network.com/includes/audio/6655.mp3">2</a>, <a href="http://www.b-eye-network.com/includes/audio/7189.mp3">3</a>.</p>
<p align="right">[via <a href="http://www.kdnuggets.com">KDnuggets</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2008/04/data-mining-the-financial-markets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.b-eye-network.com/includes/audio/6386.mp3" length="14730638" type="audio/mpeg" />
<enclosure url="http://www.b-eye-network.com/includes/audio/6655.mp3" length="13275487" type="audio/mpeg" />
<enclosure url="http://www.b-eye-network.com/includes/audio/7189.mp3" length="11392788" type="audio/mpeg" />
		</item>
		<item>
		<title>In-cell Graphing</title>
		<link>http://www.dataminingdownunder.com/2006/08/in-cell-graphing/</link>
		<comments>http://www.dataminingdownunder.com/2006/08/in-cell-graphing/#comments</comments>
		<pubDate>Fri, 11 Aug 2006 09:30:11 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[excel]]></category>
		<category><![CDATA[graphs]]></category>

		<guid isPermaLink="false">http://sbutler.com/blog/2006/08/in-cell-graphing/</guid>
		<description><![CDATA[The guys from Juice Analytics have put together an interesting series on in cell graphing (parts 1, 2, &#38; 3). This is a feature that is due in the upcoming version of Excel 2007, however the technique the Juice guys use works across all versions of Excel and is quite visually appealing too. Added bonus, [...]]]></description>
			<content:encoded><![CDATA[<p>The guys from <a href="http://juiceanalytics.com/weblog/" target="_blank">Juice Analytics</a> have put together an interesting series on in cell graphing (parts <a href="http://www.juiceanalytics.com/weblog/?p=236" target="_blank">1</a>, <a href="http://www.juiceanalytics.com/weblog/?p=239">2</a>, &amp; <a href="http://www.juiceanalytics.com/weblog/?p=240" target="_blank">3</a>). This is a feature that is due in the upcoming version of Excel 2007, however the technique the Juice guys use works across all versions of Excel and is quite visually appealing too. Added bonus, I can confirm it works in <a href="http://openoffice.org">OpenOffice.org</a>, <a href="http://www.gnome.org/projects/gnumeric/">Gnumeric</a> and even <a href="http://spreadsheets.google.com" target="_blank">Google Spreadsheets</a> (all to varying degrees).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2006/08/in-cell-graphing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining with Oracle</title>
		<link>http://www.dataminingdownunder.com/2006/05/oracle-data-mining/</link>
		<comments>http://www.dataminingdownunder.com/2006/05/oracle-data-mining/#comments</comments>
		<pubDate>Tue, 30 May 2006 11:45:32 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[oracle]]></category>
		<category><![CDATA[sql]]></category>

		<guid isPermaLink="false">http://sbutler.com/blog/2006/05/oracle-data-mining/</guid>
		<description><![CDATA[If you are interested in data mining and haven&#8217;t already seen the Oracle Data Mining and Analytics blog, it is worth checking out. It has some great how to&#8217;s, including time series forcasting (parts 1, 2, 3) and real-time scoring &#38; model management (parts 1, 2, 3).
]]></description>
			<content:encoded><![CDATA[<p>If you are interested in data mining and haven&#8217;t already seen the <a href="http://oracledmt.blogspot.com/">Oracle Data Mining and Analytics blog</a>, it is worth checking out. It has some great how to&#8217;s, including time series forcasting (parts <a href="http://oracledmt.blogspot.com/2006/01/time-series-forecasting-part-1_23.html">1</a>, <a href="http://oracledmt.blogspot.com/2006/03/time-series-forecasting-2-single-step.html">2</a>, <a href="http://oracledmt.blogspot.com/2006/05/time-series-forecasting-3-multi-step.html">3</a>) and real-time scoring &amp; model management (parts <a href="http://oracledmt.blogspot.com/2006/02/real-time-scoring-model-management-1.html">1</a>, <a href="http://oracledmt.blogspot.com/2006/02/real-time-scoring-model-management-2.html">2</a>, <a href="http://oracledmt.blogspot.com/2006/02/real-time-scoring-model-management-3.html">3</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2006/05/oracle-data-mining/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fayyad on Data Mining (cont)</title>
		<link>http://www.dataminingdownunder.com/2006/01/fayyad-interview-23/</link>
		<comments>http://www.dataminingdownunder.com/2006/01/fayyad-interview-23/#comments</comments>
		<pubDate>Sat, 14 Jan 2006 23:55:51 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[fayyad]]></category>
		<category><![CDATA[interview]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://sbutler.com/blog/2006/01/fayyad-interview-23/</guid>
		<description><![CDATA[Dont forget to read the other KDnuggets interviews with Usama Fayyad:

Part 2: his Data Mining work at Yahoo!


Part 3: his interests, hobbies and outlook for the future, including this gem: &#8220;Any smart young (and good) data miners out there: I think I can keep you very busy indeed!&#8221;

]]></description>
			<content:encoded><![CDATA[<p>Dont forget to read the other <a href="http://www.kdnuggets.com/">KDnuggets</a> interviews with Usama Fayyad:</p>
<ul>
<li><a href="http://www.kdnuggets.com/news/2005/n21/">Part 2</a>: his Data Mining work at <a href="http://yahoo.com">Yahoo!</a></li>
</ul>
<ul>
<li><a href="http://www.kdnuggets.com/news/2005/n22/">Part 3</a>: his interests, hobbies and outlook for the future, including this gem: <em>&#8220;Any smart young (and good) data miners out there: I think I can keep you very busy indeed!&#8221;</em></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2006/01/fayyad-interview-23/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KDnuggets Interviews Fayyad</title>
		<link>http://www.dataminingdownunder.com/2005/11/fayyad-interview-1/</link>
		<comments>http://www.dataminingdownunder.com/2005/11/fayyad-interview-1/#comments</comments>
		<pubDate>Tue, 08 Nov 2005 01:23:53 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[fayyad]]></category>
		<category><![CDATA[interview]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://sbutler.com/blog/?p=71</guid>
		<description><![CDATA[In the latest issue of KDnuggets News Gregory Piatetsky-Shapiro talks to Usama Fayyad in an interview series covering his time at Yahoo, data mining challenges, data mining start-up DigiMine, the resulting spin-off consulting company DMX Group and consulting lessons. I have to say, what a great interview. Fayyad points to lessons on data stratergy, consulting [...]]]></description>
			<content:encoded><![CDATA[<p>In the latest issue of <a href="http://www.kdnuggets.com/news/2005/n20/">KDnuggets News</a> Gregory Piatetsky-Shapiro talks to <a href="http://docs.yahoo.com/docs/pr/executives/fayyad.html">Usama Fayyad</a> in an interview series covering his time at <a href="http://www.kdnuggets.com/news/2005/n20/3i.html">Yahoo</a>, <a href="http://www.kdnuggets.com/news/2005/n20/4i.html">data mining challenges</a>, data mining start-up <a href="http://www.kdnuggets.com/news/2005/n20/5i.html">DigiMine</a>, the resulting spin-off consulting company <a href="http://www.kdnuggets.com/news/2005/n20/6i.html">DMX Group and consulting lessons</a>. I have to say, what a great interview. Fayyad points to lessons on data stratergy, consulting and the potential of web mining. I am surprised at how fast a company could be founded, then have a spin-off and then be purchased &#8212; quite impressive. The interview will continue in the next issue of KDnuggets.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2005/11/fayyad-interview-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
