<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Data into results</title>
	<atom:link href="http://www.dataintoresults.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataintoresults.com</link>
	<description>A data miner diary</description>
	<lastBuildDate>Fri, 30 Oct 2009 23:50:44 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>Comment on Machine learning vs simulation by Sébastien Derivaux</title>
		<link>http://www.dataintoresults.com/2009/10/machine-learning-vs-simulation/comment-page-1/#comment-90</link>
		<dc:creator>Sébastien Derivaux</dc:creator>
		<pubDate>Fri, 30 Oct 2009 23:50:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=193#comment-90</guid>
		<description>I agree on your comment but it is only when you choose the method before applying it and not trying other methods. Usually, I pick many methods and try all choosing the best or the best combination. Sometimes it is not the method I would choose by &quot;domain knowledge&quot; which is the best. For unsupervised classification, the choice rely more on knowledge than evaluation (usually not possible).

Thanks for your PS.</description>
		<content:encoded><![CDATA[<p>I agree on your comment but it is only when you choose the method before applying it and not trying other methods. Usually, I pick many methods and try all choosing the best or the best combination. Sometimes it is not the method I would choose by &#8220;domain knowledge&#8221; which is the best. For unsupervised classification, the choice rely more on knowledge than evaluation (usually not possible).</p>
<p>Thanks for your PS.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Machine learning vs simulation by Steffen</title>
		<link>http://www.dataintoresults.com/2009/10/machine-learning-vs-simulation/comment-page-1/#comment-89</link>
		<dc:creator>Steffen</dc:creator>
		<pubDate>Thu, 29 Oct 2009 17:14:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=193#comment-89</guid>
		<description>Strictly speaking, the usage of a specific machine learning method already adds domain knowledge ;) (e.g. in a specific area some methods are more suitable than others). 

I think an example for the merging of both &quot;tools&quot; is the EM-Algorithm (read: expectation maximization) for e.g. estimating the parameters of a multi-gaussian-distribution. 

kind regards,

Steffen

PS: I like your style of writing and the topics you are covering. I am looking forward to read more posts (more often ;) ) in the future.</description>
		<content:encoded><![CDATA[<p>Strictly speaking, the usage of a specific machine learning method already adds domain knowledge <img src='http://www.dataintoresults.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  (e.g. in a specific area some methods are more suitable than others). </p>
<p>I think an example for the merging of both &#8220;tools&#8221; is the EM-Algorithm (read: expectation maximization) for e.g. estimating the parameters of a multi-gaussian-distribution. </p>
<p>kind regards,</p>
<p>Steffen</p>
<p>PS: I like your style of writing and the topics you are covering. I am looking forward to read more posts (more often <img src='http://www.dataintoresults.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  ) in the future.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Using MySQL as a Data Warehouse by Sébastien Derivaux</title>
		<link>http://www.dataintoresults.com/2009/06/using-mysql-as-a-data-warehouse/comment-page-1/#comment-88</link>
		<dc:creator>Sébastien Derivaux</dc:creator>
		<pubDate>Wed, 28 Oct 2009 22:36:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=40#comment-88</guid>
		<description>Yes for the index. 

Not easy to managed so many engines when no one is perfect and still no column oriented storage engine in the main Mysql version.</description>
		<content:encoded><![CDATA[<p>Yes for the index. </p>
<p>Not easy to managed so many engines when no one is perfect and still no column oriented storage engine in the main Mysql version.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data mining tools by Sébastien Derivaux</title>
		<link>http://www.dataintoresults.com/2009/07/data-mining-tools/comment-page-1/#comment-87</link>
		<dc:creator>Sébastien Derivaux</dc:creator>
		<pubDate>Wed, 28 Oct 2009 22:33:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=122#comment-87</guid>
		<description>@Steffen I was not having the chance to use Rapidminer. Of course it could be interresting as it become most popular than Weka.

@dodo data preparation is of course a big topic. I&#039;m using Talend to do this job, very efficient and easy (and you could add Java in it). Most ETL package will work.</description>
		<content:encoded><![CDATA[<p>@Steffen I was not having the chance to use Rapidminer. Of course it could be interresting as it become most popular than Weka.</p>
<p>@dodo data preparation is of course a big topic. I&#8217;m using Talend to do this job, very efficient and easy (and you could add Java in it). Most ETL package will work.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data mining tools by Steffen</title>
		<link>http://www.dataintoresults.com/2009/07/data-mining-tools/comment-page-1/#comment-86</link>
		<dc:creator>Steffen</dc:creator>
		<pubDate>Thu, 24 Sep 2009 11:27:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=122#comment-86</guid>
		<description>&lt;a href=&quot;#comment-84&quot; rel=&quot;nofollow&quot;&gt;@dodo &lt;/a&gt; 

I disagree for two reasons:
- I cannot count the algorithms I can give you a understandable description of in 1 minute, but when it comes to a real data analysis you will meet special cases where you have to know EXACTLY how this algorithm is implemented. That is the reason I could never work with non-open source programs
- If you are not able to write code (at least for changing the behavior of present algorithms or create new ones) you restrict yourself to use only what&#039;s available. Are you sure your data mining environment is prepared for every possible data analysis problem ?

@tools: You forgot RapidMiner (former Yale) which does an excellent job in handling large datasets and data preparation (its key focus). It is free, it is open source and it is written in java.

@this blog: This is my first visit and I really like it. Design, content ... sweet and technical, that is how I like it.</description>
		<content:encoded><![CDATA[<p><a href="#comment-84" rel="nofollow">@dodo </a> </p>
<p>I disagree for two reasons:<br />
- I cannot count the algorithms I can give you a understandable description of in 1 minute, but when it comes to a real data analysis you will meet special cases where you have to know EXACTLY how this algorithm is implemented. That is the reason I could never work with non-open source programs<br />
- If you are not able to write code (at least for changing the behavior of present algorithms or create new ones) you restrict yourself to use only what&#8217;s available. Are you sure your data mining environment is prepared for every possible data analysis problem ?</p>
<p>@tools: You forgot RapidMiner (former Yale) which does an excellent job in handling large datasets and data preparation (its key focus). It is free, it is open source and it is written in java.</p>
<p>@this blog: This is my first visit and I really like it. Design, content &#8230; sweet and technical, that is how I like it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Using MySQL as a Data Warehouse by dodo</title>
		<link>http://www.dataintoresults.com/2009/06/using-mysql-as-a-data-warehouse/comment-page-1/#comment-85</link>
		<dc:creator>dodo</dc:creator>
		<pubDate>Wed, 23 Sep 2009 11:26:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=40#comment-85</guid>
		<description>Nice article, thanks for it!
Looks like storage engine is an another dimension in MySQL DWH design :)
Have you created indexes for innodb in &quot;innodb	memory	memory	28s&quot; ?</description>
		<content:encoded><![CDATA[<p>Nice article, thanks for it!<br />
Looks like storage engine is an another dimension in MySQL DWH design <img src='http://www.dataintoresults.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
Have you created indexes for innodb in &#8220;innodb	memory	memory	28s&#8221; ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data mining tools by dodo</title>
		<link>http://www.dataintoresults.com/2009/07/data-mining-tools/comment-page-1/#comment-84</link>
		<dc:creator>dodo</dc:creator>
		<pubDate>Wed, 23 Sep 2009 10:11:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=122#comment-84</guid>
		<description>Hi,
what I see as the biggest challenge in DM is data-preparation. That&#039;s not task for Weka, not for Excel and I&#039;m not familiar with R.

SAS Base does the great job there. SPSS Modeler as well. SPSS Statistic trial is available at www.spss.com. 

Java is cool. But you are wasting time with programming. Data-miner has more important task to do than generating tons of code. 
Doesn&#039;t (s)he?</description>
		<content:encoded><![CDATA[<p>Hi,<br />
what I see as the biggest challenge in DM is data-preparation. That&#8217;s not task for Weka, not for Excel and I&#8217;m not familiar with R.</p>
<p>SAS Base does the great job there. SPSS Modeler as well. SPSS Statistic trial is available at <a href="http://www.spss.com" rel="nofollow">http://www.spss.com</a>. </p>
<p>Java is cool. But you are wasting time with programming. Data-miner has more important task to do than generating tons of code.<br />
Doesn&#8217;t (s)he?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on How to : What to do when your model fails? by dodo</title>
		<link>http://www.dataintoresults.com/2009/07/what-to-do-when-model-fails/comment-page-1/#comment-83</link>
		<dc:creator>dodo</dc:creator>
		<pubDate>Wed, 23 Sep 2009 10:03:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=142#comment-83</guid>
		<description>&quot;... use genetic programming ... You could look at GenIQ pages for a description. 99.9% of the time I think it will not work&quot;
Good point! Just what would people behind the GenIQ say :)</description>
		<content:encoded><![CDATA[<p>&#8220;&#8230; use genetic programming &#8230; You could look at GenIQ pages for a description. 99.9% of the time I think it will not work&#8221;<br />
Good point! Just what would people behind the GenIQ say <img src='http://www.dataintoresults.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on About me by Sandro</title>
		<link>http://www.dataintoresults.com/about/comment-page-1/#comment-67</link>
		<dc:creator>Sandro</dc:creator>
		<pubDate>Fri, 11 Sep 2009 10:36:20 +0000</pubDate>
		<guid isPermaLink="false">http://dataintoresults.homeip.net/?page_id=2#comment-67</guid>
		<description>Do you have a RSS please?
Thanks and nice blog btw...
Regards.
Sandro.</description>
		<content:encoded><![CDATA[<p>Do you have a RSS please?<br />
Thanks and nice blog btw&#8230;<br />
Regards.<br />
Sandro.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on How to : What to do when your model fails? by Sébastien Derivaux</title>
		<link>http://www.dataintoresults.com/2009/07/what-to-do-when-model-fails/comment-page-1/#comment-23</link>
		<dc:creator>Sébastien Derivaux</dc:creator>
		<pubDate>Thu, 30 Jul 2009 21:17:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataintoresults.com/?p=142#comment-23</guid>
		<description>Thanks for your comment. You are of course right, but I don&#039;t know any statistical test which tell you can&#039;t do significantly better. Maybe thinking of an unexpected attribute could bring new insights on the business.
It&#039;s not an easy decision to say stop. It will always be too soon or to late.
If you have many to problems to solve, a breadth first strategy should be better than a depth first one. Find a first solution for each, then look for something better.</description>
		<content:encoded><![CDATA[<p>Thanks for your comment. You are of course right, but I don&#8217;t know any statistical test which tell you can&#8217;t do significantly better. Maybe thinking of an unexpected attribute could bring new insights on the business.<br />
It&#8217;s not an easy decision to say stop. It will always be too soon or to late.<br />
If you have many to problems to solve, a breadth first strategy should be better than a depth first one. Find a first solution for each, then look for something better.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

