<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:media="http://search.yahoo.com/mrss/"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	>
<channel>
	<title>
	Comments on: Mahout - Clustering (聚类篇)	</title>
	<atom:link href="https://www.coder4.com/archives/4181/feed" rel="self" type="application/rss+xml" />
	<link>https://www.coder4.com/archives/4181</link>
	<description>Keep It Simple and Stupid</description>
	<lastBuildDate>Thu, 14 Aug 2025 05:56:58 +0000</lastBuildDate>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.3</generator>
	<item>
		<title>
		By: Anonymous		</title>
		<link>https://www.coder4.com/archives/4181#comment-1307</link>

		<dc:creator><![CDATA[Anonymous]]></dc:creator>
		<pubDate>Thu, 09 Jul 2015 13:16:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1307</guid>

					<description><![CDATA[In reply to &lt;a href=&quot;https://www.coder4.com/archives/4181#comment-1301&quot;&gt;ok&lt;/a&gt;.

执行mahout org.apache.lucene.benchmark.utils.ExtractReuters ./reuters-sgm ./reuters-out时报错说没有添加“mahout org.apache.lucene.benchmark.utils.ExtractReuters”这个类，这个类不是mahout自带的吗，还是跟版本有关系，劳烦博主指教。]]></description>
			<content:encoded><![CDATA[<p>In reply to <a href="https://www.coder4.com/archives/4181#comment-1301">ok</a>.</p>
<p>执行mahout org.apache.lucene.benchmark.utils.ExtractReuters ./reuters-sgm ./reuters-out时报错说没有添加“mahout org.apache.lucene.benchmark.utils.ExtractReuters”这个类，这个类不是mahout自带的吗，还是跟版本有关系，劳烦博主指教。</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: ok		</title>
		<link>https://www.coder4.com/archives/4181#comment-1301</link>

		<dc:creator><![CDATA[ok]]></dc:creator>
		<pubDate>Thu, 28 May 2015 04:48:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1301</guid>

					<description><![CDATA[In reply to &lt;a href=&quot;https://www.coder4.com/archives/4181#comment-1276&quot;&gt;darrenan&lt;/a&gt;.

用这个来解决 mahout clusterdump -i /user/coder4/reuters-kmeans/clusters-2-final -d /user/coder4/reuters-sparse/dictionary.file-0 -dt sequencefile -o reuters-kmeans-cluster-dump -b 10 -n 20]]></description>
			<content:encoded><![CDATA[<p>In reply to <a href="https://www.coder4.com/archives/4181#comment-1276">darrenan</a>.</p>
<p>用这个来解决 mahout clusterdump -i /user/coder4/reuters-kmeans/clusters-2-final -d /user/coder4/reuters-sparse/dictionary.file-0 -dt sequencefile -o reuters-kmeans-cluster-dump -b 10 -n 20</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: darrenan		</title>
		<link>https://www.coder4.com/archives/4181#comment-1277</link>

		<dc:creator><![CDATA[darrenan]]></dc:creator>
		<pubDate>Tue, 09 Dec 2014 09:15:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1277</guid>

					<description><![CDATA[In reply to &lt;a href=&quot;https://www.coder4.com/archives/4181#comment-1276&quot;&gt;darrenan&lt;/a&gt;.

烦请楼主帮忙解决一下]]></description>
			<content:encoded><![CDATA[<p>In reply to <a href="https://www.coder4.com/archives/4181#comment-1276">darrenan</a>.</p>
<p>烦请楼主帮忙解决一下</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: darrenan		</title>
		<link>https://www.coder4.com/archives/4181#comment-1276</link>

		<dc:creator><![CDATA[darrenan]]></dc:creator>
		<pubDate>Tue, 09 Dec 2014 09:13:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1276</guid>

					<description><![CDATA[&lt;span class=&quot;crayon-e&quot;&gt;mahout &lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;kmeans &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;i&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;user&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;coder4&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;reuters&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;sparse&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;tfidf&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;vectors &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;c&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;user&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;coder4&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;reuters&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;canopy&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;centroids&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;clusters&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-cn&quot;&gt;0&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-m&quot;&gt;final &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;o&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;user&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;coder4&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;/&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;reuters&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;kmeans &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-e&quot;&gt;dm &lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;org&lt;/span&gt;&lt;span class=&quot;crayon-sy&quot;&gt;.&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;apache&lt;/span&gt;&lt;span class=&quot;crayon-sy&quot;&gt;.&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;mahout&lt;/span&gt;&lt;span class=&quot;crayon-sy&quot;&gt;.&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;common&lt;/span&gt;&lt;span class=&quot;crayon-sy&quot;&gt;.&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;distance&lt;/span&gt;&lt;span class=&quot;crayon-sy&quot;&gt;.&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;CosineDistanceMeasure &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;-&lt;/span&gt;&lt;span class=&quot;crayon-i&quot;&gt;x&lt;/span&gt;&lt;span class=&quot;crayon-cn&quot;&gt;200&lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;- &lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;ow &lt;/span&gt;&lt;span class=&quot;crayon-o&quot;&gt;--&lt;/span&gt;&lt;span class=&quot;crayon-v&quot;&gt;clustering&lt;/span&gt;

按照楼主的指示，一直无法执行，报错如下：

INFO hdfs.DFSClient: Could not complete  /sport/sport_result/clusters-0/part-00003 retrying...

INFO hdfs.DFSClient: Could not complete  /sport/sport_result/clusters-0/part-00009 retrying...]]></description>
			<content:encoded><![CDATA[<p><span class="crayon-e">mahout </span><span class="crayon-v">kmeans </span><span class="crayon-o">-</span><span class="crayon-v">i</span><span class="crayon-o">/</span><span class="crayon-v">user</span><span class="crayon-o">/</span><span class="crayon-v">coder4</span><span class="crayon-o">/</span><span class="crayon-v">reuters</span><span class="crayon-o">-</span><span class="crayon-v">sparse</span><span class="crayon-o">/</span><span class="crayon-v">tfidf</span><span class="crayon-o">-</span><span class="crayon-v">vectors </span><span class="crayon-o">-</span><span class="crayon-v">c</span><span class="crayon-o">/</span><span class="crayon-v">user</span><span class="crayon-o">/</span><span class="crayon-v">coder4</span><span class="crayon-o">/</span><span class="crayon-v">reuters</span><span class="crayon-o">-</span><span class="crayon-v">canopy</span><span class="crayon-o">-</span><span class="crayon-v">centroids</span><span class="crayon-o">/</span><span class="crayon-v">clusters</span><span class="crayon-o">-</span><span class="crayon-cn">0</span><span class="crayon-o">-</span><span class="crayon-m">final </span><span class="crayon-o">-</span><span class="crayon-v">o</span><span class="crayon-o">/</span><span class="crayon-v">user</span><span class="crayon-o">/</span><span class="crayon-v">coder4</span><span class="crayon-o">/</span><span class="crayon-v">reuters</span><span class="crayon-o">-</span><span class="crayon-v">kmeans </span><span class="crayon-o">-</span><span class="crayon-e">dm </span><span class="crayon-v">org</span><span class="crayon-sy">.</span><span class="crayon-v">apache</span><span class="crayon-sy">.</span><span class="crayon-v">mahout</span><span class="crayon-sy">.</span><span class="crayon-v">common</span><span class="crayon-sy">.</span><span class="crayon-v">distance</span><span class="crayon-sy">.</span><span class="crayon-v">CosineDistanceMeasure </span><span class="crayon-o">-</span><span class="crayon-i">x</span><span class="crayon-cn">200</span><span class="crayon-o">- </span><span class="crayon-v">ow </span><span class="crayon-o">--</span><span class="crayon-v">clustering</span></p>
<p>按照楼主的指示，一直无法执行，报错如下：</p>
<p>INFO hdfs.DFSClient: Could not complete  /sport/sport_result/clusters-0/part-00003 retrying...</p>
<p>INFO hdfs.DFSClient: Could not complete  /sport/sport_result/clusters-0/part-00009 retrying...</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: question		</title>
		<link>https://www.coder4.com/archives/4181#comment-1268</link>

		<dc:creator><![CDATA[question]]></dc:creator>
		<pubDate>Tue, 14 Oct 2014 07:16:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1268</guid>

					<description><![CDATA[如何发给你邮箱，有些问题请教]]></description>
			<content:encoded><![CDATA[<p>如何发给你邮箱，有些问题请教</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: 啊		</title>
		<link>https://www.coder4.com/archives/4181#comment-1266</link>

		<dc:creator><![CDATA[啊]]></dc:creator>
		<pubDate>Tue, 16 Sep 2014 13:25:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1266</guid>

					<description><![CDATA[mahout clusterdump -i /user/coder4/reuters-kmeans/clusters-2-final -d ./reuters-sparse/dictionary.file-0 -dt sequencefile -o ./reuters-kmeans-cluster-dump/ -n 20
这一步失败，出现数组下表超出的异常，同样的现象参考：http://www.dataguru.cn/forum.php?mod=viewthread&#038;tid=236472

希望博主能够解决这个问题，谢谢！]]></description>
			<content:encoded><![CDATA[<p>mahout clusterdump -i /user/coder4/reuters-kmeans/clusters-2-final -d ./reuters-sparse/dictionary.file-0 -dt sequencefile -o ./reuters-kmeans-cluster-dump/ -n 20<br />
这一步失败，出现数组下表超出的异常，同样的现象参考：http://www.dataguru.cn/forum.php?mod=viewthread&amp;tid=236472</p>
<p>希望博主能够解决这个问题，谢谢！</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Anonymous		</title>
		<link>https://www.coder4.com/archives/4181#comment-1257</link>

		<dc:creator><![CDATA[Anonymous]]></dc:creator>
		<pubDate>Thu, 17 Jul 2014 06:36:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.coder4.com/?p=4181#comment-1257</guid>

					<description><![CDATA[写的非常好！赞+1]]></description>
			<content:encoded><![CDATA[<p>写的非常好！赞+1</p>
]]></content:encoded>
		
			</item>
	</channel>
</rss>
