<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>RSS for Metric and trigonometric pruning for clustering of uncertain data in 2D geometric space</title><link>http://journalogy.net/Rss.aspx?cata=9&amp;id=39322680</link><description>Search RSS feed for Microsoft Academic Search</description><generator>MSRA Libra RSS Burner</generator><copyright>(c)2008 Microsoft Corpration, All right reserved.</copyright><pubDate>Thu, 23 May 2013 09:16:06 GMT</pubDate><lastBuildDate>Thu, 23 May 2013 09:16:06 GMT</lastBuildDate><category /><item><title>Metric and trigonometric pruning for clustering of uncertain data in 2D geometric space</title><link>http://journalogy.net/Publication/39322680</link><pubDate>Thu, 23 May 2013 02:16:06 GMT</pubDate><guid isPermaLink="false">393226800</guid><description><![CDATA[<div><a href="http://journalogy.net/Author/3374258">Wang Kay Ngai</a>, <a href="http://journalogy.net/Author/1003055">Ben Kao</a>, <a href="http://journalogy.net/Author/182132">Reynold Cheng</a>, <a href="http://journalogy.net/Author/1120485">Michael Chau</a>, <a href="http://journalogy.net/Author/286931">Sau Dan Lee</a>, <a href="http://journalogy.net/Author/891133">David W. Cheung</a>, <a href="http://journalogy.net/Author/1153658">Kevin Y. Yip</a>:
            
            <span style="margin-left:20px" /><span style="margin-left:20px"><a href="http://www.sciencedirect.com/science/article/pii/S0306437910000992">view publication</a></span></div><div>We study the problem of clustering data objects with location uncertainty. In our model, a data object is represented by an uncertainty region over which a <a href='http://academic.research.microsoft.com/Keyword/32658/probability-density-function'>probability density function</a>  (pdf) is defined. One method to cluster such uncertain objects is to apply the UK-means algorithm [1], an extension of the traditional K-means algorithm, which assigns each object to the cluster whose representative has the smallest expected distance from it. For arbitrary pdf, calculating the expected distance between an object and a cluster representative requires expensive integration of the pdf. We study two pruning methods: pre-computation (PC) and cluster shift (CS) that can significantly reduce the number of integrations computed. Both pruning methods rely on good bounding techniques. We propose and evaluate two such techniques that are based on metric properties (Met) and trigonometry (Tri). Our experimental results show that Tri offers a very high pruning power. In some cases, more than 99.9% of the expected distance calculations are pruned. This results in a very efficient clustering algorithm.11Part of this paper appears in Ngai et al., 2006 [2], in which the algorithms PC and CS were described. The idea of trigonometric pruning (Section 6), most of the empirical performance study (Section 7), discussion (Section 8), and all proofs of theorems (Appendixes), have not been previously published. The new materials amount to about 34 of the current paper.</div><div></div><div>Journal: <a href="http://journalogy.net/Journal/50">Information Systems - IS</a>, vol. 36, no. 2, pp. 476-497, 2011</div><div />]]></description></item></channel></rss>