<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>RSS for Automated topic naming to support cross-project analysis of software maintenance activities</title><link>http://journalogy.net/Rss.aspx?cata=9&amp;id=39275548</link><description>Search RSS feed for Microsoft Academic Search</description><generator>MSRA Libra RSS Burner</generator><copyright>(c)2008 Microsoft Corpration, All right reserved.</copyright><pubDate>Thu, 23 May 2013 16:43:39 GMT</pubDate><lastBuildDate>Thu, 23 May 2013 16:43:39 GMT</lastBuildDate><category /><item><title>Automated topic naming to support cross-project analysis of software maintenance activities</title><link>http://journalogy.net/Publication/39275548</link><pubDate>Thu, 23 May 2013 09:43:39 GMT</pubDate><guid isPermaLink="false">392755481</guid><description><![CDATA[<div><a href="http://journalogy.net/Author/652277">Abram Hindle</a>, <a href="http://journalogy.net/Author/2155512">Neil A. Ernst</a>, <a href="http://journalogy.net/Author/803566">Michael W. Godfrey</a>, <a href="http://journalogy.net/Author/61801">John Mylopoulos</a>:
            
            <span style="margin-left:20px">(Citations:1)</span><span style="margin-left:20px"><a href="http://dl.acm.org/citation.cfm?id=1985466">view publication</a></span></div><div>Researchers have employed a variety of techniques to extract underlying topics that relate to <a href='http://academic.research.microsoft.com/Keyword/38541/software-development'>software development</a>  artifacts. Typically, these techniques use semi-unsupervised machine-learning algorithms to suggest candidate word-lists. However, word-lists are difficult to interpret in the absence of meaningful summary labels. Current topic modeling techniques assume manual labelling and do not use domainspecific knowledge to improve, contextualize, or describe results for the developers. We propose a solution: automated labelled topic extraction. Topics are extracted using <a href='http://academic.research.microsoft.com/Keyword/22230/latent-dirichlet-allocation'>Latent Dirichlet Allocation</a>  (LDA) from commit-log comments recovered from <a href='http://academic.research.microsoft.com/Keyword/38888/source-control'>source control</a>  systems such as CVS and Bit-Keeper. These topics are given labels from a generalizable cross-project taxonomy, consisting of non-functional requirements. Our approach was evaluated with experiments and case studies on two large-scale RDBMS projects: MySQL and MaxDB. The case studies show that labelled topic extraction can produce appropriate, context-sensitive labels relevant to these projects, which provides fresh insight into their evolving <a href='http://academic.research.microsoft.com/Keyword/38541/software-development'>software development</a>  activities.</div><div>Conference: <a href="http://journalogy.net/Conference/2208">Mining Software Repositories - MSR</a>, pp. 163-172, 2011</div><div></div><div />]]></description></item></channel></rss>