Kanchev, Georgi M. and Murukannaiah, Pradeep K. and Chopra, Amit K. and Sawyer, Pete (2017) Canary : Extracting Requirements-Related Information from Online Discussions. In: Proceedings of the 25th IEEE International Requirements Engineering Conference :. IEEE, Lisbon, pp. 31-40. ISBN 9781538631928
canary_2017.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.
Download (1MB)
Abstract
Online discussions about software applications generate a large amount of requirements-related information. This information can potentially be usefully applied in requirements engineering; however currently, there are few systematic approaches for extracting such information. To address this gap, we propose Canary, an approach for extracting and querying requirements-related information in online discussions. The highlight of our approach is a high-level query language that combines aspects of both requirements and discussion in online forums. We give the semantics of the query language in terms of relational databases and SQL. We demonstrate the usefulness of the language using examples on real data extracted from online discussions. Our approach relies on human annotations of online discussions. We highlight the subtleties involved in interpreting the content in online discussions and the assumptions and choices we made to effectively address them. We demonstrate the feasibility of generating high-quality annotations by obtaining them from lay Amazon Mechanical Turk users.