WikiDoMiner: Wikipedia domain-specific miner

Ezzini, Saad and Abualhaija, Sallam and Sabetzadeh, Mehrdad (2022) WikiDoMiner: Wikipedia domain-specific miner. In: ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. Association for Computing Machinery (ACM), pp. 1706-1710. ISBN 9781450394130

Full text not available from this repository.

Abstract

We introduce WikiDoMiner, a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https://doi.org/10.5281/zenodo.6672682).
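The two-step pipeline described in the abstract (extract keywords from an RS, then query Wikipedia for them) can be illustrated with a minimal sketch. This is not WikiDoMiner's implementation: the frequency-based keyword ranking, the tiny stopword list, and the names `extract_keywords`, `build_search_url`, `search_titles`, and `mine_corpus` are all assumptions made for illustration. The only external interface used is the public MediaWiki search endpoint (`action=query&list=search`).

```python
import json
import re
import urllib.parse
import urllib.request
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real tool would use a fuller one).
STOPWORDS = {"the", "a", "an", "of", "to", "and", "shall", "be", "in",
             "is", "for", "with", "on", "by", "each", "all", "that", "it"}

API_URL = "https://en.wikipedia.org/w/api.php"


def extract_keywords(rs_text, top_n=5):
    """Rank non-stopword tokens by raw frequency.

    A simple stand-in for domain-specific keyword extraction.
    """
    tokens = re.findall(r"[a-z][a-z\-]+", rs_text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_n)]


def build_search_url(keyword, limit=5):
    """Build a MediaWiki full-text search URL for one keyword."""
    params = {"action": "query", "list": "search",
              "srsearch": keyword, "srlimit": limit, "format": "json"}
    return API_URL + "?" + urllib.parse.urlencode(params)


def search_titles(keyword, limit=5):
    """Network call: return titles of articles matching the keyword."""
    request = urllib.request.Request(
        build_search_url(keyword, limit),
        headers={"User-Agent": "corpus-sketch/0.1"},  # hypothetical agent string
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        data = json.load(response)
    return [hit["title"] for hit in data["query"]["search"]]


def mine_corpus(rs_text, top_n=5, search=search_titles):
    """Collect de-duplicated article titles for the top keywords of an RS."""
    titles = []
    for keyword in extract_keywords(rs_text, top_n):
        for title in search(keyword):
            if title not in titles:
                titles.append(title)
    return titles
```

Calling `mine_corpus(rs_text)` on the text of a requirements specification returns a list of candidate article titles; passing a different `search` callable makes it possible to run the sketch offline or to swap in another retrieval strategy.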

Item Type:
Contribution in Book/Report/Proceedings
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1712
Subjects:
domain-specific corpus generation, natural language processing, natural-language requirements, requirements engineering, Wikipedia, software, artificial intelligence
ID Code:
210066
Deposited By:
Deposited On:
07 Dec 2023 11:15
Refereed?:
Yes
Published?:
Published
Last Modified:
16 Jul 2024 05:23