Lancaster EPrints

Selecting query terms to build a specialised corpus from a restricted-access database.

Gabrielatos, Costas (2007) Selecting query terms to build a specialised corpus from a restricted-access database. ICAME Journal, 31. pp. 5-44.

[img]
Preview
PDF (RQTR.pdf)
Download (159Kb) | Preview

    Abstract

    This paper proposes an accessible measure of the relevance of additional terms to a given query, describes and comments on the steps leading to its develop-ment, and discusses its utility. The measure, termed relative query term rele-vance (RQTR), draws on techniques used in information retrieval, and can becombined with a technique used in creating corpora from the world wide web,namely keyword analysis. It is independent of reference corpora, and does notrequire knowledge of the number of (relevant) documents in the database. Although it does not make use of user/expert judgements of document relevance,it does allow for subjective decisions. However, subjective decisions are triangu-lated against two objective indicators: keyness and, mainly, RQTR.

    Item Type: Article
    Journal or Publication Title: ICAME Journal
    Uncontrolled Keywords: corpora ; corpus building ; text database ; query expansion ; query term relevance ; keywords
    Subjects: P Language and Literature > P Philology. Linguistics
    Departments: Faculty of Arts & Social Sciences > Linguistics & English Language
    ID Code: 528
    Deposited By: Mr Costas Gabrielatos
    Deposited On: 06 Jun 2007
    Refereed?: Yes
    Published?: Published
    Last Modified: 04 Jun 2014 09:21
    Identification Number:
    URI: http://eprints.lancs.ac.uk/id/eprint/528

    Actions (login required)

    View Item