
      Is Open Access

      Normalization and Matching in the DORO System

      Published
      proceedings-article
      21st Annual BCS-IRSG Colloquium on IR (IRSG)
      IR Research
      19-20 April 1999

            Abstract

            This paper is concerned with the use of linguistically motivated phrases as indexing terms in Information Retrieval applications. Apart from the conventional noun phrases, we propose to use verb phrases as index terms for text classification. Techniques for phrase matching through syntactic normalization and semantical matching are described. We discuss the realization of the syntactic normalization of phrases by transduction to frames. Semantical normalization is based on lexico-semantical relations, taking into account certain properties of the classification algorithms used. The ideas described here are being implemented in the Document Routing system DORO, in which statistical learning algorithms are applied to document profiles consisting of phrases. This paper describes the rationale behind work in progress, rather than presenting final results.


            Author and article information

            Contributors
            Conference
            April 1999
            Pages: 1-13
            Affiliations
            Department of Computer Science, University of Nijmegen, The Netherlands.
            Article
            DOI: 10.14236/ewic/IRSG1999.8
            © C.H.A. Koster et al. Published by BCS Learning and Development Ltd. 21st Annual BCS-IRSG Colloquium on IR, Glasgow

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            Electronic Workshops in Computing (eWiC)

            ISSN 1477-9358, BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/IRSG1999.8
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science, Computer science, Security & Cryptology, Graphics & Multimedia design, General computer science, Human-computer-interaction
