1,373
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      Celebrating 65 years of The Computer Journal - free-to-read perspectives - bcs.org/tcj65

      scite_
       
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Automatic Natural Language Style Classification and Transformation

      Published
      proceedings-article
      ,
      BCS-IRSG Workshop on Corpus Profiling (IRSG)
      Workshop on Corpus Profiling
      18 October 2008
      style, natural language processing, artificial intelligence, corpus classification, style recognition, style transformation
      Bookmark

            Abstract

            Style is an integral part of natural language in written, spoken or machine generated forms. Humans have been dealing with style in language since the beginnings of language itself, but computers and machine processes have only recently begun to process natural language styles. Automatic processing of styles poses two interrelated challenges: classification and transformation. There have been recent advances in corpus classification, automatic clustering and authorship attribution along many dimensions but little work directly related to writing styles directly and even less in transformation. In this paper we examine relevant literature to define and operationalize a notion of “style” which we employ to designate style markers usable in classification machines. A measurable reading of these markers also helps guide style transformation algorithms. We demonstrate the concept by showing a detectable stylistic shift in a sample piece of text relative to a target corpus. We present ongoing work in building a comprehensive style recognition and transformation system and discuss our results.

            Content

            Author and article information

            Contributors
            Conference
            October 2008
            October 2008
            : 1-11
            Affiliations
            [0001]University of California Santa Cruz

            Department of Computer Science, 1156 High St, Santa Cruz, CA 95064, USA
            Article
            10.14236/ewic/IRSG2008.3
            fcc8be7d-d28a-4817-b42a-8a201f793fa5
            © Foaad Khosmood et al. Published by BCS Learning and Development Ltd. BCS-IRSG Workshop on Corpus Profiling

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            BCS-IRSG Workshop on Corpus Profiling
            IRSG
            London
            18 October 2008
            Electronic Workshops in Computing (eWiC)
            Workshop on Corpus Profiling
            History
            Product

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/IRSG2008.3
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science,Computer science,Security & Cryptology,Graphics & Multimedia design,General computer science,Human-computer-interaction
            style,natural language processing,artificial intelligence,corpus classification,style recognition,style transformation

            Comments

            Comment on this article