Etiquetario morfosintáctico del SLI para corpus de lengua gallegaaplicación al corpus paralelo TECTRA

  1. Gómez Guinovart, Xavier
  2. Aguirre Moreno, José Luis
  3. Álvarez Lugrís, Alberto
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2002

Issue: 28

Pages: 23-34

Type: Article

More publications in: Procesamiento del lenguaje natural

Abstract

In this article we present a complete and normalized morphosyntactic tagset for the annotation of linguistic corpora in Galician. The elaboration of this tagset, designed by the Computational Linguistics Group (SLI)of the University of Vigo, following strictly the EAGLES recommendations (Leech and Wilson, 1996), includes the creation of an intermediate tagset that allows us to establish a correspondence between the grammatical information encoded for Galician in the CLUVI (Linguistic Corpus of the University of Vigo) and the information encoded in the EAGLES standard format in corpora of other languages