Etiquetario morfosintáctico del SLI para corpus de lengua gallegaaplicación al corpus paralelo TECTRA

  1. Gómez Guinovart, Xavier
  2. Aguirre Moreno, José Luis
  3. Álvarez Lugrís, Alberto
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Ano de publicación: 2002

Número: 28

Páxinas: 23-34

Tipo: Artigo

Outras publicacións en: Procesamiento del lenguaje natural

Resumo

In this article we present a complete and normalized morphosyntactic tagset for the annotation of linguistic corpora in Galician. The elaboration of this tagset, designed by the Computational Linguistics Group (SLI)of the University of Vigo, following strictly the EAGLES recommendations (Leech and Wilson, 1996), includes the creation of an intermediate tagset that allows us to establish a correspondence between the grammatical information encoded for Galician in the CLUVI (Linguistic Corpus of the University of Vigo) and the information encoded in the EAGLES standard format in corpora of other languages