Tecnologías de la lengua para análisis de opiniones en redes sociales

  1. Manuel Vilares Ferro
  2. Elena Sánchez Trigo
  3. Carlos Gómez Rodríguez
  4. Miguel Ángel Alonso Pardo
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2017

Issue: 59

Pages: 125-128

Type: Article

More publications in: Procesamiento del lenguaje natural

Abstract

The recent popularization of social media based on microtexts, among which Twitter stands out, has enabled a globalization of the expression of opinions. Although microtexts present some specific lexical and syntactic properties that differ from those of standard text, certain basic aspects of language must be respected so that they are intelligible. In this project, we propose to exploit this fact in order to improve the linguistic support for processing microtexts in our natural sphere of interest: the Spanish and Galician languages. To do so, it will be necessary to improve the performance of current parsing and analysis techniques on standard text, to design mechanisms so that models and methods effective for analyzing standard language can be adapted to microtexts, and to project effective models, methods and resources across languages.

Bibliographic References

  • Carter, S., W. Weerkamp, y M. Tsagkias. 2013. Microblog language identification: overcoming the limitations of short, unedited and idiomatic text. Language Resources and Evaluation, 47(1):195-215.
  • Doval, Y., C. Gómez-Rodríguez, y J. Vilares. 2016. Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales. Procesamiento del Lenguaje Natural, 57:75-82.
  • Esteban, J. L., R. Ferrer-i-Cancho, y C. Gómez-Rodríguez. 2016. The scaling of the minimum sum of edge lengths in uniformly random trees. Journal of Statistical Mechanics: Theory and Experiment, (2016):063401.
  • Ferrer-i-Cancho, R. y C. Gómez-Rodríguez. 2016a. Crossings as a side effect of dependency lengths. Complexity, 21(S2):320-328.
  • Ferrer-i-Cancho, R. y C. Gómez-Rodríguez. 2016b. Liberating language research from dogmas of the 20th century. Glottometrics, 33:33-34.
  • Gamallo, P. Pendiente de publicación. Comparing explicit and predictive distributional semantic models endowed with syntactic contexts. Language Resources and Evaluation.
  • Garcia, M., C. Gómez-Rodríguez, y M. A. Alonso. 2016. Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego. Procesamiento del Lenguaje Natural, 57:33-40.
  • Gómez Rodriguez, C. 2016a. Natural language processing and the now-or-never bottleneck. Behavioral and Brain Scien- ces, 39:e74.
  • Gómez Rodríguez, C. 2016b. Restricted non-projectivity: Coverage vs. efficiency. Computational Linguistics, 42(4):809-817.
  • Gómez Rodríguez, C. y D. Fernández-González. 2015. An efficient dynamic oracle for unrestricted non-projective parsing. En Proceedings of ACL-IJCNLP 2015, páginas 256-261, Beijing, China.
  • Vilares, D. y M. A. Alonso. 2016. A review on political analysis and social media. Procesamiento del Lenguaje Natural, 56:1323.
  • Vilares, D., M. A. Alonso, y C. Gómez-Rodríguez. 2016. One model, two languages: training bilingual parsers with harmonized treebanks. En Proceedings of ACL 2016, páginas 425-431, Berlin, Germany.
  • Vilares, D., M. A. Alonso, y C. Gómez-Rodríguez. 2017. Supervised sentiment analysis in multilingual environments. Information Processing & Management, 53(3):595-607.
  • Vilares, D., Y. Doval, M. A. Alonso, y C. Gómez-Rodríguez. 2016. Exploiting neural activation values for Twitter sentiment classification and quantification. En Proceedings of SemEval-2016, páginas 79-84, San Diego, California.
  • Vilares, D., C. Gómez-Rodríguez, y M. A. Alonso. 2017. Universal, unsupervised (rule-based), uncovered sentiment analysis. Knowledge-Based Systems, 118:45-55.
  • Vilares, D., M. Thelwall, y M. A. Alonso. 2015. The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets. Journal of Information Science, 41(6):799-813.
  • Vilares, J., M. A. Alonso, Y. Doval, y M. Vilares. 2016a. Studying the effect and treatment of misspelled queries in cross-language information retrieval. Information Processing & Management, 52(4):646-657.
  • Vilares, J., M. Vilares, M. A. Alonso, y M. P. Oakes. 2016b. On the feasibility of character n-grams pseudotranslation for cross-language information retrieval tasks. Computer Speech and Language, 36(36):136-164.
  • Vilares, M., V. M. Darriba, y F. J. Ribadas. 2017. Modeling of learning curves with applications to POS tagging. Computer Speech and Language, 41:1-28.