Exploiting a Corpus to Compile a Lexical Resource for Academic Writing: Spanish Lexical Combinations

  1. Margarita Alonso Ramos
  2. Marcos García Salido
  3. Marcos García
Book:
Electronic lexicography in the 21st century: Proceedings of eLex 2017 conference
  1. Iztok Kosem (coord.)
  2. Carole Tiberius (coord.)
  3. Miloš Jakubíček (coord.)
  4. Jelena Kallas (coord.)
  5. Simon Krek (coord.)
  6. Vít Baisa (coord.)

Publisher: Lexical Computing

Publication date: 2017

Pages: 571-586

Conference: eLex: Electronic lexicography in the 21st century (5th, 2017, Leiden)

Type: Conference paper

Abstract

This paper provides insight into ongoing research on exploiting Spanish academic corpora to build a lexical tool aimed at novice writers of academic texts. The object of the lexical tool is what we call academic lexical combinations (ALCs). By ALCs we mean recurrent word sequences that may or may not be semantically compositional and that fulfill rhetorical functions such as giving examples, concluding, or expressing emphasis. These functions are particularly prominent in academic discourse. ALCs range from collocations to idioms and formulas, as these are understood in the Meaning-Text Theory (Mel’čuk, 2012). We describe the procedure adopted for extracting ALCs from the corpus, along with how we combine statistical information and native speakers’ intuition. Although corpora play a leading role in the construction of our lexical tool, corpus output must be filtered with phraseological criteria, which makes human intervention necessary. Finally, we specify the architecture of the lexical tool and show several prototype lexicographical entries.