Exploring cross-lingual word embeddings for the inference of bilingual dictionaries

  1. Marcos Garcia 1
  2. Marcos García-Salido 1
  3. Miguel A. Alonso 1
  1. 1 Universidade da Coruña
    info

    Universidade da Coruña

    La Coruña, España

    ROR https://ror.org/01qckj285

Libro:
Proceedings of TIAD-2019 Shared Task – Translation Inference Across Dictionaries co-located with the 2nd Language, Data and Knowledge Conference (LDK 2019): Leipzig, Germany, May 20, 2019
  1. Jorge Gracia (ed. lit.)
  2. Besim Kabashi (ed. lit.)
  3. Ilan Kernerman (ed. lit.)

Editorial: CEUR Workshop Proceedings

Ano de publicación: 2019

Páxinas: 32-41

Congreso: TIAD Shared Task - Translation Inference Across Dictionaries (2. 2019. Leipzig)

Tipo: Achega congreso

Resumo

We describe four systems to generate automatically bilingual dictionaries based on existing ones: three transitive systems differing only in the pivot language used, and a system based on a different approach which only needs monolingual corpora in both the source and target languages. All four methods make use of cross-lingual word embeddings trained on monolingual corpora, and then mapped into a shared vec- tor space. Experimental results confirm that our strategy has a good coverage and recall, achieving a performance comparable to to the best submitted systems on the TIAD 2019 gold standard set among the teams participating at the TIAD shared task.