Heinecke & Shimorina (2022)

De Arbres
  • Heinecke, Johannes & Anastasia Shimorina. 2022. 'Multilingual Abstract Meaning Representation for Celtic Languages', Proceedings of the 4th Celtic Language Technology Workshop within LREC2022, Marseille, France. European Language Resources Association, 1–6. texte.


 Abstract
 "Deep Semantic Parsing into Abstract Meaning Representation (AMR) graphs has reached a high quality with neural-based seq2seq approaches. However, the training corpus for AMR is only available for English. Several approaches to process other languages exist, but only for high resource languages. We present an approach to create a multilingual text-to-AMR model for three Celtic languages, Welsh (P-Celtic) and the closely related Irish and Scottish-Gaelic (Q-Celtic). The main success of this approach are underlying multilingual transformers like mT5. We finally show that machine translated test corpora unfairly improve the AMR evaluation for about 1 or 2 points (depending on the language)."