Maria Kunilovskaya

linguistics, contrastive and computational


University of Saarland

SFB B7 project

Saarbrücken, Germany

Postdoc Researcher

I am currently a postdoc at the University of Saarland (Germany), working on modelling mediated language to explore the memory-surprisal trade-off hypothesis from information theory. My PhD (completed March 2023, supervisor: Prof. Mitkov, RGCL, UK) was on human translation quality estimation.

Before that, I held an Associate Professor position at a Translation Studies department, lecturing in Translation Studies, Theoretical Linguistics and Corpus Linguistics. I also have a PhD (Candidate of Science) in Contrastive Linguistics (completed 2004, adviser: Prof. Brodovich, Saint Petersburg University).

I have invested a lot of effort in building learner parallel and comparable corpora, as well as other language resources. I have extensive experience in setting up manual annotation experiments.

My research interests have shifted from corpus- and feature-based approaches to ML, modelling and representation learning. In the past few years, I have been involved in several computational humanities projects.

Keywords:

  • language modelling, information theory
  • Python, machine learning, DL, distributional semantics
  • computational humanities, data collection and analysis
  • translation quality estimation, data annotation
  • language varieties, register studies, text complexity

Download curriculum vitae 2017-2024 publications

recent news

Apr 7, 2025 Back to regular teaching! This semester (SoSe-2025), I volunteered to offer a research seminar, Quality in Human and Machine Translation (QH&MT), at the Language Science and Technology Department, University of Saarland. The seminar looks into the properties of MT, especially how it compares to human translation. It is designed to bring together linguistic expertise on quality and the technological aspects of measuring it. We will (i) look into the theoretical prerequisites of translation quality, (ii) compare approaches applied to humans and machines, and (iii) review best practices in manual and automatic quality annotation. The proposed research topics include linguistic studies based on comparative-contrastive analysis, developing TQ test sets, investigating existing metrics and designing new methods, and tweaking MT and MT quality models to capture specific errors or address specified aspects of production. I invite computationally-minded linguists and NLP students who are curious whether today’s technology is real competition for human translators, and what nuances there are to this comparison. We start next Monday, 14 April 2025, at 16.15 (Gebäude C7 2 - Seminarraum -1.05).
Feb 15, 2025 I have three (sic!) posters as the 1st author at the SFB1102-organised RAILS conference. Overachiever, ahem. The topics: Slavic intercomprehension, translation task difficulty, and cognitive load factors in interpreting.
Feb 4, 2025 (1) Had a throwback to the best part of my past life when I gave a 90-minute lecture as part of the BA Vorlesung Perspektiven der Linguistik. Oh my, I miss that! (handout)
(2) On the same day, 15 minutes after the lecture, I had to take the spoken part of the German B2 exam. That went surprisingly well.

selected publications

  1. NoDaLiDa-2025
    Predictability of Microsyntactic Units across Slavic Languages: A translation-based Study
    Kunilovskaya, Maria, Zaitova, Iuliia, Xue, Wei, Stenger, Irina, and Avgustinova, Tania
    In The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies 2025
  2. UM Press
    Confuse and Normalise: Authoritarian Propaganda in a High-Choice Media Environment during Russia’s Invasion of Ukraine
    Alyukov, Maxim, Kunilovskaya, Maria, and Semenov, Andrei
    2024
  3. EAMT-2024
    Mitigating Translationese with GPT-4: Strategies and Performance
    Kunilovskaya, Maria, Chowdhury, Koel Dutta, Przybyl, Heike, España i Bonet, Cristina, and Van Genabith, Josef
    In Proceedings of the 25th Annual Conference of the European Association for Machine Translation, 24–27 June 2024
  4. TSAR-2023
    Cross-lingual Mediation: Readability Effects
    Kunilovskaya, Maria, Mitkov, Ruslan, and Wandl-Vogt, Eveline
    In Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2023), 7 September 2023
  5. RANLP
    Simultaneous Interpreting as a Noisy Channel: How Much Information Gets Through
    Kunilovskaya, Maria, Przybyl, Heike, Teich, Elke, and Lapshinova-Koltunski, Ekaterina
    In Proceedings of the International Conference on Recent Advances in Natural Language Processing, 7 September 2023
  6. EACL
    Wartime Media Monitor (WarMM-2022): A Study of Information Manipulation on Russian Social Media during the Russia-Ukraine War
    Alyukov, Maxim, Kunilovskaya, Maria, and Semenov, Andrei
    7 September 2023
  7. PhD
    Translationese indicators for human translation quality estimation (based on English-to-Russian translation of mass-media texts)
    Kunilovskaya, Maria
    7 September 2023
  8. Target
    Source language difficulties in learner translation: Evidence from an error-annotated corpus
    Kunilovskaya, Maria, Ilyushchenya, Tatyana, Morgoun, Natalia, and Mitkov, Ruslan
    Target, 7 September 2022
  9. Springer
    Translationese and register variation in English-to-Russian professional translation
    Kunilovskaya, Maria, and Corpas Pastor, Gloria
    7 September 2021
  10. Springer LNCS
    Multilingual Embeddings for Clustering Cultural Events
    Kunilovskaya, Maria, and Kuzmenko, Elizaveta
    In Analysis of Images, Social Networks and Texts, 7 September 2021
  11. EMNLP
    Translationese in Russian Literary Texts
    Kunilovskaya, Maria, Lapshinova-Koltunski, Ekaterina, and Mitkov, Ruslan
    In Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, 7 September 2021