    Repositories list

    • Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
      Jupyter Notebook
      22300 Updated Oct 4, 2024
    • Unsupervised translation direction detection using NMT systems
      Python
      MIT License
      2400 Updated Aug 12, 2024
    • Data and code for the paper "Yes, no, maybe? Revisiting language models' response stability under paraphrasing for the assessment of political leaning"
      Python
      GNU General Public License v3.0
      0000 Updated Aug 4, 2024
    • Repository for data and evaluation of 2024 Shared Task on SDG classification held by the Swiss Text Conference.
      Python
      GNU Affero General Public License v3.0
      2500 Updated Jul 9, 2024
    • Code for the paper "Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect"
      Python
      MIT License
      1200 Updated Jun 25, 2024
    • xstance
      Public
      A Multilingual Multi-Target Dataset for Stance Detection
      Python
      MIT License
      43401 Updated Jun 17, 2024
    • mbr
      Public
      Minimum Bayes Risk Decoding for Hugging Face Transformers
      Python
      Apache License 2.0
      75610 Updated Jun 3, 2024
    • Code for the 2023 ACL Findings paper, Uncovering Hidden Consequences of Pre-training Objectives in Sequence-to-Sequence Models (Kew & Sennrich, 2023)
      Jupyter Notebook
      0100 Updated May 31, 2024
    • Code for the paper "Target-Level Sentence Simplification as Controlled Paraphrasing" (TSAR 2022)
      Jupyter Notebook
      1000 Updated Mar 29, 2024
    • sockeye
      Public
      Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
      Python
      Apache License 2.0
      323000 Updated Mar 21, 2024
    • Code for hospitality review response generation
      Jupyter Notebook
      0200 Updated Feb 15, 2024
    • The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding"
      Python
      MIT License
      63200 Updated Jan 25, 2024
    • swissbert
      Public
      The multilingual language model for Switzerland
      Jupyter Notebook
      MIT License
      32500 Updated Jan 19, 2024
    • nmtscore
      Public
      A library of translation-based text similarity measures
      Python
      MIT License
      62521 Updated Dec 11, 2023
    • BLESS
      Public
      Code for the EMNLP 2023 paper "BLESS: Benchmarking Large Language Models on Sentence Simplification"
      Jupyter Notebook
      MIT License
      3610 Updated Nov 21, 2023
    • The implementation of "Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models"
      Python
      MIT License
      1300 Updated Nov 14, 2023
    • Code for the paper "Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents"
      Python
      MIT License
      0200 Updated Oct 23, 2023
    • 🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      27k000 Updated Aug 31, 2023
    • 20Minuten
      Public
      Jupyter Notebook
      1310 Updated Aug 17, 2023
    • Code for the ACL 2020 paper "Semi-supervised Contextual Historical Text Normalization" by Peter Makarov and Simon Clematide
      Python
      0300 Updated Jun 30, 2023
    • Code for the paper "Voting Booklet Bias: Stance Detection in Swiss Federal Communication"
      Jupyter Notebook
      1200 Updated Jun 12, 2023
    • Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning" (ACL 2022)
      Python
      42100 Updated Apr 13, 2023
    • Code and data for the paper "Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation"
      Python
      MIT License
      0000 Updated Apr 11, 2023
    • Data and code for the paper "Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET"
      Python
      MIT License
      0600 Updated Apr 11, 2023
    • segtest
      Public
      A Test Suite for Morphological Phenomena in Neural Machine Translation
      Shell
      MIT License
      1700 Updated Apr 11, 2023
    • Code for the paper "On Romanization for Model Transfer Between Scripts in Neural Machine Translation"
      Mathematica
      0100 Updated Apr 11, 2023
    • Python
      1002 Updated Dec 8, 2022
    • Data and code accompanying the paper "On the Limits of Minimal Pairs in Contrastive Evaluation"
      Python
      MIT License
      1300 Updated Nov 11, 2022
    • Shell
      MIT License
      21700 Updated Apr 28, 2022
    • Code and data accompanying the paper "Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias"
      Python
      MIT License
      3300 Updated Nov 4, 2021