CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Updated Jun 15, 2023
This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), July 9-14, 2023.
Improving Indonesian text classification using multilingual language model
Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024
Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)
TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes
Cross-lingual language models for building search engines for the Holy Quran and Sahih Hadiths
[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
Cascading Adaptors to Leverage English Data to Improve Performance of Question Answering for Low-Resource Languages
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages (ACL 2024)
In this work, we apply the concept of multilingual zero-shot transfer to the task of toxic comment detection. This concept allows a model trained only on a single-language dataset to work in an arbitrary language, even a low-resource one.
repo for the LREC-COLING 2024 paper
Code for importance-weighted domain alignment, and the paper “Cross-Lingual Transfer with Class-Weighted Language-Invariant Representations”.
repo for "An Invasive Embedding Model in Favor of Low-Resource Languages Understanding" (2025)
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Official repository of the work titled "High-Dimensional Interlingual Representations of Large Language Models"
This repository contains the code for the experiments related to higher-level semantic tasks and related to the meta-learning from: "From Zero to Hero: On the Limitations of Zero Shot Cross-Lingual Transfer"