mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
★11 · Jan 19, 2024 · Updated 2 years ago
Alternatives and similar repositories for mPLM-Sim
Users who are interested in mPLM-Sim are comparing it to the libraries listed below.
- GlotWeb: Web Indexing for Minority Languages (WWW 2026) · ★17 · Aug 13, 2025 · Updated 6 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages (ACL 2023) · ★106 · Apr 20, 2024 · Updated last year
- ★21 · Dec 5, 2022 · Updated 3 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages" · ★13 · Sep 17, 2021 · Updated 4 years ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings) · ★13 · Nov 22, 2023 · Updated 2 years ago
- PyTorch implementation of CARE · ★16 · Oct 6, 2023 · Updated 2 years ago
- PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x. · ★11 · Aug 29, 2021 · Updated 4 years ago
- Official code for the NeurIPS 2025 paper "RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25… · ★23 · Dec 10, 2025 · Updated 2 months ago
- ★18 · Jun 24, 2025 · Updated 8 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023) · ★18 · Oct 17, 2023 · Updated 2 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining · ★18 · Nov 26, 2023 · Updated 2 years ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings) · ★16 · Jul 2, 2024 · Updated last year
- Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models · ★36 · Oct 1, 2025 · Updated 5 months ago
- ★21 · Aug 9, 2024 · Updated last year
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource… · ★26 · Feb 16, 2026 · Updated 2 weeks ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… · ★28 · Apr 17, 2024 · Updated last year
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023) · ★21 · Nov 10, 2025 · Updated 3 months ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" · ★26 · Jun 3, 2025 · Updated 9 months ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he… · ★27 · Aug 8, 2025 · Updated 6 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024) · ★25 · Oct 18, 2025 · Updated 4 months ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings · ★12 · Jun 28, 2022 · Updated 3 years ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings). · ★28 · Jun 12, 2025 · Updated 8 months ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023) · ★31 · Oct 18, 2025 · Updated 4 months ago
- This is the official implementation for Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models. · ★32 · Jun 16, 2023 · Updated 2 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach · ★32 · Nov 6, 2023 · Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… · ★31 · Dec 5, 2022 · Updated 3 years ago
- ★14 · Jan 17, 2024 · Updated 2 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min… · ★10 · Feb 24, 2026 · Updated last week
- ★10 · Jul 29, 2022 · Updated 3 years ago
- Extract information from XBRL files in the ESEF format · ★13 · Jan 3, 2026 · Updated 2 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective · ★40 · Oct 17, 2023 · Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 · ★11 · Apr 13, 2025 · Updated 10 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction… · ★11 · Mar 20, 2023 · Updated 2 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023) · ★39 · Nov 8, 2023 · Updated 2 years ago
- A dehazing method for remote sensing images, based on CycleGAN. · ★12 · May 10, 2022 · Updated 3 years ago
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines · ★12 · Aug 16, 2022 · Updated 3 years ago
- Deep metric learning: Triplet, Magnet and VMF loss · ★11 · Aug 19, 2022 · Updated 3 years ago
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP · ★19 · Nov 11, 2025 · Updated 3 months ago
- rabitq rust implementation · ★10 · Feb 4, 2026 · Updated 3 weeks ago