murali1996 / CodemixedNLPLinks

CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching

☆18

Alternatives and similar repositories for CodemixedNLP

Users that are interested in CodemixedNLP are comparing it to the libraries listed below

Sorting:

cindyxinyiwang / expand-via-lexicon-based-adaptation
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆30Updated 3 years ago
ethanachi / multilingual-probing-visualization
Codebase for probing and visualizing multilingual models.
☆49Updated 5 years ago
shijie-wu / crosslingual-nlp
This repo supports various cross-lingual transfer learning & multilingual NLP models.
☆92Updated last year
mayhewsw / multilingual-data-stats
Statistics on multilingual datasets
☆17Updated 3 years ago
machelreid / lewis
Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…
☆31Updated 2 years ago
amazon-science / contrastive-controlled-mt
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆21Updated 2 years ago
uclanlp / synpg
Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".
☆38Updated last year
timoschick / form-context-model
This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.
☆31Updated 5 years ago
facebookresearch / asset
A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
☆56Updated 2 years ago
bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆73Updated last year
martiansideofthemoon / relic-retrieval
Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).
☆20Updated 3 years ago
danieldeutsch / sacrerouge
SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.
☆144Updated 2 years ago
sebastianruder / emnlp2021-multiqa-tutorial
EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering
☆38Updated 3 years ago
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
thevasudevgupta / transformers-adapters
This repositary hosts my experiments for the project, I did with OffNote Labs.
☆10Updated 4 years ago
microsoft / LID-tool
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…
☆55Updated 4 years ago
jwieting / paraphrastic-representations-at-scale
☆75Updated 4 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48Updated 4 years ago
facebookresearch / muss
Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".
☆99Updated 2 years ago
ahmetustun / udapter
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…
☆31Updated 2 years ago
neulab / langrank
A program to choose transfer languages for cross-lingual learning
☆72Updated 2 years ago
UKPLab / maps
Multicultural Proverbs and Sayings
☆11Updated 6 months ago
deep-spin / qaware-decode
A repository for experiments in quality-aware decoding
☆17Updated 3 years ago
timoschick / bertram
This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".
☆64Updated 4 years ago
g8a9 / ear
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆49Updated 3 years ago
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆27Updated 3 years ago
ivanmontero / autobot
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17Updated 3 years ago
mhardalov / exams-qa
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
☆44Updated 3 years ago
qiang2100 / BERT-LS
Lexical Simplification with Pretrained Encoders
☆70Updated 4 years ago
mprompting / xlmrprompt
☆11Updated 3 years ago