sagorbrur / codeswitch
CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.
☆31Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for codeswitch
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆61Updated 3 years ago
- ☆38Updated 4 years ago
- Neural network sequence labeling model☆11Updated 4 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆45Updated 2 years ago
- Codebase for probing and visualizing multilingual models.☆45Updated 4 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 4 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 3 months ago
- Contains all teaching material used in ACL 2020 Tutorial "Reviewing NLP" given on July 5 2020☆16Updated 4 years ago
- A program to choose transfer languages for cross-lingual learning☆71Updated last year
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 4 years ago
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages.☆21Updated 2 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- ☆29Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆20Updated 4 years ago
- ☆16Updated 3 months ago
- The Benchmark of Linguistic Minimal Pairs☆142Updated last year
- ☆23Updated 4 years ago
- Survey on machine learning.☆14Updated 3 years ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Updated 3 years ago
- EXPATS: A Toolkit for Explainable Automated Text Scoring☆21Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆32Updated 5 months ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆11Updated 3 years ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago