Language identification and normalisation in code switching data tailored with a three-step decoding process
☆24Dec 23, 2019Updated 6 years ago
Alternatives and similar repositories for csnli
Users that are interested in csnli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Hindi-English Dataset for Text Normalization☆17Jan 3, 2022Updated 4 years ago
- A benchmark for code-switched NLP, ACL 2020☆76May 28, 2024Updated last year
- Archive of my notes taken at lectures in IIITH☆25Aug 15, 2021Updated 4 years ago
- ☆10Aug 1, 2018Updated 7 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆42Feb 2, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A collaborative catalog of NLP resources for Indic languages☆630Dec 14, 2024Updated last year
- A collection of basic text processing modules focused on Gujarati☆10Oct 24, 2017Updated 8 years ago
- [AAAI‘24] The official PyTorch implimentation of our AAAI 2024 paper: Personalized LoRA for Human-Centered Text Understanding☆14Dec 11, 2023Updated 2 years ago
- ☆22Oct 11, 2020Updated 5 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- A curated list of research papers and resources on code-switching☆334Jan 31, 2026Updated 2 months ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 3 years ago
- Hindi-English Transliteration Using sequence to sequence learning☆17Apr 3, 2017Updated 9 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆10Sep 19, 2018Updated 7 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆60Jul 30, 2024Updated last year
- ☆38May 14, 2024Updated last year
- This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing a…☆28Mar 15, 2023Updated 3 years ago
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆46Sep 25, 2020Updated 5 years ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 5 years ago
- a modified version of FunSeq2 using new data context☆15Aug 18, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MA-BERT Learning Representation by Incorporating Multi-Attribute Knowledge in Transformers☆16May 13, 2021Updated 4 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- This repository contains papers and resources pertaining to Hate speech research.☆44May 30, 2021Updated 4 years ago
- Benchmark datasets for sentiment analysis☆12May 18, 2020Updated 5 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)☆17Dec 18, 2023Updated 2 years ago
- ☆47Jan 23, 2020Updated 6 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆38Feb 17, 2015Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆59Aug 11, 2020Updated 5 years ago
- The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆274Oct 28, 2022Updated 3 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆18Sep 10, 2019Updated 6 years ago
- Notification Triggers for Python☆19Apr 15, 2025Updated last year
- Codebase for probing and visualizing multilingual models.☆49May 13, 2020Updated 5 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago