Language identification and normalisation in code switching data tailored with a three-step decoding process
☆24Dec 23, 2019Updated 6 years ago
Alternatives and similar repositories for csnli
Users that are interested in csnli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Feb 17, 2019Updated 7 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A Hindi-English Dataset for Text Normalization☆17Jan 3, 2022Updated 4 years ago
- Language Identification and transliteration tool for Indian language code mixed data.☆24Feb 29, 2016Updated 10 years ago
- A benchmark for code-switched NLP, ACL 2020☆76May 28, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Archive of my notes taken at lectures in IIITH☆25Aug 15, 2021Updated 4 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆59Jul 9, 2021Updated 4 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆37Jan 14, 2024Updated 2 years ago
- A helper repository for converting Jupyter notebooks into a wordpress-friendly format☆12Dec 11, 2016Updated 9 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆42Feb 2, 2023Updated 3 years ago
- A collaborative catalog of NLP resources for Indic languages☆629Dec 14, 2024Updated last year
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- A collection of basic text processing modules focused on Gujarati☆10Oct 24, 2017Updated 8 years ago
- A tutorial on how to build your own Neural Language Model☆10Dec 8, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆22Oct 11, 2020Updated 5 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- A curated list of research papers and resources on code-switching☆333Jan 31, 2026Updated last month
- Competitive Programming C++ Sublime Text Snippets☆14Oct 12, 2019Updated 6 years ago
- ☆13Oct 3, 2024Updated last year
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- Hindi-English Transliteration Using sequence to sequence learning☆17Apr 3, 2017Updated 8 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆59Jul 30, 2024Updated last year
- This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing a…☆28Mar 15, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- ☆14Jul 7, 2021Updated 4 years ago
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆46Sep 25, 2020Updated 5 years ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 4 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 6 years ago
- This is our rad 90's website.☆15Mar 13, 2026Updated last week
- [JOHD 23] This repository hosts the code to get the artifects of Cuneiform in the paper CuneiML: A Cuneiform Dataset for Machine Learning…☆15Jul 26, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆47Jan 23, 2020Updated 6 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus☆38Feb 17, 2015Updated 11 years ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- Catalog of abusive language data (PLoS 2020)☆323Jun 14, 2024Updated last year
- The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆274Oct 28, 2022Updated 3 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆17Sep 10, 2019Updated 6 years ago