☆70Jun 29, 2023Updated 2 years ago
Alternatives and similar repositories for indian-parallel-corpora
Users that are interested in indian-parallel-corpora are comparing it to the libraries listed below
Sorting:
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Tamil Language words list☆12Jul 2, 2016Updated 9 years ago
- This project scrapes text from Telugu books(Novels)☆10Aug 3, 2021Updated 4 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- Language Modelling, CMI vs Perplexity☆11Mar 17, 2018Updated 7 years ago
- ☆10Aug 1, 2018Updated 7 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Efficient and easy to use transliteration for Indian languages☆50Aug 7, 2020Updated 5 years ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …☆15Aug 31, 2021Updated 4 years ago
- Transliteration module for Indian Languages☆79Oct 24, 2025Updated 4 months ago
- Hinglish Text Classification☆30Jun 12, 2023Updated 2 years ago
- Speeech Recognition for Indic languages.☆13Apr 3, 2021Updated 4 years ago
- Code Repository for the IndicXNLI paper.☆15Jul 8, 2023Updated 2 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- ☆30Nov 1, 2019Updated 6 years ago
- ☆34Nov 22, 2021Updated 4 years ago
- A collaborative catalog of NLP resources for Indic languages☆627Dec 14, 2024Updated last year
- Description Describes the IndicNLP corpus and associated datasets☆195Apr 16, 2023Updated 2 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Dec 8, 2022Updated 3 years ago
- It is a simple tool to convert roman script to indic(Devanagari) script. As most Keyboards are English and to write in Indic script is di…☆13Aug 31, 2016Updated 9 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Apr 7, 2021Updated 4 years ago
- This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.☆14Sep 8, 2017Updated 8 years ago
- ☆32Jun 2, 2021Updated 4 years ago
- Exploring the Limits of Low-Resource Neural Machine Translation☆34Feb 16, 2023Updated 3 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆17Jan 18, 2021Updated 5 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- A Python based API to access Indian language WordNets.☆38Apr 26, 2022Updated 3 years ago
- Resources to go with the Indic NLP Library☆78Jun 12, 2022Updated 3 years ago
- Hindi-English Transliteration Using sequence to sequence learning☆17Apr 3, 2017Updated 8 years ago
- Softcatalà neural translation models☆20Jan 17, 2026Updated last month
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆291May 11, 2023Updated 2 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆42Feb 2, 2023Updated 3 years ago
- motivational website to do something special this month☆21Jan 11, 2024Updated 2 years ago
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 3 years ago
- ☆23May 5, 2022Updated 3 years ago
- Dockerized NMT frameworks for nmt-wizard☆39Apr 18, 2023Updated 2 years ago
- Transliterating English to Hindi using Recurrent Neural Networks☆45May 3, 2017Updated 8 years ago
- Versioned Sanskrit linguistic data☆20Nov 5, 2024Updated last year