indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
☆137Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for indicTrans
Users who are interested in indicTrans are comparing it to the libraries listed below.
- A collaborative catalog of NLP resources for Indic languages☆627Dec 14, 2024Updated last year
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆292May 11, 2023Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆59Jul 9, 2021Updated 4 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆206May 27, 2020Updated 5 years ago
- Shoonya - Platform to Annotate and label data at scale.☆65Oct 31, 2025Updated 4 months ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 5 months ago
- Code for extracting parallel corpora from pmindia☆17Jan 28, 2020Updated 6 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations in a RAG chatbot.☆10May 2, 2024Updated last year
- Resources to go with the Indic NLP Library☆78Jun 12, 2022Updated 3 years ago
- Transliteration models for 21 Indic languages☆114Oct 13, 2023Updated 2 years ago
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- ☆15Jan 29, 2025Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Mar 5, 2022Updated 4 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Feb 17, 2019Updated 7 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆11Jan 1, 2023Updated 3 years ago
- Extract metadata from a video to an sqlite database☆20May 23, 2024Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆78Jun 8, 2025Updated 9 months ago
- Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over☆113Oct 31, 2025Updated 4 months ago
- NLP Application Project☆21May 4, 2019Updated 6 years ago
- Longformer Encoder Decoder model for the legal domain, trained for long document abstractive summarization task.☆10Feb 26, 2021Updated 5 years ago
- Yet Another Neural Machine Translation Toolkit☆179Mar 7, 2025Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆16Jun 6, 2023Updated 2 years ago
- Aksharamukha Python Library☆59Feb 2, 2025Updated last year
- This repo contains notes and implementations for cherry-picked publications of particular interest to me☆12May 14, 2020Updated 5 years ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 7 months ago
- Submission for the Programming Task for the Precog Recruitment Process (II)☆14Jul 29, 2016Updated 9 years ago
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 4 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆82Dec 24, 2021Updated 4 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 5 months ago
- ☆10Jan 31, 2022Updated 4 years ago
- Portfolio with data science and machine learning projects I developed during my training in data science.☆10Jan 4, 2021Updated 5 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 6 months ago
- Generate video with voice narration from PPT/PDF slides☆10Sep 4, 2023Updated 2 years ago
- ☆58Jan 26, 2026Updated last month
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- Curated list of publicly available parallel corpus for Indian Languages☆37Jul 15, 2021Updated 4 years ago