Code for extracting parallel corpora from pmindia
☆17 · Jan 28, 2020 · Updated 6 years ago
Alternatives and similar repositories for pmindia-crawler
Users that are interested in pmindia-crawler are comparing it to the libraries listed below.
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 · Mar 5, 2022 · Updated 4 years ago
- Zero-Shot Translation implemented by Transformer ☆14 · Mar 24, 2023 · Updated 3 years ago
- a ducttape workflow for neural machine translation ☆14 · Mar 23, 2021 · Updated 5 years ago
- ☆23 · Feb 4, 2020 · Updated 6 years ago
- State of the Art Language models and Classifier for Odia, which is spoken in the Indian state of Odisha ☆14 · Aug 7, 2020 · Updated 5 years ago
- Machine Translation from English to Odia language. ☆10 · Aug 9, 2021 · Updated 4 years ago
- Mini-Projects using Cutting-Edge AI Frameworks ☆15 · Apr 3, 2026 · Updated last month
- Find duplicate text files. ☆14 · Jan 14, 2025 · Updated last year
- This repository contains dataset for english to gujarati translation ☆10 · Dec 27, 2020 · Updated 5 years ago
- ☆44 · Aug 2, 2021 · Updated 4 years ago
- Reader Translator Generator - NMT toolkit based on pytorch ☆32 · Sep 12, 2023 · Updated 2 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project) ☆11 · May 2, 2019 · Updated 7 years ago
- Extend bert-nmt to context-aware translation. ☆11 · May 24, 2021 · Updated 5 years ago
- Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage pa… ☆39 · Apr 7, 2026 · Updated last month
- Which ML are you? ☆13 · Jan 3, 2023 · Updated 3 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings ☆10 · Dec 17, 2025 · Updated 5 months ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser ☆13 · Feb 26, 2022 · Updated 4 years ago
- ☆14 · Jan 4, 2021 · Updated 5 years ago
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 · Jan 2, 2020 · Updated 6 years ago
- This repo is containing notes and implementations for cherry-picked publications of my particular interest ☆12 · May 14, 2020 · Updated 6 years ago
- A repository with the code related to experiments around context-aware machine translation ☆51 · Sep 22, 2025 · Updated 8 months ago
- Information about the CodedotAI reading group sessions. ☆12 · Aug 16, 2021 · Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆40 · Jul 14, 2020 · Updated 5 years ago
- An extended version of Scala's scaladoc command ☆21 · Jul 2, 2011 · Updated 14 years ago
- Translation quality evaluation for Firefox Translations models ☆12 · Oct 23, 2023 · Updated 2 years ago
- A collaborative catalog of NLP resources for Indic languages ☆632 · Dec 14, 2024 · Updated last year
- A tool that locates, downloads, and extracts machine translation corpora ☆164 · Apr 13, 2026 · Updated last month
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Feb 17, 2019 · Updated 7 years ago
- ☆73 · Nov 27, 2025 · Updated 6 months ago
- ☆22 · Sep 19, 2023 · Updated 2 years ago
- This is the repository for my version of Kaldi for Dummies example. ☆17 · Nov 18, 2018 · Updated 7 years ago
- Spoken Language Translation System ☆20 · Jul 26, 2021 · Updated 4 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy ☆12 · Apr 4, 2017 · Updated 9 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming. ☆49 · Dec 28, 2022 · Updated 3 years ago
- universal tokenizer ☆17 · Nov 29, 2021 · Updated 4 years ago
- A library for data streaming and augmentation ☆21 · May 5, 2025 · Updated last year
- Resources and tools for Indian language Natural Language Processing ☆637 · Jun 7, 2024 · Updated last year
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise" ☆21 · Dec 23, 2019 · Updated 6 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2 ☆139 · Jan 2, 2024 · Updated 2 years ago