bhaddow/pmindia-crawler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bhaddow/pmindia-crawler)

bhaddow / pmindia-crawler

Code for extracting parallel corpora from pmindia

☆17

Alternatives and similar repositories for pmindia-crawler

Users that are interested in pmindia-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

transducens / LASERtrain
View on GitHub
☆22Dec 20, 2019Updated 6 years ago
Joon-Park92 / Zero-Shot-Translation-Transformer
View on GitHub
Zero-Shot Translation implemented by Transformer
☆14Mar 24, 2023Updated 3 years ago
rwth-i6 / CharacTER
View on GitHub
☆24Feb 4, 2020Updated 6 years ago
LCS2-IIITD / DaSLaM
View on GitHub
☆17Oct 31, 2023Updated 2 years ago
goru001 / nlp-for-odia
View on GitHub
State of the Art Language models and Classifier for Odia, which is spoken in the Indian state of Odisha
☆14Aug 7, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
soumendrak / MTEnglish2Odia
View on GitHub
Machine Translation from English to Odia language.
☆10Aug 9, 2021Updated 4 years ago
shahparth123 / eng_guj_parallel_corpus
View on GitHub
This repository contains dataset for english to gujarati translation
☆10Dec 27, 2020Updated 5 years ago
zyocum / dedup
View on GitHub
Find duplicate text files.
☆14Jan 14, 2025Updated last year
ashwanitanwar / nmt-transfer-learning-xlm-r
View on GitHub
Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning
☆20Nov 3, 2022Updated 3 years ago
gchhablani / multilingual-image-captioning
View on GitHub
☆43Aug 2, 2021Updated 4 years ago
tezansahu / ai-garage
View on GitHub
Mini-Projects using Cutting-Edge AI Frameworks
☆16Jul 7, 2026Updated 3 weeks ago
isi-nlp / rtg
View on GitHub
Reader Translator Generator - NMT toolkit based on pytorch
☆31Sep 12, 2023Updated 2 years ago
Krushna-007 / adrishyam
View on GitHub
☆10Jun 13, 2025Updated last year
jsenellart / papers
View on GitHub
This repo is containing notes and implementations for cherry-picked publications of my particular interest
☆12May 14, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
eloffel / improved_embeddings
View on GitHub
Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)
☆11May 2, 2019Updated 7 years ago
bert-nmt / ctx-bert-nmt
View on GitHub
Extend bert-nmt to context-aware translation.
☆11May 24, 2021Updated 5 years ago
thammegowda / tika-ner-corenlp
View on GitHub
Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
☆13Feb 26, 2022Updated 4 years ago
hauntsaninja / boostedblob
View on GitHub
Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage pa…
☆39Apr 7, 2026Updated 3 months ago
yunsukim86 / sockeye-transfer
View on GitHub
Transfer learning for neural machine translation using cross-lingual word embeddings
☆10Dec 17, 2025Updated 7 months ago
crockpotveggies / dl4j-examples
View on GitHub
Deeplearning4j Examples (DL4J, DL4J Spark, DataVec)
☆10Aug 16, 2018Updated 7 years ago
neulab / contextual-mt
View on GitHub
A repository with the code related to experiments around context-aware machine translation
☆51Sep 22, 2025Updated 10 months ago
CodedotAl / reading-group
View on GitHub
Information about the CodedotAI reading group sessions.
☆13Aug 16, 2021Updated 4 years ago
OdiaGenAI / GenerativeAI_and_LLM_Odia
View on GitHub
☆31Oct 8, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AI4Bharat / indicnlp_catalog
View on GitHub
A collaborative catalog of NLP resources for Indic languages
☆638Dec 14, 2024Updated last year
thammegowda / mtdata
View on GitHub
A tool that locates, downloads, and extracts machine translation corpora
☆167Apr 13, 2026Updated 3 months ago
NVIDIA / image-captioner
View on GitHub
A tool for captioning, visualizing and analyzing image datasets
☆25Oct 23, 2025Updated 9 months ago
mrinaldhar / en-hi-codemixed-corpus
View on GitHub
Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus
☆13Feb 17, 2019Updated 7 years ago
lyy1994 / reformer
View on GitHub
An NMT framework built on Joint Representation
☆12Feb 19, 2020Updated 6 years ago
marian-nmt / sotastream
View on GitHub
A library for data streaming and augmentation
☆22May 5, 2025Updated last year
SimengSun / alpaca_farm_lora
View on GitHub
☆22Sep 19, 2023Updated 2 years ago
isl-mt / SLT.KIT
View on GitHub
Spoken Language Translation System
☆20Jul 26, 2021Updated 5 years ago
IamAdiSri / hf-trim
View on GitHub
Reduce the size of pretrained Hugging Face models via vocabulary trimming.
☆49Dec 28, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
midas-research / sismo-wsdm
View on GitHub
Code release for "Towards Ordinal Suicide Ideation Detection on Social Media", WSDM 2021.
☆15Mar 8, 2021Updated 5 years ago
JohannesBuchner / condor_optimization
View on GitHub
CONDOR (COnstrained, Non-linear, Direct, parallel Optimization using trust Region method for high-computing load function) allows continu…
☆19Apr 26, 2009Updated 17 years ago
CPSSD / LUCAS
View on GitHub
The repository for the LUCAS/Lucify project
☆11Apr 4, 2020Updated 6 years ago
j-luo93 / MorphForest
View on GitHub
Code for Unsupervised Learning of Morphological Forest
☆14Aug 12, 2019Updated 6 years ago
MysteryVaibhav / robust_mtnt
View on GitHub
Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise"
☆21Dec 23, 2019Updated 6 years ago
AI4Bharat / indicTrans
View on GitHub
indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
☆141Jan 2, 2024Updated 2 years ago
bentrevett / pytorch-neural-style-transfer
View on GitHub
☆13Jul 10, 2020Updated 6 years ago