Code for extracting parallel corpora from pmindia
☆17 · Jan 28, 2020 · Updated 6 years ago
Alternatives and similar repositories for pmindia-crawler
Users that are interested in pmindia-crawler are comparing it to the libraries listed below.
- ☆22 · Dec 20, 2019 · Updated 6 years ago
- Zero-Shot Translation implemented by Transformer ☆14 · Mar 24, 2023 · Updated 3 years ago
- ☆23 · Feb 4, 2020 · Updated 6 years ago
- ☆17 · Oct 31, 2023 · Updated 2 years ago
- State of the Art Language models and Classifier for Odia, which is spoken in the Indian state of Odisha ☆14 · Aug 7, 2020 · Updated 5 years ago
- Machine Translation from English to Odia language. ☆10 · Aug 9, 2021 · Updated 4 years ago
- Find duplicate text files. ☆15 · Jan 14, 2025 · Updated last year
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ☆11 · Apr 6, 2025 · Updated 11 months ago
- ☆44 · Aug 2, 2021 · Updated 4 years ago
- Reader Translator Generator - NMT toolkit based on pytorch ☆32 · Sep 12, 2023 · Updated 2 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project) ☆11 · May 2, 2019 · Updated 6 years ago
- Extend bert-nmt to context-aware translation. ☆11 · May 24, 2021 · Updated 4 years ago
- Continuous Space Language and Translation Model Toolkit ☆12 · Aug 12, 2015 · Updated 10 years ago
- Which ML are you? ☆13 · Jan 3, 2023 · Updated 3 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings ☆10 · Dec 17, 2025 · Updated 3 months ago
- ☆14 · Jan 4, 2021 · Updated 5 years ago
- This repo is containing notes and implementations for cherry-picked publications of my particular interest ☆12 · May 14, 2020 · Updated 5 years ago
- Deeplearning4j Examples (DL4J, DL4J Spark, DataVec) ☆10 · Aug 16, 2018 · Updated 7 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Nov 30, 2022 · Updated 3 years ago
- Information about the CodedotAI reading group sessions. ☆12 · Aug 16, 2021 · Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆40 · Jul 14, 2020 · Updated 5 years ago
- An extended version of Scala's scaladoc command ☆21 · Jul 2, 2011 · Updated 14 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆163 · Updated this week
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Feb 17, 2019 · Updated 7 years ago
- An NMT framework built on Joint Representation ☆12 · Feb 19, 2020 · Updated 6 years ago
- ☆22 · Sep 19, 2023 · Updated 2 years ago
- ☆31 · Oct 8, 2023 · Updated 2 years ago
- ISI tutorials ☆12 · Oct 28, 2016 · Updated 9 years ago
- Spoken Language Translation System ☆20 · Jul 26, 2021 · Updated 4 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy ☆12 · Apr 4, 2017 · Updated 8 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming. ☆48 · Dec 28, 2022 · Updated 3 years ago
- Code release for "Towards Ordinal Suicide Ideation Detection on Social Media", WSDM 2021. ☆15 · Mar 8, 2021 · Updated 5 years ago
- The repository for the LUCAS/Lucify project ☆11 · Apr 4, 2020 · Updated 5 years ago
- Code for Unsupervised Learning of Morphological Forest ☆14 · Aug 12, 2019 · Updated 6 years ago
- Resources and tools for Indian language Natural Language Processing ☆632 · Jun 7, 2024 · Updated last year
- universal tokenizer ☆17 · Nov 29, 2021 · Updated 4 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2 ☆137 · Jan 2, 2024 · Updated 2 years ago
- ☆13 · Jul 10, 2020 · Updated 5 years ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l… ☆11 · Jul 10, 2020 · Updated 5 years ago