☆179 · Mar 28, 2025 · Updated last year
Alternatives and similar repositories for pycld2
Users that are interested in pycld2 are comparing it to the libraries listed below.
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆154 · Apr 19, 2026 · Updated last month
- Language detection extension for spaCy 2.0+ ☆114 · Feb 12, 2019 · Updated 7 years ago
- A fully customisable language detection pipeline for spaCy ☆93 · May 2, 2019 · Updated 7 years ago
- Python bindings to the Compact Language Detector ☆33 · Apr 30, 2020 · Updated 6 years ago
- Port of Google's language-detection library to Python. ☆1,890 · Mar 3, 2025 · Updated last year
- Multilingual text (NLP) processing toolkit ☆2,366 · Nov 10, 2023 · Updated 2 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- c++ mosestokenizer ☆18 · Mar 13, 2024 · Updated 2 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆160 · Oct 1, 2020 · Updated 5 years ago
- Stand-alone language identification system ☆2,460 · Jan 1, 2020 · Updated 6 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆165 · Apr 13, 2026 · Updated 2 months ago
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 · Nov 2, 2021 · Updated 4 years ago
- Live survey of off-the-shelf language identification tools for python ☆27 · Apr 13, 2022 · Updated 4 years ago
- Python bindings for CLD2. ☆17 · Aug 9, 2018 · Updated 7 years ago
- Natural language detection, Java bindings for CLD2 ☆17 · Feb 26, 2026 · Updated 3 months ago
- Lightning Fast Language Prediction 🚀 ☆168 · Aug 22, 2025 · Updated 9 months ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 · May 26, 2021 · Updated 5 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 · Jun 23, 2024 · Updated last year
- NJUNMT for docNMT ☆16 · Sep 9, 2020 · Updated 5 years ago
- A repository with the code related to experiments around context-aware machine translation ☆51 · Sep 22, 2025 · Updated 8 months ago
- Data collection, alignment and TAUS repository ☆24 · Nov 30, 2017 · Updated 8 years ago
- NLP module for Golang ☆13 · Jul 19, 2017 · Updated 8 years ago
- Collection of Common Machine Translation Tools ☆11 · Jul 26, 2022 · Updated 3 years ago
- Language detection using Spacy and Fasttext ☆54 · Dec 17, 2023 · Updated 2 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks ☆67 · May 9, 2022 · Updated 4 years ago
- Part-of-speech tagger implemented using a feedforward network in TensorFlow ☆14 · Jan 15, 2018 · Updated 8 years ago
- Improved ParaBank Rewriter ☆22 · Jan 22, 2020 · Updated 6 years ago
- Zero-Shot Cross-Lingual Abstractive Sentence Summarization through Teaching Generation and Attention ☆28 · Oct 22, 2020 · Updated 5 years ago
- Post-processing OCR errors with seq2seq models ☆28 · Jul 30, 2020 · Updated 5 years ago
- Bot for collecting tickers from popular exchanges using API. (poloniex, bittrex, bitmex, bitfinex, gdax) ☆11 · Dec 8, 2022 · Updated 3 years ago
- Small utility to monitor fairseq training in tensorboard ☆21 · Apr 28, 2019 · Updated 7 years ago
- Zero-Shot Translation implemented by Transformer ☆14 · Mar 24, 2023 · Updated 3 years ago
- ☆18 · Sep 16, 2022 · Updated 3 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations ☆23 · Nov 5, 2025 · Updated 7 months ago
- NLP, before and after spaCy ☆2,241 · Sep 22, 2023 · Updated 2 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Feb 17, 2019 · Updated 7 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆258 · Nov 7, 2022 · Updated 3 years ago
- Some good (maybe) papers about NMT (Neural Machine Translation). ☆85 · Jan 15, 2020 · Updated 6 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB) ☆20 · Nov 23, 2024 · Updated last year