☆179Mar 28, 2025Updated last year
Alternatives and similar repositories for pycld2
Users that are interested in pycld2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Apr 19, 2026Updated last month
- Language detection extension for spaCy 2.0+☆114Feb 12, 2019Updated 7 years ago
- Compact Language Detector 2☆899May 22, 2021Updated 5 years ago
- A fully customisable language detection pipeline for spaCy☆93May 2, 2019Updated 7 years ago
- Python bindings to the Compact Language Detector☆33Apr 30, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Port of Google's language-detection library to Python.☆1,885Mar 3, 2025Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115May 13, 2026Updated last week
- Multilingual text (NLP) processing toolkit☆2,367Nov 10, 2023Updated 2 years ago
- ☆884May 24, 2023Updated 2 years ago
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries☆20Aug 31, 2022Updated 3 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Oct 1, 2020Updated 5 years ago
- Stand-alone language identification system☆2,458Jan 1, 2020Updated 6 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Apr 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- Live survey of off-the-shelf language identification tools for python☆27Apr 13, 2022Updated 4 years ago
- Natural language detection, Java bindings for CLD2☆17Feb 26, 2026Updated 2 months ago
- Lightning Fast Language Prediction 🚀☆168Aug 22, 2025Updated 9 months ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23May 26, 2021Updated 4 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- NJUNMT for docNMT☆16Sep 9, 2020Updated 5 years ago
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 8 months ago
- Data collection, alignment and TAUS repository☆24Nov 30, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Language detection using Spacy and Fasttext☆54Dec 17, 2023Updated 2 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67May 9, 2022Updated 4 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Part-of-speech tagger implemented using a feedforward network in TensorFlow☆14Jan 15, 2018Updated 8 years ago
- Zero-Shot Cross-Lingual Abstractive Sentence Summarization through Teaching Generation and Attention☆28Oct 22, 2020Updated 5 years ago
- Post-processing OCR errors with seq2seq models☆28Jul 30, 2020Updated 5 years ago
- PETCI: A Parallel English Translation Dataset of Chinese Idioms☆31Jan 30, 2026Updated 3 months ago
- Python で全角・半角・ひらがな・カタカナ等を変換する☆16Sep 20, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Small utility to monitor fairseq training in tensorboard☆21Apr 28, 2019Updated 7 years ago
- Zero-Shot Translation implemented by Transformer☆14Mar 24, 2023Updated 3 years ago
- a ducttape workflow for neural machine translation☆14Mar 23, 2021Updated 5 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 6 months ago
- NLP, before and after spaCy☆2,242Sep 22, 2023Updated 2 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Feb 17, 2019Updated 7 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆258Nov 7, 2022Updated 3 years ago