☆179 · Mar 28, 2025 · Updated last year
Alternatives and similar repositories for pycld2
Users who are interested in pycld2 are comparing it to the libraries listed below.
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆155 · Jun 26, 2023 · Updated 2 years ago
- Language detection extension for spaCy 2.0+ ☆114 · Feb 12, 2019 · Updated 7 years ago
- Compact Language Detector 2 ☆896 · May 22, 2021 · Updated 4 years ago
- A fully customisable language detection pipeline for spaCy ☆93 · May 2, 2019 · Updated 6 years ago
- Python bindings to the Compact Language Detector ☆33 · Apr 30, 2020 · Updated 5 years ago
- Port of Google's language-detection library to Python. ☆1,881 · Mar 3, 2025 · Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆115 · Apr 1, 2026 · Updated last week
- Multilingual text (NLP) processing toolkit ☆2,369 · Nov 10, 2023 · Updated 2 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- ☆876 · May 24, 2023 · Updated 2 years ago
- c++ mosestokenizer ☆18 · Mar 13, 2024 · Updated 2 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries ☆20 · Aug 31, 2022 · Updated 3 years ago
- Stand-alone language identification system ☆2,453 · Jan 1, 2020 · Updated 6 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆163 · Updated this week
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 · Nov 2, 2021 · Updated 4 years ago
- Live survey of off-the-shelf language identification tools for python ☆27 · Apr 13, 2022 · Updated 4 years ago
- Natural language detection, Java bindings for CLD2 ☆17 · Feb 26, 2026 · Updated last month
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 · Jun 23, 2024 · Updated last year
- NJUNMT for docNMT ☆16 · Sep 9, 2020 · Updated 5 years ago
- Data collection, alignment and TAUS repository ☆23 · Nov 30, 2017 · Updated 8 years ago
- Zero -- A neural machine translation system ☆152 · May 8, 2023 · Updated 2 years ago
- NLP module for Golang ☆13 · Jul 19, 2017 · Updated 8 years ago
- Collection of Common Machine Translation Tools ☆11 · Jul 26, 2022 · Updated 3 years ago
- Language detection using Spacy and Fasttext ☆56 · Dec 17, 2023 · Updated 2 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks ☆67 · May 9, 2022 · Updated 3 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- Zero-Shot Cross-Lingual Abstractive Sentence Summarization through Teaching Generation and Attention ☆28 · Oct 22, 2020 · Updated 5 years ago
- Post-processing OCR errors with seq2seq models ☆28 · Jul 30, 2020 · Updated 5 years ago
- PETCI: A Parallel English Translation Dataset of Chinese Idioms ☆31 · Jan 30, 2026 · Updated 2 months ago
- ☆18 · Sep 16, 2022 · Updated 3 years ago
- a ducttape workflow for neural machine translation ☆14 · Mar 23, 2021 · Updated 5 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations ☆23 · Nov 5, 2025 · Updated 5 months ago
- Confection: the sweetest config system for Python ☆194 · Mar 27, 2026 · Updated 2 weeks ago
- NLP, before and after spaCy ☆2,239 · Sep 22, 2023 · Updated 2 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆38 · Feb 5, 2026 · Updated 2 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆259 · Nov 7, 2022 · Updated 3 years ago
- Some good (maybe) papers about NMT (Neural Machine Translation). ☆85 · Jan 15, 2020 · Updated 6 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB) ☆19 · Nov 23, 2024 · Updated last year
- Tools to download and cleanup Common Crawl data ☆1,040 · Apr 25, 2023 · Updated 2 years ago