☆178 Mar 28, 2025 Updated 11 months ago
Alternatives and similar repositories for pycld2
Users that are interested in pycld2 are comparing it to the libraries listed below
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆155 Jun 26, 2023 Updated 2 years ago
- A fully customisable language detection pipeline for spaCy ☆93 May 2, 2019 Updated 6 years ago
- Language detection extension for spaCy 2.0+ ☆114 Feb 12, 2019 Updated 7 years ago
- Port of Google's language-detection library to Python. ☆1,872 Mar 3, 2025 Updated 11 months ago
- Compact Language Detector 2 ☆894 May 22, 2021 Updated 4 years ago
- Multilingual text (NLP) processing toolkit ☆2,364 Nov 10, 2023 Updated 2 years ago
- Python bindings to the Compact Language Detector ☆33 Apr 30, 2020 Updated 5 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆162 Sep 18, 2025 Updated 5 months ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries ☆20 Aug 31, 2022 Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit ☆115 Feb 11, 2026 Updated 2 weeks ago
- ☆873 May 24, 2023 Updated 2 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 Aug 27, 2024 Updated last year
- Stand-alone language identification system ☆2,453 Jan 1, 2020 Updated 6 years ago
- Live survey of off-the-shelf language identification tools for python ☆27 Apr 13, 2022 Updated 3 years ago
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 Nov 2, 2021 Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆161 Oct 1, 2020 Updated 5 years ago
- Natural language detection, Java bindings for CLD2 ☆17 Updated this week
- Document-Level Neural Machine Translation with Hierarchical Attention Networks ☆67 May 9, 2022 Updated 3 years ago
- Lightning Fast Language Prediction 🚀 ☆167 Aug 22, 2025 Updated 6 months ago
- Confection: the sweetest config system for Python ☆193 Feb 9, 2026 Updated 2 weeks ago
- A repository with the code related to experiments around context-aware machine translation ☆51 Sep 22, 2025 Updated 5 months ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations ☆23 Nov 5, 2025 Updated 3 months ago
- NLP module for Golang ☆13 Jul 19, 2017 Updated 8 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆257 Nov 7, 2022 Updated 3 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 May 26, 2021 Updated 4 years ago
- PETCI: A Parallel English Translation Dataset of Chinese Idioms ☆29 Jan 30, 2026 Updated last month
- Small utility to monitor fairseq training in tensorboard ☆21 Apr 28, 2019 Updated 6 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 Jun 23, 2024 Updated last year
- Collection of Common Machine Translation Tools ☆11 Jul 26, 2022 Updated 3 years ago
- NLP, before and after spaCy ☆2,235 Sep 22, 2023 Updated 2 years ago
- Data collection, alignment and TAUS repository ☆23 Nov 30, 2017 Updated 8 years ago
- ☆70 Nov 30, 2022 Updated 3 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, etc. ☆45 Nov 2, 2022 Updated 3 years ago
- Contrastive evaluation of pronoun translation in neural machine translation ☆26 Aug 22, 2019 Updated 6 years ago
- High Concurrency Bloom Filter Implementations for Go ☆12 Oct 10, 2023 Updated 2 years ago
- Gather module dependencies of source code ☆13 Jul 21, 2023 Updated 2 years ago
- a slackbot allowing you to search commandlinefu from within slack ☆11 Sep 8, 2016 Updated 9 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR. ☆15 Mar 21, 2023 Updated 2 years ago
- ☆29 Jun 10, 2024 Updated last year