☆178Mar 28, 2025Updated 11 months ago
Alternatives and similar repositories for pycld2
Users that are interested in pycld2 are comparing it to the libraries listed below
Sorting:
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Jun 26, 2023Updated 2 years ago
- Language detection extension for spaCy 2.0+☆114Feb 12, 2019Updated 7 years ago
- Compact Language Detector 2☆894May 22, 2021Updated 4 years ago
- A fully customisable language detection pipeline for spaCy☆93May 2, 2019Updated 6 years ago
- Python bindings to the Compact Language Detector☆33Apr 30, 2020Updated 5 years ago
- Port of Google's language-detection library to Python.☆1,877Mar 3, 2025Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated last month
- Multilingual text (NLP) processing toolkit☆2,368Nov 10, 2023Updated 2 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- ☆874May 24, 2023Updated 2 years ago
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries☆20Aug 31, 2022Updated 3 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Oct 1, 2020Updated 5 years ago
- Stand-alone language identification system☆2,454Jan 1, 2020Updated 6 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Sep 18, 2025Updated 6 months ago
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- Live survey of off-the-shelf language identification tools for python☆27Apr 13, 2022Updated 3 years ago
- Python bindings for CLD2.☆16Aug 9, 2018Updated 7 years ago
- Natural language detection, Java bindings for CLD2☆17Feb 26, 2026Updated 3 weeks ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23May 26, 2021Updated 4 years ago
- Lightning Fast Language Prediction 🚀☆167Aug 22, 2025Updated 7 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- NJUNMT for docNMT☆16Sep 9, 2020Updated 5 years ago
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 6 months ago
- NLP moudle for Golang☆13Jul 19, 2017Updated 8 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Language detection using Spacy and Fasttext☆56Dec 17, 2023Updated 2 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67May 9, 2022Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Improved ParaBank Rewriter☆22Jan 22, 2020Updated 6 years ago
- Zero-Shot Cross-Lingual Abstractive Sentence Summarization through Teaching Generation and Attention☆28Oct 22, 2020Updated 5 years ago
- PETCI: A Parallel English Translation Dataset of Chinese Idioms☆29Jan 30, 2026Updated last month
- Post-processing OCR errors with seq2seq models☆28Jul 30, 2020Updated 5 years ago
- Small utility to monitor fairseq training in tensorboard☆21Apr 28, 2019Updated 6 years ago
- Zero-Shot Translation implemented by Transformer☆14Mar 24, 2023Updated 2 years ago
- a ducttape workflow for neural machine translation☆14Mar 23, 2021Updated 4 years ago
- ☆18Sep 16, 2022Updated 3 years ago
- Crawler that collects and extracts content of daily published news articles☆12Feb 18, 2023Updated 3 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 4 months ago