ffreemt / fast-langid
Detect language of a given text, fast
☆10 Updated last year
Alternatives and similar repositories for fast-langid
Users who are interested in fast-langid are comparing it to the libraries listed below
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux ☆66 Updated 3 months ago
- ggml implementation of BERT Embedding ☆26 Updated 2 years ago
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f… ☆226 Updated last year
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language ☆273 Updated 3 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆132 Updated this week
- 80x faster and 95% accurate language identification with Fasttext ☆163 Updated last year
- Faster, modernized fork of the language identification tool langid.py ☆61 Updated last year
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation ☆67 Updated 5 months ago
- A sentence segmentation library with wide language support optimized for speed and utility. ☆76 Updated last week
- Training open neural machine translation models ☆387 Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆58 Updated last year
- A small seq2seq punctuator tool based on DistilBERT ☆53 Updated 11 months ago
- Port of Funasr's Paraformer model in C/C++ ☆39 Updated last year
- Local cross-platform machine translation GUI, based on CTranslate2 ☆99 Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2 ☆35 Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆62 Updated 2 years ago
- openvino version of openai/whisper ☆178 Updated 2 years ago
- Python bindings for whisper.cpp ☆247 Updated last year
- Open language modeling toolkit based on PyTorch ☆159 Updated last week
- This is an example of search videos using jina ☆24 Updated 3 years ago
- Library for translating between 200 languages. Built on 🤗 transformers. ☆495 Updated last year
- Faster access to Tesseract-OCR from Python ☆13 Updated 4 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂 ☆71 Updated 5 months ago
- whisper.cpp bindings for python ☆108 Updated 2 years ago
- Python bindings for whisper.cpp ☆305 Updated last week
- Turn any OCR models into online inference API endpoint 🚀 🌖 ☆57 Updated last month
- Cross-platform audio recorder designed for real-time speech audio processing ☆124 Updated 4 months ago
- SenseVoice-python: An enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime ☆108 Updated 2 months ago