ffreemt / fast-langid
Detect language of a given text, fast
☆10 Updated last year
Alternatives and similar repositories for fast-langid
Users who are interested in fast-langid compare it to the libraries listed below.
- ggml implementation of BERT Embedding ☆26 Updated last year
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux ☆65 Updated last month
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f… ☆224 Updated 11 months ago
- machine translate docx/txt via deepl and pyppeteer ☆15 Updated 2 years ago
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language ☆254 Updated last month
- 80x faster and 95% accurate language identification with Fasttext ☆161 Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆127 Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility. ☆71 Updated this week
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation ☆64 Updated 3 months ago
- openvino version of openai/whisper ☆176 Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py ☆61 Updated 11 months ago
- whisper.cpp bindings for python ☆107 Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 Updated 2 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target. ☆51 Updated 6 months ago
- Local cross-platform machine translation GUI, based on CTranslate2 ☆97 Updated last year
- ☆57 Updated 3 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2 ☆36 Updated 2 years ago
- Open language modeling toolkit based on PyTorch ☆152 Updated last week
- pure go for rwkv ☆19 Updated last year
- Model for recasing and repunctuating ASR transcripts ☆141 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆63 Updated 2 years ago
- Sentence Transformers API: An OpenAI compatible embedding API server ☆68 Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆70 Updated 3 months ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆160 Updated last week
- Port of Funasr's Paraformer model in C/C++ ☆37 Updated last year
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆215 Updated last week
- A tiny, generic implementation of the Myers diff algorithm ☆21 Updated 5 years ago
- A small seq2seq punctuator tool based on DistilBERT ☆53 Updated 10 months ago
- 📝 An easy-to-use package to restore punctuation of the text. ☆119 Updated 2 years ago