ffreemt / fast-langidLinks
Detect language of a given text, fast
☆10Updated last year
Alternatives and similar repositories for fast-langid
Users that are interested in fast-langid are comparing it to the libraries listed below
Sorting:
- ggml implementation of BERT Embedding☆26Updated last year
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆62Updated 6 months ago
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆230Updated 5 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆123Updated this week
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆219Updated 9 months ago
- Local cross-platform machine translation GUI, based on CTranslate2☆96Updated last year
- 80x faster and 95% accurate language identification with Fasttext☆162Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- machine translate docx/txt via deepl and pyppeteer☆15Updated 2 years ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆50Updated last year
- Open language modeling toolkit based on PyTorch☆143Updated this week
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- 🔧 Repair JSON!Solution for JSON Anomalies from LLMs.☆274Updated last week
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆62Updated last month
- Port of Funasr's Paraformer model in C/C++☆34Updated last year
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆63Updated 2 months ago
- ☆29Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆159Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆66Updated 2 months ago
- pure go for rwkv☆19Updated last year
- Multilingual sentence alignment using sentence embeddings☆122Updated 10 months ago
- Faster access to Tesseract-OCR from Python☆13Updated 4 years ago
- 🗲 A high-performance on-disk dictionary.☆28Updated 3 weeks ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆39Updated 10 months ago
- ☆57Updated 3 years ago
- Meta's "No Language Left Behind" models served as web app and REST API☆235Updated 3 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 9 months ago
- ☆22Updated 10 months ago
- Deploy an API that pulls data from duckduckgo search engine.☆16Updated 4 months ago