ffreemt / fast-langid
Detect language of a given text, fast
☆9Updated 2 months ago
Related projects: ⓘ
- 80x faster and 95% accurate language identification with Fasttext☆131Updated 7 months ago
- machine translate docx/txt via deepl and pyppeteer☆15Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆84Updated this week
- Simply, faster, sentence-transformers☆127Updated 3 weeks ago
- Targetted language identifier, based on FastText and Hunspell.☆27Updated 2 weeks ago
- ggml implementation of BERT Embedding☆24Updated 10 months ago
- Meta's "No Language Left Behind" models served as web app and REST API☆171Updated last month
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆73Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆180Updated 2 weeks ago
- Faster, modernized fork of the language identification tool langid.py☆45Updated 3 months ago
- 中文标点符号模型,可以给文本添加标点符号。☆128Updated 6 months ago
- ☆22Updated last year
- fastertransformer for codegeex model☆63Updated last year
- ⚡️ 80x faster language detection with Fasttext | Split text by language for TTS☆104Updated last week
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | 具有多 LLM 适应性 | 通用大语言模型代理端框架 |多人称全类型注解☆41Updated 11 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆70Updated last year
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆213Updated 10 months ago
- Local cross-platform machine translation GUI, based on CTranslate2☆84Updated 8 months ago
- Scrape deepl using playwright☆9Updated last year
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆51Updated 2 weeks ago
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua☆30Updated 2 months ago
- This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task…☆47Updated 4 months ago
- Multilingual sentence alignment using sentence embeddings☆92Updated 9 months ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆42Updated 3 months ago
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆17Updated last month
- ☆20Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆43Updated 3 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆51Updated last year
- Translation demonstrator☆27Updated 4 years ago