ffreemt / fast-langid
Detect language of a given text, fast
☆10 · Updated last year
Alternatives and similar repositories for fast-langid
Users who are interested in fast-langid are comparing it to the libraries listed below.
- ggml implementation of BERT Embedding ☆26 · Updated last year
- 80x faster and 95% accurate language identification with Fasttext ☆160 · Updated last year
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language ☆223 · Updated 4 months ago
- Faster, modernized fork of the language identification tool langid.py ☆56 · Updated 8 months ago
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux ☆62 · Updated 5 months ago
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f… ☆221 · Updated 9 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆120 · Updated this week
- Machine translate docx/txt via deepl and pyppeteer ☆15 · Updated 2 years ago
- The pkuseg toolkit for multi-domain Chinese word segmentation ☆62 · Updated 3 weeks ago
- Open Source Text Embedding Models with OpenAI Compatible API ☆157 · Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆187 · Updated last week
- Meta's "No Language Left Behind" models served as web app and REST API ☆228 · Updated 2 months ago
- ☆15 · Updated 2 years ago
- A small seq2seq punctuator tool based on DistilBERT ☆52 · Updated 7 months ago
- A sentence segmentation library with wide language support optimized for speed and utility. ☆66 · Updated last month
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆267 · Updated 2 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆57 · Updated last year
- Scrape deepl using playwright ☆9 · Updated 2 years ago
- whisper.cpp bindings for python ☆100 · Updated last year
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge. ☆160 · Updated 4 years ago
- 🔧 Repair JSON! Solution for JSON anomalies from LLMs. ☆270 · Updated 2 months ago
- Port of Funasr's Paraformer model in C/C++ ☆33 · Updated last year
- Unofficial API for Microsoft speech synthesis, based on the Read Aloud feature of the Microsoft Edge web browser ☆59 · Updated 3 weeks ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆135 · Updated 3 months ago
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 · Updated last year
- ☆40 · Updated 4 years ago
- Deploy an API that pulls data from duckduckgo search engine. ☆16 · Updated 3 months ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆80 · Updated 2 years ago
- A Chinese punctuation model that can add punctuation marks to text. ☆142 · Updated 7 months ago
- A tiny, generic implementation of the Myers diff algorithm ☆20 · Updated 4 years ago