ffreemt / fast-langid
Detect language of a given text, fast
☆10Updated last year
Alternatives and similar repositories for fast-langid
Users who are interested in fast-langid are comparing it to the libraries listed below.
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆226Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆136Updated this week
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆283Updated 4 months ago
- machine translate docx/txt via deepl and pyppeteer☆15Updated 3 years ago
- The pkuseg toolkit for multi-domain Chinese word segmentation☆69Updated 6 months ago
- 80x faster and 95% accurate language identification with Fasttext☆164Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py☆60Updated last year
- Local cross-platform machine translation GUI, based on CTranslate2☆99Updated 2 years ago
- A tiny, generic implementation of the Myers diff algorithm☆22Updated 5 years ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆84Updated last week
- Open language modeling toolkit based on PyTorch☆173Updated this week
- 渊 - A project for Classical Chinese☆110Updated 3 years ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and transformers☆35Updated last year
- Meta's "No Language Left Behind" models served as web app and REST API☆253Updated 8 months ago
- 🔧 Repair JSON! Solution for JSON anomalies from LLMs.☆345Updated this week
- Multilingual sentence alignment using sentence embeddings☆139Updated last year
- Library for translating between 200 languages. Built on 🤗 transformers.☆496Updated last year
- Simply, faster, sentence-transformers☆143Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated 2 years ago
- A small seq2seq punctuator tool based on DistilBERT☆53Updated last year
- fastertransformer for codegeex model☆65Updated 2 years ago
- whisper.cpp bindings for python☆110Updated 2 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆105Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆51Updated 9 months ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆201Updated this week
- streaming speech to text server using Whisper☆101Updated 2 years ago