GitHub30 / winocr
☆22Updated 9 months ago
Alternatives and similar repositories for winocr
Users who are interested in winocr are comparing it to the libraries listed below
- An even smaller speech recognizer / force aligner☆35Updated 7 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆62Updated 5 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆95Updated 2 years ago
- Local cross-platform machine translation GUI, based on CTranslate2☆95Updated last year
- Seed Machine Translation Data☆32Updated 8 months ago
- ☆13Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆66Updated last month
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆120Updated this week
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 8 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆35Updated 5 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆71Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 3 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 10 months ago
- Convert epub file to txt☆37Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆221Updated 8 months ago
- Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]☆26Updated 8 months ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 5 months ago
- Translate HTML using Argos Translate☆53Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆31Updated this week
- Multilingual sentence alignment using sentence embeddings☆121Updated 9 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- 80x faster and 95% accurate language identification with Fasttext☆160Updated last year