andrewjw / pyvobsub2srtLinks
A Python script to convert vobsub subtitles into srt format using tesseract for ocr
☆10Updated 10 years ago
Alternatives and similar repositories for pyvobsub2srt
Users that are interested in pyvobsub2srt are comparing it to the libraries listed below
Sorting:
- ☆87Updated 4 years ago
- Port of the OpenFST library to Windows☆79Updated last year
- Simple cue file parser written in python☆22Updated last year
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Updated 4 years ago
- Local cross-platform machine translation GUI, based on CTranslate2☆94Updated last year
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆67Updated 5 years ago
- A fork of open_jtalk☆60Updated 3 months ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆44Updated 3 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆21Updated 2 years ago
- A Python library for editing subtitle files☆378Updated 5 months ago
- Python bindings around the LAME encoder☆60Updated 6 months ago
- python based software to unpack kindlegen generated ebooks☆63Updated 2 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆33Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆119Updated 8 months ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆51Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Python parser for SubRip (srt) files☆477Updated 2 years ago
- JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation (LREC2020) & Linguistically Driven Multi-Task Pr…☆16Updated 3 years ago
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible…☆42Updated 9 months ago
- Converts English text to IPA notation☆389Updated 2 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆103Updated 4 years ago
- ☆223Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script☆214Updated 11 months ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 5 months ago
- Corpus preprocessing☆97Updated last year
- Python module for syllabifying English ARPABET transcriptions☆68Updated 6 years ago
- Read, write, convert and segment WebVTT caption files in Python.☆217Updated last year
- context labels and pronunciation data for JSUT corpus☆71Updated 3 years ago
- VS Code extension that allows you to preview and play audio files.☆162Updated last month