ivrit-ai / ivrit.ai
ivrit.ai codebase
☆25Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for ivrit.ai
- Hebrew Diacritizer☆30Updated 2 months ago
- ☆32Updated 11 months ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆21Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆69Updated 3 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆9Updated last month
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆22Updated last week
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆15Updated 2 weeks ago
- TTS Client for Coqui TTS server☆13Updated last year
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated 11 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 10 months ago
- ☆12Updated 5 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆30Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆18Updated this week
- ☆12Updated last year
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆16Updated this week
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆11Updated last year
- ☆11Updated 9 years ago
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆18Updated last year
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆17Updated 4 years ago
- ☆23Updated last year
- A JAX library for building lattice-based speech transducer models☆40Updated 2 weeks ago
- proof of concept conversation orchestrator with a speech-language model☆13Updated 3 weeks ago
- Google Colab Notebooks for Transcription with Whisper☆22Updated 5 months ago