Sharonio / roboshaul
☆35Updated 2 years ago
Alternatives and similar repositories for roboshaul
Users who are interested in roboshaul are comparing it to the libraries listed below.
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆25Updated 3 years ago
- ☆21Updated 2 years ago
- ☆358Updated last year
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆24Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- ivrit.ai codebase☆44Updated 2 months ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- A powerful Hebrew Whisper transcription and translation tool☆71Updated last year
- Tools, examples, and resources to assist in the development of Gen-AI (Generative Artificial Intelligence) applications in Hebrew, with a…☆31Updated last year
- Hebrew grapheme to phoneme (G2P)☆81Updated last week
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆103Updated 6 months ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated 2 years ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆257Updated 3 years ago
- Coqui AI TTS plugin☆85Updated 6 months ago
- Speaker Diarization with Transformers☆69Updated 7 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year
- Hebrew word lists☆48Updated last year
- Performant and accurate speech recognition built on PyTorch☆254Updated 3 years ago
- Google Colab Notebooks for Transcription with Whisper☆24Updated 8 months ago
- Provides custom Gradio components that make diarization-based audio labeling easier and faster.☆69Updated 2 months ago
- The PyTorch implementation of "Universal Source Separation with Weakly Labelled Data".☆367Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆375Updated last year
- open-source audio datasets☆155Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆62Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- A Lossless Compression Library for AI pipelines☆290Updated 6 months ago
- Text to Speech for Indic languages☆52Updated 3 years ago
- A simple voice conversion tool☆19Updated 3 years ago