Sharonio / roboshaulLinks
☆35Updated last year
Alternatives and similar repositories for roboshaul
Users that are interested in roboshaul are comparing it to the libraries listed below
Sorting:
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆23Updated 2 years ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆100Updated last month
- ☆21Updated 2 years ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- HeBERT: Pre-training BERT for modern Hebrew☆78Updated 2 years ago
- Tools, examples, and resources to assist in the development of Gen-AI (Generative Artificial Intelligence) applications in Hebrew, with a…☆31Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- ☆359Updated last year
- ☆158Updated 2 years ago
- open-source audio datasets☆153Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆247Updated 2 years ago
- Hebrew Diacritizer☆43Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆19Updated last year
- ☆53Updated 3 years ago
- ivrit.ai codebase☆40Updated 3 weeks ago
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆361Updated last year
- Text to Speech for Indic languages☆51Updated 3 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆148Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆61Updated 2 years ago
- Lyrics generation with GPT2-based Transformer☆106Updated 3 years ago
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆58Updated 8 months ago
- text-to-audio-latent-diffusion☆37Updated last year
- Speaker Diarization with Transformers☆69Updated 2 months ago
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model☆15Updated 2 years ago
- Hebrew grapheme to phoneme (G2P)☆38Updated 2 weeks ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago