Sharonio / roboshaulLinks
☆35Updated last year
Alternatives and similar repositories for roboshaul
Users that are interested in roboshaul are comparing it to the libraries listed below
Sorting:
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆23Updated 2 years ago
- ☆21Updated 2 years ago
- HeBERT: Pre-training BERT for modern Hebrew☆78Updated 2 years ago
- Tools, examples, and resources to assist in the development of Gen-AI (Generative Artificial Intelligence) applications in Hebrew, with a…☆31Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆100Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- A Lossless Compression Library for AI pipelines☆268Updated 2 weeks ago
- ☆359Updated last year
- A comprehensive list of Hebrew NLP resources.☆274Updated 2 months ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆246Updated 2 years ago
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆358Updated last year
- Neural Modeling for Named Entities and Morphology (Hebrew NER)☆32Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- ivrit.ai codebase☆38Updated last week
- Hebrew word lists☆43Updated 8 months ago
- Coqui AI TTS plugin☆81Updated 2 weeks ago
- ☆52Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- Google Colab Notebooks for Transcription with Whisper☆24Updated 2 months ago
- Hebrew Diacritizer☆41Updated last month
- generate granular word-level captions in srt format☆57Updated 2 years ago
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆23Updated 7 months ago
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.☆64Updated last year
- a complete reproducible example of training a word2vec model for Hebrew☆12Updated 2 years ago
- An open source interactive spectrogram audio player, primarily based on bokeh and the holoviz stack (wav+holoviz=waloviz)☆67Updated this week
- Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experimen…☆297Updated last year
- ☆59Updated 5 months ago
- ☆85Updated 2 years ago