Dicta-Israel-Center-for-Text-Analysis / alephbertgimmelLinks
AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.
☆25Updated 3 years ago
Alternatives and similar repositories for alephbertgimmel
Users that are interested in alephbertgimmel are comparing it to the libraries listed below
Sorting:
- Hebrew grapheme to phoneme (G2P)☆80Updated last month
- ☆55Updated 3 years ago
- ☆18Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆103Updated 6 months ago
- Hebrew Diacritizer☆46Updated last month
- ☆21Updated 2 years ago
- scipts for working with open.bible data☆26Updated 3 years ago
- ☆14Updated 10 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆35Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 10 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- Onnx compatible styletts2 code☆13Updated 6 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated last week
- phone inventory library☆17Updated 2 years ago
- Lyrics generation with GPT2-based Transformer☆108Updated 3 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- Script to train a German n-gram Language Model on articles of Wikipedia☆13Updated 7 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- ☆19Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 8 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Updated 11 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆59Updated last year
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 144 human languages☆48Updated last week
- ☆20Updated last year
- Train a fiwGAN or ciwGAN model using your own training data☆14Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆31Updated 8 months ago
- HeBERT: Pre-training BERT for modern Hebrew☆80Updated 2 years ago