codegram / calbert
Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)
☆14 Updated 4 years ago
Alternatives and similar repositories for calbert:
Users who are interested in calbert are comparing it to the libraries listed below.
- Pre-production releases for spaCy in Catalan ☆14 Updated 3 years ago
- The RadioTalk dataset of talk radio transcripts ☆57 Updated 4 years ago
- Code for my blog post on Generating Words from Embeddings ☆23 Updated 6 months ago
- A Raspberry Pi 64-bit image with spaCy and neuralcoref pre-installed ☆21 Updated 5 years ago
- ☆17 Updated 6 months ago
- Repository for subjective and objective evaluation of source separation algorithms ☆12 Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 Updated 3 years ago
- Experiments with Hugging Face 🔬 🤗 ☆45 Updated 6 months ago
- ADS Project ☆14 Updated 9 years ago
- Official source for Catalan Language Models and resources made within the Aina project. ☆23 Updated last year
- Acoustic and language models for minorised languages. ☆26 Updated 4 years ago
- ☆30 Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- App to explore latent spaces of music collections ☆33 Updated 9 months ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆33 Updated 8 months ago
- ☆32 Updated 4 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- 🧮 Python package to construct word embeddings for small data using PMI and SVD ☆17 Updated 4 years ago
- Creating audio preprocessing features in TensorFlow Keras layers ☆14 Updated 3 years ago
- ☆32 Updated 3 years ago
- Utilities for manipulating finite state transducers with the OpenFst library. ☆30 Updated 7 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- A collection of useful tools for handling speech recognition data ☆30 Updated 2 years ago
- DeepSpeech ASR Model for the Catalan Language ☆17 Updated 4 years ago
- REST API for Mozilla DeepSpeech voice recognition engine ☆20 Updated 3 years ago
- ☆15 Updated 4 years ago
- ☆10 Updated 3 years ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago