codegram / calbertLinks
Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)
☆14Updated 5 years ago
Alternatives and similar repositories for calbert
Users that are interested in calbert are comparing it to the libraries listed below
Sorting:
- The RadioTalk dataset of talk radio transcripts☆60Updated 4 years ago
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks.☆39Updated 4 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆24Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ☆17Updated 11 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- ☆12Updated 3 years ago
- Multi-lingual Text Processing☆96Updated 6 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Code and data used in named entity transliteration experiments☆57Updated 7 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- ☆76Updated 3 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
- Gamma Agreement in Python☆44Updated last year
- ☆15Updated 6 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆56Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- ☆10Updated 4 years ago
- Code for my blog post on Generating Words from Embeddings☆23Updated 11 months ago
- docker for HF wav2vec2-sprint☆13Updated 4 years ago
- ELECTRA MODEL NLP☆13Updated 5 years ago
- A Python 3 phonetics library.☆133Updated 5 years ago