codegram / calbertLinks
Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)
☆14Updated 5 years ago
Alternatives and similar repositories for calbert
Users that are interested in calbert are comparing it to the libraries listed below
Sorting:
- The RadioTalk dataset of talk radio transcripts☆60Updated 4 years ago
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- ☆17Updated last year
- Official source for Catalan Language Models and resources made within Aina project.☆25Updated 2 years ago
- Generating English Rock lyrics using BERT☆19Updated 6 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ☆18Updated 10 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated last year
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆40Updated 2 years ago
- ☆30Updated 3 years ago
- App to explore latent spaces of music collections☆35Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- A guide to building language technology in new languages.☆59Updated 3 years ago
- ☆76Updated 3 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆11Updated 5 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Reddit title generator API based on GPT-2☆19Updated 5 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Acoustic and language models for minorised languages.☆26Updated 4 years ago
- ML model to classify music instruments from audio - Heroku deployment.☆18Updated 8 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆55Updated 5 years ago
- Which ML are you?☆13Updated 2 years ago
- Gamma Agreement in Python☆45Updated last year