Public domain corpus of Catalan text
☆18Dec 20, 2021Updated 4 years ago
Alternatives and similar repositories for ca-text-corpus
Users that are interested in ca-text-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Microsoft Windows & Mac OS program that makes your system Catalan language friendly☆29Dec 18, 2025Updated 6 months ago
- Apertium linguistic data for Catalan☆11Mar 13, 2026Updated 3 months ago
- Tools for managing Catalan dictionaries☆64Updated this week
- Catalan bert model☆13Oct 17, 2020Updated 5 years ago
- 🤖 Deep Catalan: Bring closer the Catalan Language to Deep Learning using ULMFit.☆12Oct 15, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 5 years ago
- ☆13Updated this week
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆15Jan 24, 2017Updated 9 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 5 months ago
- Study on lexibank data (presenting the lexibank dataset).☆16Jun 16, 2026Updated 2 weeks ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 5 years ago
- This repository contains Neural Machine Translation tools built at Softcatalà☆46Mar 7, 2026Updated 3 months ago
- Freeling wrapper☆12Jun 27, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Jul 9, 2020Updated 5 years ago
- Deutsch Language Tool Kit☆12Aug 31, 2015Updated 10 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Private messaging from your desktop☆12Updated this week
- Acoustic and language models for minorised languages.☆26Sep 30, 2020Updated 5 years ago
- Adapt Capsule Network for Name Entity Recognition Task☆10Jun 12, 2019Updated 7 years ago
- An observatory of anglicism usage in the Spanish press☆11May 11, 2026Updated last month
- Utility to translate NIF files across identifier schemes, such as DBpedia and Wikidata☆11Aug 24, 2019Updated 6 years ago
- Internet speed-test data highlighting Comcast practices + reproduction code☆16Dec 19, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ViXeN is a multimedia viewer, metadata extractor and annotator.☆15Oct 13, 2019Updated 6 years ago
- Metaphor dataset: literal versus non-literal uses of words☆14Nov 8, 2015Updated 10 years ago
- Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)☆16Apr 28, 2021Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Apertium tools☆20May 27, 2021Updated 5 years ago
- Terminal, neovim and swaywbspwm configuration☆28Aug 23, 2021Updated 4 years ago
- FreeLing project source code☆262Mar 9, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Linux/BSD/OSX Installer☆29Sep 5, 2024Updated last year
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- ☆10Nov 1, 2025Updated 8 months ago
- wav2rtp is a simple tool intended to convert speech data from wav files to RTP data stream☆14Aug 15, 2021Updated 4 years ago
- Python Distribution Grid Simulator☆18Sep 21, 2020Updated 5 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 7 years ago