Softcatala/ca-text-corpus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Softcatala/ca-text-corpus)

Softcatala / ca-text-corpus

Public domain corpus of Catalan text

☆18

Alternatives and similar repositories for ca-text-corpus

Users that are interested in ca-text-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ccoreilly / deepspeech-catala
View on GitHub
Deepspeech ASR Model for the Catalan Language
☆17Feb 15, 2021Updated 5 years ago
apertium / apertium-cat
View on GitHub
Apertium linguistic data for Catalan
☆11Mar 13, 2026Updated 4 months ago
Softcatala / Catalanitzador
View on GitHub
A Microsoft Windows & Mac OS program that makes your system Catalan language friendly
☆29Dec 18, 2025Updated 7 months ago
AlexK-PL / GST_Tacotron2
View on GitHub
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…
☆10Sep 4, 2023Updated 2 years ago
Softcatala / julibert
View on GitHub
Catalan bert model
☆13Oct 17, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ccoreilly / spacy-catala
View on GitHub
Spacy NLP Model for the Catalan language
☆16Nov 21, 2020Updated 5 years ago
TalnUPF / praat_web
View on GitHub
☆13Jun 30, 2026Updated 3 weeks ago
benwebber / duiker
View on GitHub
Index your shell history in a full-text search database
☆13Aug 23, 2024Updated last year
henrikingo / presentations
View on GitHub
Impress.js presentations I've done
☆11Sep 30, 2020Updated 5 years ago
lexibank / lexibank-analysed
View on GitHub
Study on lexibank data (presenting the lexibank dataset).
☆16Jun 16, 2026Updated last month
isi-nlp / carmel
View on GitHub
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆15Jan 24, 2017Updated 9 years ago
pepprseed / svgdatashapes
View on GitHub
a compact set of python functions for creating many types of plots and data displays in SVG for use in web pages.
☆12Mar 16, 2023Updated 3 years ago
projecte-aina / lm-catalan
View on GitHub
Official source for Catalan Language Models and resources made within Aina project.
☆26Jul 28, 2023Updated 2 years ago
ccoreilly / wav2vec2-catala
View on GitHub
Wav2Vec 2.0 catalan training scripts and models
☆12Jun 18, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aholab / AhoTTS
View on GitHub
Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…
☆18Jan 15, 2026Updated 6 months ago
Softcatala / catalan-dict-tools
View on GitHub
Tools for managing Catalan dictionaries
☆64Updated this week
collectivat / cmusphinx-models
View on GitHub
Acoustic and language models for minorised languages.
☆26Jul 17, 2026Updated last week
malev / pyfreeling
View on GitHub
Freeling wrapper
☆12Jun 27, 2016Updated 10 years ago
lirondos / lazaro
View on GitHub
An observatory of anglicism usage in the Spanish press
☆11Jul 15, 2026Updated last week
vixen-project / vixen
View on GitHub
ViXeN is a multimedia viewer, metadata extractor and annotator.
☆15Oct 13, 2019Updated 6 years ago
projecte-aina / oTranscribe-plus
View on GitHub
A free & open tool for transcribing audio interviews with offline ASR support
☆25Dec 21, 2023Updated 2 years ago
alvations / DLTK
View on GitHub
Deutsch Language Tool Kit
☆12Aug 31, 2015Updated 10 years ago
negativo17 / Signal-Desktop
View on GitHub
Private messaging from your desktop
☆12Jul 17, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ceteri / jem-docker
View on GitHub
Docker container capable of running an iPython notebook server, for "Just Enough Math"
☆16Mar 31, 2023Updated 3 years ago
yuri-bizzoni / Metaphor-Paraphrase
View on GitHub
☆14Jul 31, 2022Updated 3 years ago
wetneb / nifconverter
View on GitHub
Utility to translate NIF files across identifier schemes, such as DBpedia and Wikidata
☆11Aug 24, 2019Updated 6 years ago
arielf / speedtests
View on GitHub
Internet speed-test data highlighting Comcast practices + reproduction code
☆16Dec 19, 2025Updated 7 months ago
unicode-org / unilex
View on GitHub
Lexical data at Unicode
☆70Sep 1, 2024Updated last year
katreparitosh / Discourse-Analytics-of-Political-Speech-Transcripts
View on GitHub
Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)
☆17Apr 28, 2021Updated 5 years ago
mukesh-mehta / VDCNN
View on GitHub
Implementation of Very Deep Convolutional Neural Network paper
☆14Nov 14, 2018Updated 7 years ago
sfu-natlang / trofi-metaphor-data
View on GitHub
Metaphor dataset: literal versus non-literal uses of words
☆14Nov 8, 2015Updated 10 years ago
CyanogenMod / android_packages_providers_CalendarProvider
View on GitHub
Android CalendarProvider
☆22Dec 17, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
apertium / apertium-tools
View on GitHub
Apertium tools
☆20May 27, 2021Updated 5 years ago
PascalLesage / presamples
View on GitHub
Package to write, load, manage and verify numerical arrays, called presamples.
☆14Dec 2, 2022Updated 3 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
AhlemGit / Arabic-WordNet-To-SQLite
View on GitHub
This repository is about how to build an SQLite version of the Arabic WordNet database.
☆11Mar 19, 2019Updated 7 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
CyanogenMod / android_packages_wallpapers_Basic
View on GitHub
☆26Dec 23, 2016Updated 9 years ago