mit-ccc/RadioTalk

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mit-ccc/RadioTalk)

mit-ccc / RadioTalk

The RadioTalk dataset of talk radio transcripts

☆62

Alternatives and similar repositories for RadioTalk

Users that are interested in RadioTalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BBC-archive / newslabs-wat
View on GitHub
Compare coverage across different media sources using the Juicer
☆12Apr 1, 2016Updated 10 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
tiefenauer / wiki-lm
View on GitHub
Script to train a German n-gram Language Model on articles of Wikipedia
☆14Oct 20, 2018Updated 7 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
tmcw-up-for-adoption / geocodify
View on GitHub
Geocode streaming CSV data, producing GeoJSON & CSV.
☆34Mar 4, 2019Updated 7 years ago
XapaJIaMnu / gLM
View on GitHub
A GPU language model, based on btree backed tries.
☆30Mar 6, 2018Updated 8 years ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
bshall / dusted
View on GitHub
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Oct 2, 2024Updated last year
er537 / whisper_interpretability
View on GitHub
A repo to do interpretability of pre-trained acoustic models
☆15Oct 15, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
sytelus / nanuGPT
View on GitHub
Simple, reliable and well tested training code for quick experiments with transformer based models
☆13Jun 28, 2026Updated last month
alumae / streaming-punctuator
View on GitHub
☆17Apr 14, 2023Updated 3 years ago
slanglab / IndiaPoliceEvents
View on GitHub
Data and code to accompany the paper: Halterman, Keith, Sarwar, and O'Connor. "Corpus-Level Evaluation for Event QA: The IndiaPoliceEvent…
☆15Aug 6, 2021Updated 4 years ago
sunlightlabs / read_FEC
View on GitHub
Turn raw electronic FEC filings into meaningful data
☆19May 20, 2016Updated 10 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
justingrimmer / tad_19
View on GitHub
Text as Data 2019
☆61Jun 5, 2019Updated 7 years ago
cheoljun95 / sdhubert
View on GitHub
☆27Dec 4, 2024Updated last year
DallasMorningNews / chartwerk-editor
View on GitHub
React/Redux Chartwerk editor.
☆10Oct 5, 2018Updated 7 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
danpovey / pocolm
View on GitHub
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92Aug 6, 2022Updated 3 years ago
idiap / inv-tn
View on GitHub
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Sep 27, 2017Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
The-Politico / gootenberg
View on GitHub
A tool for handling news developer needs from the Google API.
☆38Jan 3, 2023Updated 3 years ago
cjbarrie / sicss_23
View on GitHub
Repository of materials for SICSS-Edinburgh, 2023.
☆12Jun 19, 2023Updated 3 years ago
zeke / acrophonic-alphabets
View on GitHub
A collection of 92 Alpha-Bravo-Charlie-style alphabets from around the world.
☆13May 25, 2022Updated 4 years ago
collectivat / cmusphinx-models
View on GitHub
Acoustic and language models for minorised languages.
☆26Jul 17, 2026Updated 2 weeks ago
artbataev / end2end
View on GitHub
Losses and decoders for end-to-end ASR and OCR
☆34Oct 30, 2020Updated 5 years ago
EgorLakomkin / KTSpeechCrawler
View on GitHub
Automatically constructing corpus for automatic speech recognition from YouTube videos
☆157Feb 15, 2020Updated 6 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year