huangruizhe/ConEC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huangruizhe/ConEC)

huangruizhe / ConEC

☆14

Alternatives and similar repositories for ConEC

Users that are interested in ConEC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mashiro009 / slidespeech_dl
View on GitHub
☆24Sep 20, 2024Updated last year
revdotcom / speech-datasets
View on GitHub
Various speech datasets made available to the public
☆136May 29, 2026Updated 2 months ago
isaacOnline / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆13Oct 28, 2023Updated 2 years ago
Speech-Lab-IITM / data2vec-aqc
View on GitHub
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆13Mar 18, 2024Updated 2 years ago
HuangZiliAndy / SSL_for_multitalker
View on GitHub
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33Mar 16, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Thesys-lab / learned-coded-computation
View on GitHub
Code for paper "Learning a Code: Machine Learning for Approximate Non-Linear Coded-Computation"
☆10Dec 21, 2020Updated 5 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
BriansIDP / WhisperBiasing
View on GitHub
☆88Jul 31, 2025Updated 11 months ago
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11Nov 7, 2023Updated 2 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
danpovey / lilcom
View on GitHub
Small compression utility
☆38Jan 20, 2026Updated 6 months ago
idiap / icassp-oov-recognition
View on GitHub
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Nov 28, 2021Updated 4 years ago
zsLin177 / CopyNE
View on GitHub
☆20Jun 3, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
revdotcom / fstalign
View on GitHub
An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
☆169May 12, 2026Updated 2 months ago
edwinhu / pin-code
View on GitHub
☆10May 10, 2026Updated 2 months ago
iral-lab / gold
View on GitHub
Multimodal grounded language dataset
☆11Dec 14, 2021Updated 4 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
fidler-lab / pmn_demo
View on GitHub
code for running trained model from Visual Reasoning by Progressive Module Networks (ICLR19)
☆15Jan 30, 2019Updated 7 years ago
apple / ml-interspeech2022-phi_rtn
View on GitHub
Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …
☆13Sep 8, 2022Updated 3 years ago
imartinezz / INPACT-S
View on GitHub
☆12Dec 26, 2023Updated 2 years ago
hakanmhmd / MMM-Stock
View on GitHub
Stock prices module for Magic Mirror
☆17Feb 4, 2020Updated 6 years ago
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
csukuangfj / optimized_transducer
View on GitHub
Memory efficient transducer loss computation
☆70Jun 10, 2022Updated 4 years ago
asappresearch / slue-toolkit
View on GitHub
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆65Feb 26, 2024Updated 2 years ago
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
toxtli / GooglePlay-AppleStore-reviews-scraper
View on GitHub
This script extracts the reviews from a given app store, it uses non-specific CSS selectors to prevent malfunctions in the future.
☆10Oct 19, 2019Updated 6 years ago
stephenroller / utcs-util
View on GitHub
A group of utilities useful for members of UTCS.
☆13Nov 19, 2016Updated 9 years ago
zehuiwu / SpeechCueLLM
View on GitHub
☆31Feb 27, 2025Updated last year
dukeplusds / mlwscv2022
View on GitHub
Duke Machine Learning Winter School: Computer Vision 2022
☆10Jan 3, 2022Updated 4 years ago
MiuLab / SpokenVec
View on GitHub
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24Dec 8, 2022Updated 3 years ago
justin / podscraper
View on GitHub
Python scripts to scrape the iTunes Podcast categories.
☆12Nov 30, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
FAST-ASR / MarkovModels.jl
View on GitHub
Julia package for Hidden Markov Model
☆34Sep 11, 2023Updated 2 years ago
facebookresearch / MMCSG
View on GitHub
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41Mar 13, 2024Updated 2 years ago
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated 2 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
csukuangfj / transducer-loss-benchmarking
View on GitHub
☆67Mar 25, 2022Updated 4 years ago
AudenAI / Auden
View on GitHub
☆71Apr 2, 2026Updated 3 months ago