huangruizhe / ConEC
☆14 · Jun 17, 2024 · Updated last year
Alternatives and similar repositories for ConEC
Users who are interested in ConEC are comparing it to the repositories listed below.
- ☆24 · Sep 20, 2024 · Updated last year
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini… ☆13 · Mar 18, 2024 · Updated last year
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles" ☆12 · Nov 7, 2023 · Updated 2 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆33 · Mar 16, 2023 · Updated 2 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM ☆17 · Nov 7, 2024 · Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆13 · Oct 28, 2023 · Updated 2 years ago
- Small compression utility ☆38 · Jan 20, 2026 · Updated 3 weeks ago
- Various speech datasets made available to the public ☆130 · Dec 13, 2024 · Updated last year
- ☆20 · Jun 3, 2024 · Updated last year
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 ☆19 · Nov 29, 2022 · Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆27 · May 17, 2023 · Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Nov 28, 2021 · Updated 4 years ago
- Python wrapper for kaldi's native I/O ☆27 · Jan 9, 2025 · Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch. ☆27 · Apr 20, 2024 · Updated last year
- ☆28 · Oct 7, 2025 · Updated 4 months ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding ☆24 · Dec 8, 2022 · Updated 3 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O… ☆66 · Feb 26, 2024 · Updated last year
- Julia package for Hidden Markov Model ☆34 · Sep 11, 2023 · Updated 2 years ago
- Memory efficient transducer loss computation ☆69 · Jun 10, 2022 · Updated 3 years ago
- ☆67 · Mar 25, 2022 · Updated 3 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one … ☆40 · Mar 13, 2024 · Updated last year
- A curated list of awesome papers on contextualizing E2E ASR outputs ☆80 · May 10, 2023 · Updated 2 years ago
- ☆12 · Dec 26, 2023 · Updated 2 years ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model ☆16 · May 23, 2025 · Updated 8 months ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation ☆18 · May 23, 2025 · Updated 8 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit. ☆10 · Feb 29, 2024 · Updated last year
- ☆37 · Mar 30, 2021 · Updated 4 years ago
- ☆37 · Aug 30, 2023 · Updated 2 years ago
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos ☆37 · Jan 29, 2025 · Updated last year
- ☆86 · Jul 31, 2025 · Updated 6 months ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆345 · May 15, 2024 · Updated last year
- Python wrapper for kaldi's arpa2fst ☆37 · Aug 27, 2025 · Updated 5 months ago
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- Duke Machine Learning Winter School: Computer Vision 2022 ☆10 · Jan 3, 2022 · Updated 4 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆18 · May 8, 2025 · Updated 9 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization" ☆11 · May 16, 2023 · Updated 2 years ago
- ☆37 · Nov 22, 2025 · Updated 2 months ago
- A temporal module for PyTorch-ComplexTensor ☆44 · Jun 28, 2024 · Updated last year
- RFCs for standardcompletions.org ☆25 · Jun 11, 2025 · Updated 8 months ago