☆11Oct 24, 2022Updated 3 years ago
Alternatives and similar repositories for ConstDecoder
Users that are interested in ConstDecoder are comparing it to the libraries listed below
Sorting:
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- This repository contains the VLEngagement dataset and the helper functions/ tools required to work with the dataset.☆16Dec 3, 2021Updated 4 years ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- Hierarchical Context Tagger for utterance rewriting☆13Mar 27, 2022Updated 3 years ago
- Repository for "LLM-based speaker diarization correction: A generalizable approach" paper☆20Jul 31, 2024Updated last year
- EDUKG: a Heterogeneous Sustainable K-12 Educational Knowledge Graph☆77Jul 27, 2022Updated 3 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…☆13Feb 18, 2026Updated last week
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- ☆34Jun 15, 2021Updated 4 years ago
- ☆30Jun 12, 2025Updated 8 months ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- Myanmar lexicon analyzer - Sorting and Segmentation☆10Aug 11, 2021Updated 4 years ago
- Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmaw☆11Mar 2, 2019Updated 6 years ago
- Artifact code release for paper "Uniform-Cost Multi-Path Routing for Reconfigurable Data Center Networks"☆12Sep 5, 2024Updated last year
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆39Feb 11, 2022Updated 4 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆17May 27, 2025Updated 9 months ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- 带拼音、字形特征的文本纠错模型☆11Jan 1, 2023Updated 3 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- A lecture summarization tool that uses AI and computer vision to summarize and index videos☆11Dec 8, 2022Updated 3 years ago
- ☆13Oct 17, 2020Updated 5 years ago
- ☆10Jul 16, 2024Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- ☆13Jan 31, 2023Updated 3 years ago
- ☆10Jan 18, 2024Updated 2 years ago
- ☆12Nov 8, 2024Updated last year
- TVDiag: A Task-oriented and View-invariant Failure Diagnosis Framework with Multimodal Data☆15Apr 28, 2025Updated 10 months ago
- NS3 simulator for RDMA load balancing☆11Jan 31, 2025Updated last year
- ☆11Nov 11, 2022Updated 3 years ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …☆11Nov 24, 2024Updated last year
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment..☆13Jul 1, 2025Updated 8 months ago