End-to-end ASR repository for AGI
☆20Dec 19, 2025Updated 3 months ago
Alternatives and similar repositories for AGI_HER_ASR
Users that are interested in AGI_HER_ASR are comparing it to the libraries listed below.
- ☆25Dec 19, 2025Updated 3 months ago
- AGI_HER_LLM☆36Dec 19, 2025Updated 3 months ago
- ☆30Dec 19, 2025Updated 3 months ago
- FastSpeech2, modified for training KSS Dataset. Modified from https://github.com/ming024/FastSpeech2☆38Dec 19, 2025Updated 3 months ago
- Flow matching based speaker verification☆24Dec 20, 2025Updated 3 months ago
- (Interspeech 2025, official code) Speech enhancement based on cascaded two flows☆16Sep 1, 2025Updated 6 months ago
- Official repository of NeXt-TDNN for speaker verification☆80Oct 10, 2024Updated last year
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆37Apr 5, 2024Updated last year
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement☆92Jul 23, 2025Updated 8 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Educational Implementation of "Edit Flows: Flow Matching with Edit Operations" by Havasi et al.☆38Oct 17, 2025Updated 5 months ago
- Official code for Metric learning for user-defined keyword spotting☆38Feb 21, 2024Updated 2 years ago
- [Neurocomputing] EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning☆18Jul 6, 2025Updated 8 months ago
- Sound Source Localization for AI Grand Challenge 2021☆21Feb 7, 2022Updated 4 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- audio source separation evaluation metrics☆29Aug 27, 2019Updated 6 years ago
- ☆55Jul 16, 2025Updated 8 months ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆48Jan 24, 2026Updated 2 months ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆57Mar 12, 2024Updated 2 years ago
- A custom, multi-threaded HTTP web server☆35Feb 17, 2013Updated 13 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Feb 8, 2022Updated 4 years ago
- ☆12Oct 12, 2024Updated last year
- Code for “ACE-HGNN: Adaptive Curvature Exploration Hyperbolic Graph Neural Network”☆17Mar 3, 2022Updated 4 years ago
- This repository provides a PyTorch implementation of the physics informed neural networks by M.Raissi et al.☆11Aug 9, 2021Updated 4 years ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆51Sep 20, 2025Updated 6 months ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- ☆17Mar 21, 2024Updated 2 years ago
- ☆14Oct 10, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum☆30Dec 15, 2024Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 6 months ago
- An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.☆24Oct 29, 2025Updated 5 months ago
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 7 months ago
- [NeurIPS '24] Code repo for the paper entitled "Learning Structured Representations with Hyperbolic Embeddings" at NeurIPS 2024☆24Jan 22, 2025Updated last year
- Learning Affinity with Hyperbolic Representation for Spatial Propagation☆13Jun 30, 2023Updated 2 years ago
- Source code for paper "Hyperbolic Graph Attention Network"☆18Jun 10, 2021Updated 4 years ago
- C++ STFT and others☆40Feb 11, 2026Updated last month
- Generating sensor signals in isotropic noise fields (MATLAB)☆46Mar 17, 2023Updated 3 years ago