https://deep-learning-101.github.io/Speech-Processing Speech Processing (語音處理)
☆23Mar 18, 2026Updated this week
Alternatives and similar repositories for Speech-Processing-Paper
Users that are interested in Speech-Processing-Paper are comparing it to the libraries listed below
Sorting:
- AIRS-2025赛道二:「星际矿脉」火星矿物高光谱分类挑战赛☆12May 7, 2025Updated 10 months ago
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆64Sep 18, 2025Updated 6 months ago
- low-latency realtime ASR based on FireRedASR☆59Jul 8, 2025Updated 8 months ago
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆27Updated this week
- A simple implementation for improving CosyVoice2 by GRPO method☆34Oct 17, 2025Updated 5 months ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- An easy-to-use library and command-line tool for TTS☆15May 3, 2025Updated 10 months ago
- Benchmarks for Business Document Foundation Models☆10Apr 4, 2024Updated last year
- An easy auto framework☆11Nov 14, 2023Updated 2 years ago
- A Snowflake SQL parser (WIP)☆11May 31, 2020Updated 5 years ago
- ☆16May 26, 2025Updated 9 months ago
- [ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?"☆27Sep 20, 2023Updated 2 years ago
- A super-fast proxy server port scanner一个超级快的端口扫描器☆24Aug 31, 2025Updated 6 months ago
- C# library with very fast but not very accurate realisations of System.Math methods.☆12Jun 4, 2017Updated 8 years ago
- This tool helps you easily deploy ASR models on NPUs on AMD's Ryzen AI 300 series laptops☆22Jan 29, 2026Updated last month
- ☆12Dec 1, 2021Updated 4 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- ☆13Dec 25, 2023Updated 2 years ago
- ☆11Sep 26, 2024Updated last year
- ☆12Jul 2, 2025Updated 8 months ago
- This repository contains the official implementation (PyTorch) of "Multimodal Forgery Detection Using Ensemble Learning" proposed in APSI…☆10Jan 4, 2023Updated 3 years ago
- ☆12Mar 6, 2023Updated 3 years ago
- ☆13Sep 7, 2023Updated 2 years ago
- Custom notebook extension that utilizes a new domain specific language, Chemical Markdown Language (CMDL), to assist in documentation of …☆16Sep 17, 2025Updated 6 months ago
- ☆13Mar 30, 2023Updated 2 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- Awesome AI Tools for Game Development: A curated collection of the best AI tools, libraries, and resources to enhance game development wo…☆18Feb 3, 2026Updated last month
- A hover zoom effect to see a closer view of the image details.☆11Jan 13, 2025Updated last year
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…☆14Jan 25, 2024Updated 2 years ago
- Simple NATS JetStream UI☆11Apr 3, 2024Updated last year
- Python Web Scraper☆13Mar 7, 2018Updated 8 years ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆85Jan 9, 2024Updated 2 years ago
- ☆21Jun 12, 2024Updated last year
- ☆42Nov 4, 2025Updated 4 months ago
- Config files for my GitHub profile.☆15Feb 13, 2026Updated last month
- 🦅 VSCode extension for F* with IDE features☆16Mar 21, 2020Updated 6 years ago
- F# Spark & ML.NET Sample for On .NET☆12Jul 25, 2021Updated 4 years ago
- Generate AAS models from PDF raw text with LLM.☆18Nov 23, 2025Updated 3 months ago