owenliang / wakeword-torch
☆18Dec 8, 2024Updated last year
Alternatives and similar repositories for wakeword-torch
Users who are interested in wakeword-torch compare it to the libraries listed below
- Chinese speech recognition (speech recognition model trained on the AISHELL-3 dataset)☆11Oct 17, 2024Updated last year
- ☆14Apr 4, 2025Updated 10 months ago
- Utilized attention incorporated UNet model for conditional image generation using Flow Matching with Conditional Optimal Transport Object…☆13Dec 29, 2023Updated 2 years ago
- Simplistic Implementation of Zipformer: A faster and better encoder for automatic speech recognition in PyTorch☆18Jun 3, 2024Updated last year
- Python & OpenCV gesture recognition system (complete source code, custom UI, and video tutorial)☆22Nov 12, 2023Updated 2 years ago
- Copied from official repo of VITS. Added some comments.☆19Sep 24, 2024Updated last year
- ☆18Mar 20, 2024Updated last year
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆28Dec 11, 2025Updated 2 months ago
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- Towards a general language-audio model for computational paralinguistic tasks☆23Dec 14, 2024Updated last year
- ☆36Jun 25, 2025Updated 7 months ago
- Tracks your research activities, creates detailed timelines, and exports notes to platforms like Flomo.☆25Oct 27, 2025Updated 3 months ago
- ☆23Oct 17, 2024Updated last year
- A unified Python simulation and hardware communication environment for Franka FR3 robots.☆21Aug 15, 2024Updated last year
- Asynchronous voice dialogue component.☆32Mar 13, 2025Updated 11 months ago
- Init☆40Sep 6, 2023Updated 2 years ago
- Zotero Review Assistant is a Zotero plugin that aims to streamline the process of organizing articles for review research.☆44Nov 14, 2025Updated 3 months ago
- Deploys LivePortrait portrait animation generation with onnxruntime, with both C++ and Python versions of the program☆31Aug 5, 2024Updated last year
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation☆46Feb 3, 2026Updated last week
- ☆62Jul 28, 2025Updated 6 months ago
- Panda model for dm_robotics☆44Jun 24, 2025Updated 7 months ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆49Jul 28, 2025Updated 6 months ago
- Vox-Profile Benchmark☆67Sep 12, 2025Updated 5 months ago
- C++ Library for Interfacing with Libfranka and Frankapy☆63Feb 10, 2025Updated last year
- How to use our public wav2vec2 age and gender model☆53Sep 4, 2023Updated 2 years ago
- Speech recognition implemented with Python and TensorFlow☆48Oct 30, 2018Updated 7 years ago
- ☆52Feb 17, 2023Updated 2 years ago
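- Reborn as an AI worker. In my previous life I was a nobody, coming and going in a hurry, not knowing where I would be born. Yet fate gave me a rare opportunity and let me be reborn as an AI worker.☆50Jun 24, 2023Updated 2 years ago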
- FACTR Hardware☆77Jun 15, 2025Updated 8 months ago
- Qwen2.5 0.5B GRPO☆78Feb 16, 2025Updated last year
- Deep learned features for long-term localization in Visual Teach and Repeat☆62Jun 9, 2024Updated last year
- This document is a Chinese-language tutorial on using Habitat-sim☆72Mar 10, 2023Updated 2 years ago
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"☆109Oct 16, 2025Updated 4 months ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆81Apr 15, 2025Updated 10 months ago
- [TMECH 2023] Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling☆74Mar 14, 2024Updated last year
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆210Jan 20, 2026Updated 3 weeks ago
- ☆79Jun 9, 2023Updated 2 years ago
- Fine-tune the LLM part of the spark-tts model☆120Mar 25, 2025Updated 10 months ago
- Accelerates cosyvoice2 inference with vllm☆482Apr 26, 2025Updated 9 months ago