dusty-nv / jetson-voiceView external linksLinks
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
☆222Feb 9, 2024Updated 2 years ago
Alternatives and similar repositories for jetson-voice
Users that are interested in jetson-voice are comparing it to the libraries listed below
Sorting:
- Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson Automatic Speech Recognition (ASR) deep learning interface li…☆13Feb 6, 2026Updated last week
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆23Aug 20, 2021Updated 4 years ago
- Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.☆199Aug 2, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T☆4,365Updated this week
- C++/CUDA/Python multimedia utilities for NVIDIA Jetson☆870Oct 16, 2025Updated 4 months ago
- Training of visual odometry estimation networks using PyTorch☆16Jan 23, 2020Updated 6 years ago
- Easy to use Python camera interface for NVIDIA Jetson☆456Aug 14, 2020Updated 5 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 3 months ago
- ☆19Nov 4, 2022Updated 3 years ago
- Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.☆8,719Oct 16, 2025Updated 4 months ago
- ☆12Nov 8, 2023Updated 2 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- 📊 Simple package for monitoring and control your NVIDIA Jetson [Orin, Xavier, Nano, TX] series☆2,473Updated this week
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆24Dec 20, 2022Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- PDF slides about NVIDIA's Jetson embedded platform and deep learning.☆64Mar 16, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Batch inference version of Jetson-inference, to run several images recognition on TX1/2 and PC at the same time to save time☆12Dec 20, 2017Updated 8 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- All-in-one Speech Transcription☆10Jan 25, 2026Updated 3 weeks ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition☆11Oct 1, 2019Updated 6 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Sample project to demonstrate how to integrate several existing balena projects in one☆10Dec 27, 2019Updated 6 years ago
- Enables Jetson to be controlled with handpose using trt_pose☆12Mar 16, 2021Updated 4 years ago
- ☆11Nov 7, 2024Updated last year
- ☆10Jun 1, 2023Updated 2 years ago
- ROS nodes and Gazebo model for NVIDIA JetBot with Jetson Nano☆395Apr 30, 2022Updated 3 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- GUI tool for collecting & labeling data from live camera feed☆44Jun 13, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago