ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
☆223 · Feb 9, 2024 · Updated 2 years ago
Alternatives and similar repositories for jetson-voice
Users who are interested in jetson-voice are comparing it to the libraries listed below
- Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson Automatic Speech Recognition (ASR) deep learning interface li… ☆13 · Feb 6, 2026 · Updated last month
- OpenAI Whisper for edge devices ☆134 · Mar 21, 2023 · Updated 3 years ago
- ☆20 · Mar 23, 2024 · Updated last year
- Speechflow for emotion recognition related information decomposition ☆10 · Jul 27, 2021 · Updated 4 years ago
- Training of visual odometry estimation networks using PyTorch ☆16 · Jan 23, 2020 · Updated 6 years ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆131 · Jun 15, 2023 · Updated 2 years ago
- ☆23 · Aug 20, 2021 · Updated 4 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 4 months ago
- GUI tool for collecting & labeling data from live camera feed ☆44 · Jun 13, 2024 · Updated last year
- Isaac ROS common utilities, Dockerfiles, and testing code. ☆11 · Oct 20, 2021 · Updated 4 years ago
- Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson. ☆199 · Aug 2, 2024 · Updated last year
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T ☆4,474 · Mar 13, 2026 · Updated last week
- Diagnostics framework for micro-ROS ☆10 · Jun 4, 2025 · Updated 9 months ago
- ☆10 · Jun 1, 2023 · Updated 2 years ago
- Easy to use Python camera interface for NVIDIA Jetson ☆459 · Aug 14, 2020 · Updated 5 years ago
- C++/CUDA/Python multimedia utilities for NVIDIA Jetson ☆876 · Oct 16, 2025 · Updated 5 months ago
- Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson. ☆8,755 · Oct 16, 2025 · Updated 5 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Feb 15, 2024 · Updated 2 years ago
- The Intel IoT Examples Datastore provides a simple data store for the how-to-code-samples. ☆14 · Mar 9, 2022 · Updated 4 years ago
- MonoDepth Estimation - ICRA 2018 "Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image" ☆12 · Oct 18, 2019 · Updated 6 years ago
- ☆12 · Nov 8, 2023 · Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… ☆11 · Jun 28, 2021 · Updated 4 years ago
- Batch inference version of Jetson-inference, to run several images recognition on TX1/2 and PC at the same time to save time ☆12 · Dec 20, 2017 · Updated 8 years ago
- PDF slides about NVIDIA's Jetson embedded platform and deep learning. ☆64 · Mar 16, 2024 · Updated 2 years ago
- Jetbot tools is a set of ROS2 nodes that utilize the Jetson inference DNN vision library for NVIDIA Jetson ☆25 · Feb 6, 2026 · Updated last month
- 📊 Simple package for monitoring and control your NVIDIA Jetson [Orin, Xavier, Nano, TX] series ☆2,516 · Mar 6, 2026 · Updated 2 weeks ago
- A Qt5 GUI to simplify the camera calibration process using OpenCV ☆14 · Jun 23, 2022 · Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Mar 28, 2023 · Updated 2 years ago
- acnn for text-independent speaker recognition ☆10 · Feb 8, 2022 · Updated 4 years ago
- Sample project to demonstrate how to integrate several existing balena projects in one ☆10 · Dec 27, 2019 · Updated 6 years ago
- Speaker prediction for captions on the Lex Fridman podcast ☆27 · Feb 14, 2024 · Updated 2 years ago
- ☆188 · Jun 13, 2023 · Updated 2 years ago
- Hosting a tutorial documentation for running Isaac ROS Visual SLAM on Jetson device. ☆25 · Feb 28, 2024 · Updated 2 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec" ☆35 · Oct 23, 2025 · Updated 4 months ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi] ☆12 · Jun 5, 2020 · Updated 5 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Aug 16, 2023 · Updated 2 years ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆359 · Oct 18, 2024 · Updated last year