shahizat / JetsonGPT
Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech
☆117Updated last year
Alternatives and similar repositories for JetsonGPT:
Users that are interested in JetsonGPT are comparing it to the libraries listed below
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆19Updated last year
- OpenAI Whisper for edge devices☆123Updated last year
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆71Updated 4 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Open source repo for AI in a Box.☆64Updated 9 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆211Updated this week
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆199Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆109Updated last year
- Pybind11 bindings for Whisper.cpp☆49Updated 2 weeks ago
- ONNX implementation of Whisper. PyTorch free.☆92Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆111Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- NVIDIA Riva runnable tutorials☆123Updated 2 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆237Updated 4 months ago
- Recipes for on-device voice AI and local LLM☆76Updated 3 weeks ago
- whisper.cpp bindings for python☆87Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆41Updated last year
- ☆312Updated 7 months ago
- On-device speaker recognition engine powered by deep learning☆32Updated this week
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆68Updated 2 weeks ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆84Updated last week
- ☆97Updated this week
- LLaVA server (llama.cpp).☆177Updated last year
- This repo has the code of the 3 demos I presented at Google Gemma2 DevDay Tokyo, using Gemma2 on a Jetson Orin Nano device.☆35Updated 3 months ago
- Locally running LLM with internet access☆93Updated 4 months ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- Efficient Inference of Transformer models☆420Updated 6 months ago