shahizat / JetsonGPT
Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech
☆118Updated last year
Alternatives and similar repositories for JetsonGPT:
Users that are interested in JetsonGPT are comparing it to the libraries listed below
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆81Updated 6 months ago
- OpenAI Whisper for edge devices☆125Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆63Updated last year
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆21Updated last year
- Pybind11 bindings for Whisper.cpp☆57Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- On-device speaker recognition engine powered by deep learning☆35Updated this week
- ☆107Updated last month
- Open source repo for AI in a Box.☆63Updated last year
- LLaVA server (llama.cpp).☆180Updated last year
- Live transcription with OpenAi Whisper☆50Updated 2 years ago
- Passively collect images for computer vision datasets on the edge.☆33Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆19Updated 8 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆262Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- ONNX implementation of Whisper. PyTorch free.☆95Updated 5 months ago
- streaming speech to text server using Whisper☆92Updated last year
- Recipes for on-device voice AI and local LLM☆81Updated this week
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆55Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- ☆156Updated last year
- whisper.cpp bindings for python☆95Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 9 months ago
- ☆24Updated 2 weeks ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆56Updated 6 months ago
- NVIDIA Riva runnable tutorials☆130Updated last month
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆206Updated last year