shahizat / jetsonGPT
Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech
☆117 · Updated last year
Alternatives and similar repositories for jetsonGPT:
Users interested in jetsonGPT are comparing it to the libraries listed below:
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4 ☆19 · Updated last year
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆73 · Updated 4 months ago
- streaming speech to text server using Whisper ☆90 · Updated last year
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆245 · Updated 4 months ago
- ONNX implementation of Whisper. PyTorch free. ☆92 · Updated 3 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT ☆202 · Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 · Updated last year
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 … ☆153 · Updated 6 months ago
- A simple TTS server for generating speech using StyleTTS2 ☆36 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated 10 months ago
- On-device speaker recognition engine powered by deep learning ☆32 · Updated 3 weeks ago
- Pybind11 bindings for Whisper.cpp ☆54 · Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated last year
- FastAPI service on top of WhisperX ☆71 · Updated last week
- On-device LLM Inference Powered by X-Bit Quantization ☆220 · Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction ☆91 · Updated 10 months ago
- ONNX and TensorRT implementation of Whisper ☆61 · Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆58 · Updated last year
- Recipes for on-device voice AI and local LLM ☆77 · Updated 2 weeks ago
- Open source repo for AI in a Box. ☆63 · Updated 10 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP ☆48 · Updated 9 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆177 · Updated 6 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆69 · Updated 9 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆36 · Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others ☆46 · Updated last year
- OpenAI Whisper for edge devices ☆124 · Updated last year
- WhisperX Service love docker! ☆13 · Updated 6 months ago
- Locally running LLM with internet access ☆94 · Updated last week
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retrieve message formatting, add more mo… ☆12 · Updated last year
- whisper.cpp bindings for python ☆89 · Updated last year