jitsi / skynetLinks
AI core services for Jitsi
☆57Updated last week
Alternatives and similar repositories for skynet
Users that are interested in skynet are comparing it to the libraries listed below
Sorting:
- ☆26Updated 2 years ago
- Transcription and annotation interface for recorded audio or video files☆35Updated this week
- An automatic speech recognition API☆61Updated this week
- Deploy Jibri using Pulse-Audio. Also supports streaming to Facebook and uses Rclone to copy the files to any S3 compatible storage.☆22Updated 2 years ago
- Physical meetings rooms, reimagined.☆73Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 11 months ago
- Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language☆38Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆63Updated last week
- ☆18Updated 5 months ago
- Official Deepgram resources for deploying Deepgram services in a self-hosted environment☆19Updated this week
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆67Updated last year
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning.☆310Updated this week
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 7 months ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- Build Phone Calling Voice Agent fully powered by open source models.☆46Updated 2 months ago
- ☆143Updated last year
- faster-whisper as serverless endpoint☆105Updated last month
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- AI chatbot for Matrix with infinite personalties, using ollama☆48Updated last week
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆30Updated last week
- Joint speech-language model - respond directly to audio!☆30Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- Jitsi deployment on Kubernetes with JVB autoscale and OCTO region enabled☆36Updated 3 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆115Updated last year
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆52Updated this week
- Speaker diarization model☆27Updated 2 years ago
- 🚀 A Jitsi deployment on Kubernetes with autoscaling features☆14Updated last year
- On-device noise suppression powered by deep learning☆73Updated this week
- kokoro text to speech using javascript☆58Updated 4 months ago