ymrohit / openscenesense-ollamaLinks

OpenSceneSense Ollama is a Python library that harnesses AI for advanced local video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.

☆26

Alternatives and similar repositories for openscenesense-ollama

Users that are interested in openscenesense-ollama are comparing it to the libraries listed below

Sorting:

tarun7r / Vocal-Agent
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆102Updated 2 weeks ago
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆198Updated 3 months ago
ymrohit / openscenesense
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dyna…
☆16Updated 8 months ago
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆484Updated 3 weeks ago
phildougherty / sesame_csm_openai
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆374Updated 2 weeks ago
kaminoer / KokoDOS
Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.
☆57Updated 6 months ago
devnen / Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…
☆298Updated 2 months ago
ValyrianTech / OpenVoice_server
API server for Instant voice cloning by MyShell.
☆98Updated 10 months ago
KartDriver / mira_converse
☆80Updated 5 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆233Updated 6 months ago
ExoFi-Labs / OllamaGTTS
☆186Updated 4 months ago
bigsk1 / voice-chat-ai
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro
☆292Updated 2 weeks ago
ijub / sesame_ai
Python client library for the Sesame AI API, enabling voice conversations with AI characters like Miles and Maya.
☆85Updated 4 months ago
sammyf / ollimca
OLLama IMage CAtegorizer
☆67Updated 6 months ago
akiani / aidialer
A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…
☆145Updated 3 months ago
Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…
☆268Updated 3 weeks ago
jesuscopado / samantha-os1-openai-realtime
Samantha OS1 is a conversational AI assistant powered by the Realtime API from OpenAI
☆157Updated 7 months ago
diffusionstudio / agent
The agentic video editing framework
☆144Updated 5 months ago
ayaansh-roy / voice_assistant_llm
☆68Updated last year
m92vyas / llm-reader
Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…
☆205Updated 3 weeks ago
HafizalJohari / lclv
☆89Updated 2 months ago
flatsiedatsie / papeg_ai
Code for Papeg.ai
☆225Updated 6 months ago
phildougherty / dia_openai
OpenAI compatible API for Dia-1.6B
☆35Updated 3 months ago
caspianmoon / memoripy
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
☆644Updated 6 months ago
nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆594Updated last week
BandarLabs / clickclickclick
A framework to enable autonomous android and computer use using any LLM (local or remote)
☆480Updated 2 weeks ago
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆302Updated 3 months ago
OminousIndustries / phone-use-agent
☆74Updated 4 months ago
Anil-matcha / AI-Voice-Agent
Self-hosted AI voice agent
☆112Updated 11 months ago
SamurAIGPT / AI-Faceless-Video-Generator
Generate a video script, voice and a talking face completely with AI
☆335Updated 5 months ago