ymrohit / openscenesense-ollama
OpenSceneSense Ollama is a Python library that harnesses AI for advanced local video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
☆19Updated 6 months ago
Alternatives and similar repositories for openscenesense-ollama
Users who are interested in openscenesense-ollama are comparing it to the libraries listed below.
- A cutting-edge cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆87Updated last week
- The agentic video editing framework☆117Updated 3 months ago
- Video Voiceover with gpt-4o-mini☆33Updated 7 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆136Updated 3 weeks ago
- ☆57Updated 3 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆73Updated last month
- High level tool use for LLMs☆34Updated 9 months ago
- YouTube Script Writer is an open-source AI agent that generates tailored video scripts based on title, language, tone, and length. It str…☆11Updated 2 months ago
- ☆29Updated 11 months ago
- Example LangGraph flow that does "competitor analysis" on the web.☆28Updated 11 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆35Updated 6 months ago
- Generate full-fledged PDF reports using LLMs like GPT, Claude, Llama☆52Updated 11 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 11 months ago
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel…☆82Updated 9 months ago
- Choose a topic, a music genre and wait for the agents to generate a song☆55Updated 10 months ago
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆23Updated 7 months ago
- A free local Manus alternative AI agent app, specializing in marketing. It helps you write, edit, and cross-post marketing content across…☆20Updated this week
- Clip any moment from any video with prompts☆115Updated 4 months ago
- ☆18Updated last year
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆41Updated 9 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆225Updated 3 months ago
- B-Llama3o: a llama3 with vision and audio understanding, as well as text, audio, and animation data output.☆26Updated 11 months ago
- Open-source Perplexity app.☆124Updated last month
- Self-hosted Ollama + Whisper powered AI medical scribe.☆28Updated this week
- This repo implements a GUI for Chatting with your PDF files using PaLM embedding and LLM via API.☆26Updated last year
- Complex RAG backend☆28Updated last year
- OLLama IMage CAtegorizer☆67Updated 4 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- AI-powered tool for automatic podcast script and audio generation.☆67Updated 2 years ago
- VideoDB Python SDK☆71Updated last week