ymrohit / openscenesense-ollama
OpenSceneSense Ollama is a Python library that harnesses AI for advanced local video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
☆18 Updated 5 months ago
Alternatives and similar repositories for openscenesense-ollama:
Users who are interested in openscenesense-ollama compare it to the libraries listed below.
- High level tool use for LLMs ☆34 Updated 8 months ago
- The agentic video editing framework ☆104 Updated last month
- A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆50 Updated this week
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆61 Updated last week
- Local & Private LLM that drafts responses LIKE you automatically ☆78 Updated 4 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆30 Updated 4 months ago
- ☆29 Updated 10 months ago
- ☆44 Updated 8 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆117 Updated 5 months ago
- ☆27 Updated this week
- ☆22 Updated 5 months ago
- Docling with Ollama - RAG on Local Files with Local Models ☆55 Updated 3 months ago
- Agentic RAG to help you build a startup🚀 ☆18 Updated this week
- Python language chat with Ollama models locally, anthropic and openai ☆17 Updated this week
- Jockey is a conversational video agent. ☆76 Updated 2 months ago
- Clip any moment from any video with prompts ☆98 Updated 3 months ago
- Video Voiceover with gpt-4o-mini ☆33 Updated 6 months ago
- Choose a topic, a music genre and wait for the agents to generate a song ☆53 Updated 9 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a… ☆107 Updated 6 months ago
- Generate full fledged PDF reports using LLMs like GPT, Claude, Llama ☆49 Updated 10 months ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa… ☆47 Updated 3 weeks ago
- ☆28 Updated 6 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆19 Updated 5 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ☆79 Updated 7 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 Updated 9 months ago
- A discovery and compression tool for your Python codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your… ☆79 Updated 4 months ago
- Reliable RAG setup that uses Semantic Double Merging Chunking from llamaindex, Qdrant Hybrid Search, colBERT for reranking and Google Gem… ☆37 Updated 3 months ago
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit. ☆24 Updated 6 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆51 Updated 5 months ago
- Simple Streamlit UI for Ollama ☆18 Updated 10 months ago