ymrohit / openscenesense-ollama
OpenSceneSense Ollama is a Python library that harnesses AI for advanced local video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
☆18Updated 3 months ago
Alternatives and similar repositories for openscenesense-ollama:
Users that are interested in openscenesense-ollama are comparing it to the libraries listed below
- High level tool use for LLMs☆34Updated 6 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- Automate complex business workflows with our Multi-AI-Agent Systems using crewAI. This framework leverages autonomous, role-specific AI a…☆63Updated 8 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆12Updated 10 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆31Updated last year
- ☆29Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆40Updated 4 months ago
- Multi-person podcast audio to videocast☆10Updated 4 months ago
- Embed anything.☆29Updated 8 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated 9 months ago
- Run Ollama LLM models in Google Colab for free☆32Updated 2 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆35Updated last year
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆38Updated 6 months ago
- Clip any moment from any video with prompts☆83Updated last month
- Choose a topic, a music genre and wait for the agents to generate a song☆52Updated 7 months ago
- The PyVisionAI Official Repo☆60Updated this week
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Updated last year
- Qwen2 VL Fine Tuning using Llama Factory☆12Updated 5 months ago
- All the content of my youtube channel : https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆12Updated 2 weeks ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆82Updated last month
- ☆13Updated last month
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 8 months ago
- Local & Private LLM that drafts responses LIKE you automatically☆75Updated 3 months ago
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel…☆73Updated 7 months ago
- Example LangGraph flow that does "competitor analysis" on the web.☆23Updated 8 months ago
- Dabarqus is incredibly fast RAG that runs everywhere.☆56Updated 3 weeks ago