shahizat / Vision2Audio_2
Vision2Audio - Giving the blind an understanding through AI. Utilizing the LLaVA through MLC LLM to describe the image using Nvidia Riva Speech AI SDK
☆11Updated last year
Alternatives and similar repositories for Vision2Audio_2:
Users that are interested in Vision2Audio_2 are comparing it to the libraries listed below
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- Example showing how to do inference on a video file with Roboflow Infer☆48Updated last year
- YouTube Assistant☆12Updated last year
- ⚕️ Applying LLM-powered Personal AI Assistant to Enhance Support for Physical Rehabilitation & Telerehabilitation Therapists, Students, a…☆13Updated last year
- Simple demo project with OpenAI's API and TTS☆15Updated 2 years ago
- The Customer Care Bot is a cutting-edge customer support solution designed to revolutionize the way e-commerce websites interact with and…☆9Updated last year
- Luann allows you to create a LLM agent,which has complete memory module (long-term memory, short-term memory) and knowledge module(Variou…☆19Updated 2 weeks ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 8 months ago
- An LLM-powered self-studying app using retrieval-augmented generation prompting | Streamlit LLM Hackathon 2023☆17Updated last year
- ☆29Updated last year
- This repo implements a GUI for Chatting with your PDF files using PaLM embedding and LLM via API.☆26Updated last year
- The Yahoo Finance Agent is an application that combines OpenAI's LLMs, the Yahoo Finance Python library, and LangChain's tools to provide…☆14Updated 7 months ago
- Example LangGraph flow that does "competitor analysis" on the web.☆28Updated 9 months ago
- Simple CogVLM client script☆14Updated last year
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆16Updated last year
- ☆16Updated 10 months ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆16Updated 10 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆17Updated 4 months ago
- This repo explains the custom object detection training using Yolov8.☆17Updated 2 years ago
- Computer Vision Helping Library☆26Updated 4 months ago
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated last year
- ☆14Updated last year
- Open-source, knowledge-grounded conversational AI☆13Updated 4 months ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆10Updated last year
- LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading a…☆9Updated last year
- Financial CrewAI Agents (LangChain, YF Tools, Ai Crew, Groq Inference)☆20Updated 8 months ago
- Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on…☆38Updated 3 months ago
- Extensible ChatGPT Frontend to search the web, create files and execute arbitrary commands☆9Updated last year
- Detect objects in images in a web browser using the YOLOv8 neural network☆34Updated last year
- ☆11Updated last year