shahizat / Vision2Audio_2
Vision2Audio - Giving the blind an understanding through AI. Uses LLaVA through MLC LLM to describe images, together with the Nvidia Riva Speech AI SDK.
☆11 Updated last year
Alternatives and similar repositories for Vision2Audio_2
Users that are interested in Vision2Audio_2 are comparing it to the libraries listed below
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models ☆27 Updated last year
- Passively collect images for computer vision datasets on the edge. ☆33 Updated last year
- Example LangGraph flow that does "competitor analysis" on the web. ☆28 Updated 11 months ago
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Experiments with CV ☆29 Updated 4 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆11 Updated 9 months ago
- Repo of the code from the Medium article ☆20 Updated 11 months ago
- Streamlit App for Blood Cell Count Dataset ☆18 Updated 2 years ago
- ⚕️ Applying LLM-powered Personal AI Assistant to Enhance Support for Physical Rehabilitation & Telerehabilitation Therapists, Students, a… ☆14 Updated last year
- MLFlow End to End Workshop at Chandigarh University ☆11 Updated 2 years ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆13 Updated last year
- Your Chatbot Mastery: build a super small custom AI assistant with Gradio_client Python and Streamlit - Chapter 1 ☆16 Updated last year
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning. ☆17 Updated 6 months ago
- Computer Vision Helping Library ☆33 Updated 6 months ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl… ☆17 Updated 11 months ago
- On-device LLM Inference using Mediapipe LLM Inference API. ☆21 Updated last year
- This repo implements a GUI for Chatting with your PDF files using PaLM embedding and LLM via API. ☆26 Updated last year
- Mobile (i.e., Android, iOS) foundation model (i.e., LLM, VLM) deployed with MLC ☆16 Updated 3 months ago
- AI voice assistant made with Streamlit python and powered by Gemini, Mistral and PHI-3 ☆12 Updated 8 months ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant ☆12 Updated 2 years ago
- 👁️ Multimodal LLM vision multitool ☆26 Updated 7 months ago
- Enhance your photos using AI ☆48 Updated 8 months ago
- ☆18 Updated 3 months ago
- Chatbot web-applications with LLM, OpenAI API Assistants, LangChain, vector databases, and other AI stuff ☆25 Updated last year
- Chat with Documents from scratch using LLMs and a vector database ☆18 Updated last year
- AI narrator ☆15 Updated last year
- ☆20 Updated this week
- Simple demo project with OpenAI's API and TTS ☆15 Updated 2 years ago
- Demo of realtime face recognition with Taipy ☆32 Updated 8 months ago
- Explore the latest AI Agent Framework! ☆60 Updated 9 months ago