rorro6787 / img-desc-visually-impaired
Image description System for Impaired people
☆14Updated 2 months ago
Alternatives and similar repositories for img-desc-visually-impaired:
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
- Eye exploration☆25Updated last month
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆13Updated 8 months ago
- ☆19Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 3 months ago
- ☆13Updated last year
- Simple CogVLM client script☆14Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆18Updated 5 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smooth…☆56Updated 10 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 8 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI.☆20Updated last week
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- ☆21Updated 4 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 9 months ago
- Gen AI Large Language Model Projects☆57Updated 10 months ago
- Create topological graph for image segments.☆22Updated 6 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated last year
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆13Updated last month
- Hub for researchers exploring VLMs and Multimodal Learning:)☆19Updated this week
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama☆31Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 7 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated last week
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆12Updated 11 months ago
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Updated 2 weeks ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated 3 weeks ago
- ☆16Updated 2 months ago
- ☆16Updated 10 months ago