rorro6787 / img-desc-visually-impairedLinks
Image description System for Impaired people
☆15Updated 6 months ago
Alternatives and similar repositories for img-desc-visually-impaired
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
Sorting:
- ☆39Updated 11 months ago
- Real-time, YOLO-like object detection using Florence-2 with a user-friendly GUI.☆29Updated last week
- This repo gives a start for the docker.☆31Updated last year
- Eye exploration☆27Updated 6 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- ☆19Updated last year
- Simple CogVLM client script☆14Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- EdgeSAM model for use with Autodistill.☆27Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Updated last year
- Notebooks using the Neural Magic libraries 📓☆40Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆82Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆12Updated 3 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆11Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated 3 weeks ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 5 months ago
- ☆21Updated last year
- Create topological graph for image segments.☆22Updated 10 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- ☆21Updated 9 months ago
- 100 Days of GPU Challenge☆21Updated last month
- ☆13Updated last year
- Computer Vision Helping Library☆43Updated 9 months ago
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆14Updated last year
- ☆11Updated last year
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆19Updated 5 months ago