rorro6787 / img-desc-visually-impaired
Image description System for Impaired people
☆15Updated 3 months ago
Alternatives and similar repositories for img-desc-visually-impaired:
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
- Simple CogVLM client script☆14Updated last year
- Gen AI Large Language Model Projects☆57Updated 11 months ago
- YOLOv10: Real-Time End-to-End Object Detection☆10Updated 11 months ago
- Eye exploration☆26Updated 2 months ago
- ☆40Updated 7 months ago
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆13Updated 8 months ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆37Updated last year
- ☆21Updated 5 months ago
- This Repository demostrates various examples using YOLO☆13Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 8 months ago
- Solving Computer Vision with AI agents☆29Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 8 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- EdgeSAM model for use with Autodistill.☆26Updated 10 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated last year
- Roboflow Workflows on ComfyUI☆32Updated 7 months ago
- ☆46Updated last year
- List of resources helping you become a better AI engineer.☆23Updated 3 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated last month
- 101 for setting up adaptive RAG☆15Updated 7 months ago
- BH hackathon☆14Updated last year
- ☆10Updated 10 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- LLM as a Chatbot Service☆16Updated last year
- Create topological graph for image segments.☆22Updated 6 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- ☆16Updated last year
- ☆24Updated last year
- ☆18Updated 2 months ago