rorro6787 / img-desc-visually-impaired
Image description system for visually impaired people
☆15 Updated 8 months ago
Alternatives and similar repositories for img-desc-visually-impaired
Users who are interested in img-desc-visually-impaired are comparing it to the libraries listed below.
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆67 Updated last year
- Simple CogVLM client script ☆13 Updated last year
- Eye exploration ☆29 Updated 8 months ago
- ☆26 Updated last year
- This repo gives a start for the docker. ☆33 Updated last year
- ☆39 Updated last year
- EdgeSAM model for use with Autodistill. ☆29 Updated last year
- ☆24 Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆87 Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆12 Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆46 Updated last year
- ☆18 Updated 2 years ago
- Play Chrome's Dinosaur Game with Reinforcement Learning ☆11 Updated 2 years ago
- 6D Rotation Representation for Unconstrained Head Pose Estimation ☆15 Updated 2 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆69 Updated last year
- Real-time object detection using Florence-2 with a user-friendly GUI. ☆30 Updated 2 months ago
- ☆29 Updated last year
- ☆16 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆15 Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 Updated 2 years ago
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files ☆29 Updated last year
- Vehicle speed estimation using YOLOv8 ☆29 Updated last year
- ☆12 Updated 4 months ago
- Enhancement in Multimodal Representation Learning. ☆40 Updated last year
- Take your LLM to the optometrist. ☆40 Updated 2 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆65 Updated last year
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis… ☆13 Updated 11 months ago