rorro6787 / img-desc-visually-impairedLinks
Image description System for Impaired people
☆15Updated 5 months ago
Alternatives and similar repositories for img-desc-visually-impaired
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
Sorting:
- Eye exploration☆27Updated 5 months ago
- This repo gives a start for the docker.☆30Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- Simple CogVLM client script☆14Updated last year
- ☆39Updated 10 months ago
- Real-time, YOLO-like object detection using Florence-2 with a user-friendly GUI.☆28Updated 3 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Updated 11 months ago
- Take your LLM to the optometrist.☆33Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 11 months ago
- ☆23Updated 9 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- EdgeSAM model for use with Autodistill.☆27Updated last year
- ☆24Updated last year
- ☆11Updated last year
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆43Updated 2 weeks ago
- ☆20Updated last year
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation☆51Updated 9 months ago
- Auto generated blog posts of papers in the field of AI powered by "Paper Reviewer" project.☆18Updated 2 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆35Updated this week
- YOLOv10: Real-Time End-to-End Object Detection☆11Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆12Updated 2 months ago
- ☆40Updated last year
- ☆21Updated 8 months ago
- ☆46Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Roboflow Workflows on ComfyUI☆33Updated 9 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆48Updated 10 months ago