rorro6787 / img-desc-visually-impairedLinks
Image description System for Impaired people
☆15Updated 8 months ago
Alternatives and similar repositories for img-desc-visually-impaired
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
Sorting:
- ☆18Updated 2 years ago
- ☆39Updated last year
- Eye exploration☆28Updated 7 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- This repo gives a start for the docker.☆32Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated last year
- Vehicle speed estimation using YOLOv8☆30Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆48Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Updated last year
- Real-time object detection using Florence-2 with a user-friendly GUI.☆30Updated last month
- ☆21Updated 10 months ago
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆70Updated last year
- Take your LLM to the optometrist.☆40Updated last month
- ☆16Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆12Updated 4 months ago
- Simple CogVLM client script☆14Updated last year
- Practice Notebook for AI Course☆11Updated 6 months ago
- ☆25Updated last year
- ☆20Updated last year
- ☆13Updated last year
- YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smooth…☆56Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- Daily.co + Pipecat + Tavus AI Avatar Agent☆13Updated 5 months ago
- ☆17Updated last year
- EdgeSAM model for use with Autodistill.☆29Updated last year
- ☆46Updated last year
- ☆13Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated this week