rorro6787 / img-desc-visually-impaired
Image description System for Impaired people
☆13Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for img-desc-visually-impaired
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆10Updated 3 months ago
- Eye exploration☆22Updated this week
- ☆41Updated 2 months ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation☆45Updated last month
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆15Updated last week
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Vehicle speed estimation using YOLOv8☆30Updated 7 months ago
- Computer Vision Helping Library☆12Updated 2 weeks ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆9Updated 6 months ago
- ☆15Updated 6 months ago
- Create topological graph for image segments.☆18Updated last month
- ☆18Updated last year
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆20Updated 8 months ago
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆10Updated 3 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- Jupyter Notebooks with GPT Examples☆13Updated 4 months ago
- Notebooks using the Neural Magic libraries 📓☆41Updated 3 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated 3 months ago
- ☆24Updated 11 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆11Updated last week
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆11Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆47Updated this week
- Professional Wargaming LLM Toolbox☆10Updated last month
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- Testing the different LLM and RAG Tests while I learn along the way☆17Updated 2 months ago
- BH hackathon☆14Updated 7 months ago
- Simple CogVLM client script☆14Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- ☆12Updated 2 months ago