rorro6787 / img-desc-visually-impaired
Image description System for Impaired people
☆13Updated 4 months ago
Alternatives and similar repositories for img-desc-visually-impaired:
Users that are interested in img-desc-visually-impaired are comparing it to the libraries listed below
- Computer Vision Helping Library☆17Updated 2 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 7 months ago
- Eye exploration☆23Updated 2 months ago
- ☆16Updated 8 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- Vehicle speed estimation using YOLOv8☆30Updated 9 months ago
- ☆24Updated last year
- ☆13Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆10Updated 5 months ago
- ☆29Updated last year
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆10Updated 8 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 10 months ago
- ☆59Updated last year
- ☆13Updated 10 months ago
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆21Updated 10 months ago
- Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI.☆16Updated 2 weeks ago
- This project is an implementation of fine-tuning an SDXL model using DreamBooth and LoRA on custom data of interior rooms to generate des…☆10Updated 11 months ago
- BH hackathon☆14Updated 9 months ago
- ☆10Updated 7 months ago
- ☆46Updated 11 months ago
- ☆14Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆79Updated 7 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆40Updated 4 months ago
- ☆40Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆64Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆26Updated 8 months ago
- ☆16Updated 11 months ago
- ☆10Updated last year
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Updated last week