SkalskiP / sketchy-visionView external linksLinks
Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!
β150Mar 13, 2023Updated 2 years ago
Alternatives and similar repositories for sketchy-vision
Users that are interested in sketchy-vision are comparing it to the libraries listed below
Sorting:
- A component that allows you to annotate an image with points and boxes.β21Dec 12, 2023Updated 2 years ago
- ποΈ + π¬ + π§ = π€ Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]β638Feb 29, 2024Updated last year
- β11Jan 29, 2023Updated 3 years ago
- Implementation of Hilbert beamforming for SNN-based audio source localisationβ16Oct 2, 2024Updated last year
- Transformer based Trigram Blocking implementation in Tensorflowβ11Feb 26, 2020Updated 5 years ago
- Skynetβ86Jun 20, 2023Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β85May 29, 2024Updated last year
- Simple CogVLM client scriptβ14Dec 20, 2023Updated 2 years ago
- OpenMMLab Detection Toolbox and Benchmark for V3Detβ15Apr 3, 2024Updated last year
- Easy & Modular Computer Vision Detectors, Trackers & SAM - Run YOLOv9,v8,v7,v6,v5,R,X in under 10 lines of code.β614May 23, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"β19Mar 10, 2025Updated 11 months ago
- β19Nov 23, 2023Updated 2 years ago
- β23Nov 5, 2024Updated last year
- A fork of sqlite-utils with CLI etc removedβ17Jan 29, 2026Updated 2 weeks ago
- MSPaint for marimo and other Python notebooksβ25Oct 24, 2025Updated 3 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specificβ¦β80Sep 13, 2024Updated last year
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. π₯ [Paper + Code + Demo]β742Jun 2, 2025Updated 8 months ago
- Localized Vision-Language Matching for Open-vocabulary Object Detectionβ22Aug 11, 2022Updated 3 years ago
- β25Dec 13, 2024Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tablesβ21May 18, 2025Updated 8 months ago
- A guide to Ultralytics' mission, vision, values, and practices, providing key insights and resources for aligning with our goals.β61Feb 9, 2026Updated last week
- Football Match Analysis Using YOLO (You Only Look Once).β27Nov 19, 2024Updated last year
- Build your own (Level 3) Self Driving Vehicle capable of the following features, 1) Autonomously navigate through the custom bβ¦β14Feb 16, 2023Updated 3 years ago
- Extract Molecular SMILES embeddings from language models pre-trained with various objectives architectures.β18Nov 9, 2023Updated 2 years ago
- Images to inference with no labeling (use foundation models to train supervised models).β2,624May 14, 2025Updated 9 months ago
- Towards LLM Empowered Recommendation via Tool Learningβ22Aug 8, 2025Updated 6 months ago
- β20Mar 4, 2025Updated 11 months ago
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frameβ¦β19Jun 4, 2023Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ88Oct 20, 2023Updated 2 years ago
- PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimationβ47Sep 28, 2023Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196May 6, 2024Updated last year
- Code Release for WACV 2024 , "SSP: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point clouds"β26Jan 8, 2024Updated 2 years ago
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.β35Mar 28, 2024Updated last year
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,660Feb 9, 2026Updated last week
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures lβ¦β9,163Feb 3, 2026Updated last week
- Spiking Neural Network for computing the DFT and the CFAR on automotive radarβ24May 12, 2021Updated 4 years ago
- Multibackend Graph Neural Networks in Keras 3β25Jan 31, 2024Updated 2 years ago
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densiβ¦β21Mar 16, 2024Updated last year
- YOLOv8 Segmentation with DeepSORT Object Tracking (ID + Trails)β283May 25, 2023Updated 2 years ago