andrewjouffray / salient-extract
Salient feature extractor based on yoloV8
☆72Updated last year
Related projects ⓘ
Alternatives and complementary repositories for salient-extract
- Semi-sythetic data generator using the "copy - paste" method.☆11Updated last year
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!☆148Updated last year
- A tool for converting computer vision label formats.☆54Updated last year
- Create topological graph for image segments.☆19Updated last month
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆65Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆77Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆59Updated 3 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆154Updated 10 months ago
- Scripts to prep PC for development use after OS installs☆37Updated last week
- ☆60Updated last year
- an optimized, production-ready implementation of active speaker detection☆54Updated 5 months ago
- A colab notebook that combines Stable Diffusion + DALL-E Mini (Craiyon)☆123Updated 2 years ago
- Compare Savant and PyTorch performance☆13Updated 9 months ago
- LLaVA server (llama.cpp).☆177Updated last year
- High resolution image classifier. An expansion of the ResNet50 architecture to allow for high resolution inputs (448, 896, 1792 sq.px.)☆10Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆124Updated 2 weeks ago
- Integrate an LLM copilot within your Keras model development workflow☆28Updated last year
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆32Updated 2 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆122Updated last year
- Production-ready audio and video transcription app that can run on your laptop or in the cloud.☆72Updated 11 months ago
- ☆48Updated last year
- 🐤The next evolution of evolution.☆37Updated last month
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826☆52Updated last month
- This repository contains the TensorFlow implementation of the paper "AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT…☆29Updated last year
- ☆38Updated last year
- Implementation of MAXIM in TensorFlow.☆133Updated last year
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆100Updated last year
- Notebooks using the Neural Magic libraries 📓☆41Updated 3 months ago