FocoosAI / focoosLinks
π Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud βοΈ and edge π± deployment.
β331Updated last week
Alternatives and similar repositories for focoos
Users that are interested in focoos are comparing it to the libraries listed below
Sorting:
- Open source AI/ML capabilities for the FiftyOne ecosystemβ141Updated last month
- Improving Semantic Correspondences with Viewpoint-Guided Spherical Maps (CVPR 2024)β20Updated 6 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long β¦β89Updated last year
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Modelsβ314Updated 11 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024β38Updated 6 months ago
- β16Updated 2 months ago
- This is a repository that implements the Dense NN Retrieval Evaluation used for evaluating the In-Context Learning Capabilities of Visionβ¦β20Updated last month
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).β185Updated this week
- [WACV 2024] Learning the What and How of Annotation in Video Object Segmentationβ26Updated last year
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuningβ282Updated 4 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"β269Updated 3 weeks ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.β249Updated 8 months ago
- This is the official code release for our work, Denoising Vision Transformers.β368Updated 7 months ago
- A conference poster format with structure, content, creation, and presentation recommendations.β67Updated 4 months ago
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"β320Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2β¦β21Updated last year
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paperβ88Updated last month
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024β66Updated last year
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.β19Updated last year
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"β59Updated 4 months ago
- β83Updated 2 months ago
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"β66Updated 3 weeks ago
- π Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-pβ¦β111Updated 7 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]β50Updated 5 months ago
- Dino V2 for Classification, PCA Visualization, Instance Retrival: https://arxiv.org/abs/2304.07193β190Updated last year
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23β27Updated 5 months ago
- Code for "Donβt drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlightβ53Updated 3 months ago
- "Near, far: Patch-ordering enhances vision foundation models' scene understanding": A New SSL Post-Training Approach for Improving DINOv2β¦β24Updated 2 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"β156Updated 2 weeks ago
- QT-DOG: QUANTIZATION-AWARE TRAINING FOR DOMAIN GENERALIZATIONβ19Updated 8 months ago