FocoosAI / papersLinks
List of papers wrote by Focoos AI research team!
β12Updated 4 months ago
Alternatives and similar repositories for papers
Users that are interested in papers are comparing it to the libraries listed below
Sorting:
- π Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud βοΈ and edge π± deployment.β¦β343Updated this week
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).β431Updated this week
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"β330Updated 3 weeks ago
- β11Updated 4 years ago
- β31Updated last year
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learningβ247Updated 3 weeks ago
- β20Updated 6 months ago
- Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)β280Updated 2 weeks ago
- β19Updated 3 years ago
- Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentationβ18Updated 3 weeks ago
- Open source AI/ML capabilities for the FiftyOne ecosystemβ147Updated last month
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"β158Updated last year
- [CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.β180Updated last year
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentationβ118Updated 7 months ago
- [CVPR 2025 - Highlight] Official implementation of the paper "Realistic Test-Time Adaptation of Vision-Language Models" (StatA).β50Updated 5 months ago
- Official implementation of "Align and Distill: Unifying and Improving Domain Adaptive Object Detection" (TMLR 2025)β71Updated 4 months ago
- β14Updated 2 years ago
- An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRβ¦β255Updated 4 months ago
- Source code for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach (CVPR 2024)β25Updated 10 months ago
- Code for BiseNetV1β14Updated 3 years ago
- [NeurIPS 2024 - Spotlight] Transduction for Vision-Language Models (TransCLIP): code for the paper "Boosting Vision-Language Models with β¦β51Updated 6 months ago
- Official Implementation of "CAT-Segπ±: Cost Aggregation for Open-Vocabulary Semantic Segmentation"β339Updated last year
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"β82Updated 7 months ago
- Offical Code for TBSNet(AAAI 2024)β13Updated last year
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learningβ45Updated 3 months ago
- Official implementation of https://arxiv.org/abs/2106.03496β15Updated 3 years ago
- β66Updated last year
- A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..β737Updated last week
- β11Updated 11 months ago
- [CVPR 2025] This repository is the official implementation of "ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Languaβ¦β18Updated 6 months ago