apple / ml-vfm-kt
☆12Updated 4 months ago
Related projects
Alternatives and complementary repositories for ml-vfm-kt
- ☆58Updated 8 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆94Updated 5 months ago
- ☆11Updated 7 months ago
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆15Updated 4 months ago
- A lightweight implementation of the ICCV 2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆77Updated last year
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning☆15Updated last year
- ☆12Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆18Updated 3 months ago
- ☆38Updated 3 months ago
- ☆69Updated 10 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆26Updated 6 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆36Updated last month
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…☆60Updated 3 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind☆53Updated 2 months ago
- Timm model explorer☆36Updated 7 months ago
- Official PyTorch Implementation of Self-emerging Token Labeling☆30Updated 7 months ago
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆53Updated last year
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆45Updated last month
- ☆30Updated this week
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆47Updated this week
- Notebooks for fine-tuning PaliGemma☆41Updated 3 months ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", accepted at ECCV 2022. Her…☆29Updated last year
- Recaption large (Web)Datasets with vLLM and save the artifacts.☆30Updated last month
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆67Updated last year
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆19Updated last year
- Load any CLIP model with a standardized interface☆21Updated 6 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆24Updated last month
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year