☆80Oct 17, 2024Updated last year
Alternatives and similar repositories for Dragonfly
Users that are interested in Dragonfly are comparing it to the libraries listed below
Sorting:
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆20Jan 11, 2026Updated last month
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆11Aug 11, 2025Updated 6 months ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 8 months ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]☆239Jan 3, 2026Updated last month
- a family of highly capabale yet efficient large multimodal models☆192Aug 23, 2024Updated last year
- ☆13Apr 30, 2024Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- Repository to reproduce the results of the paper "Holomorphic Equilibrium Propagation Computes Exact Gradients Through Finite Size Oscill…☆11Oct 20, 2024Updated last year
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents☆16Apr 4, 2024Updated last year
- A full-featured, hackable Next.js AI chatbot built by Vercel but running solely on a VPS, no outside APIs except for LLMs☆12Apr 16, 2024Updated last year
- ☆54Jan 17, 2025Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]☆22Oct 27, 2024Updated last year
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models☆13Dec 23, 2024Updated last year
- The SAIL blog☆13Updated this week
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Feb 8, 2024Updated 2 years ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆186Jul 5, 2024Updated last year
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆62Nov 7, 2024Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆86Oct 26, 2025Updated 4 months ago
- ☆33Nov 4, 2024Updated last year
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 10 months ago
- Improving Few-Shot Learning with Auxiliary Self-Supervised Pretext Tasks☆13Feb 24, 2021Updated 5 years ago
- Keyword search Arxiv and summarize papers with OpenAi's chatGPT.☆15May 12, 2023Updated 2 years ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models☆15Nov 1, 2024Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37May 14, 2024Updated last year
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆27May 24, 2025Updated 9 months ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".☆12Oct 11, 2024Updated last year
- ☆15Jul 24, 2022Updated 3 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago