TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness
☆118Apr 9, 2025Updated 10 months ago
Alternatives and similar repositories for tips
Users that are interested in tips are comparing it to the libraries listed below
Sorting:
- ☆29Jul 25, 2025Updated 7 months ago
- ☆10Jul 6, 2022Updated 3 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- ☆16Jun 30, 2025Updated 8 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆15Apr 3, 2025Updated 11 months ago
- Datapunt open panorama project☆14May 6, 2024Updated last year
- Official repository for BMVC 2022 paper: Global Proxy-based Hard Mining for Visual Place Recognition☆18Mar 7, 2023Updated 2 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆15Jan 13, 2026Updated last month
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Jul 5, 2025Updated 8 months ago
- Implementation for DIY-SC paper.☆23Jul 14, 2025Updated 7 months ago
- A framework for computational imaging of panoramic annular lens.☆15Sep 25, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- ☆17Apr 9, 2025Updated 10 months ago
- This repo contains VPR models that have been fine-tuned for indoor usage.☆16May 15, 2024Updated last year
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆82Nov 27, 2025Updated 3 months ago
- Unofficial third-party implementation of FFD (fast feature detector) published in IEEE TIP 2020.☆15Feb 8, 2022Updated 4 years ago
- Segment Anything (SAM) at Home web app using Gradio☆14Aug 7, 2023Updated 2 years ago
- ☆17Mar 5, 2025Updated last year
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆27Jul 21, 2025Updated 7 months ago
- Octic Vision Transformers: Quicker ViTs Through Equivariance☆20Oct 16, 2025Updated 4 months ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 2 years ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆24Jan 26, 2026Updated last month
- ☆16Jan 1, 2023Updated 3 years ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- 3D Traffic Light & Sign Dataset☆24Mar 24, 2025Updated 11 months ago
- Official repository of the paper "JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition"☆23Dec 15, 2023Updated 2 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆311Dec 21, 2025Updated 2 months ago
- Official PyTorch implementation of TokenSet.☆128Mar 21, 2025Updated 11 months ago
- Wonderful Matrices to Build Small Language Models☆44Feb 15, 2025Updated last year
- A scikit-learn compatible Python package for GPU-accelerated computation of the signature kernel using CuPy.☆51Jun 6, 2025Updated 9 months ago
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆54Jan 22, 2026Updated last month
- SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images (ECCV 2024)☆26May 20, 2025Updated 9 months ago
- Implementation for robust ViT and scaled attention☆21Apr 4, 2025Updated 11 months ago
- TCANet: A Temporal Convolutional Attention Network for Motor Imagery EEG Decoding☆47Jul 8, 2025Updated 7 months ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated 11 months ago