visresearch / LLaVA-STFLinks
The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"
☆29Updated 6 months ago
Alternatives and similar repositories for LLaVA-STF
Users that are interested in LLaVA-STF are comparing it to the libraries listed below
Sorting:
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Updated 6 months ago
- ☆19Updated 7 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆53Updated 11 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 5 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- ☆86Updated last year
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆50Updated 3 weeks ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆111Updated 2 weeks ago
- Pivotal Token Search☆142Updated 2 weeks ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆85Updated 9 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆30Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 9 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆124Updated 5 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- ☆93Updated 2 months ago
- Data recipes and robust infrastructure for training AI agents☆75Updated this week
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Updated 10 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 4 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆37Updated last month
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 8 months ago
- ☆19Updated 10 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated 2 weeks ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Updated 2 months ago
- ☆144Updated 8 months ago
- Marketplace ML experiment - training without backprop☆27Updated 3 months ago