visresearch / LLaVA-STFLinks
The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"
☆29Updated last month
Alternatives and similar repositories for LLaVA-STF
Users that are interested in LLaVA-STF are comparing it to the libraries listed below
Sorting:
- ☆19Updated 2 months ago
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆22Updated last week
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 3 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆50Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆71Updated 4 months ago
- Wonderful Matrices to Build Small Language Models☆44Updated 5 months ago
- ☆101Updated last month
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆41Updated 2 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Updated 8 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆12Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated 2 weeks ago
- ☆126Updated 2 months ago
- ☆16Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated last week
- ☆78Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 8 months ago
- Pivotal Token Search☆109Updated this week
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆13Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆81Updated 2 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆22Updated 7 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆38Updated 3 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆79Updated 3 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆100Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 5 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆49Updated last month
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 4 months ago
- XmodelLM☆39Updated 7 months ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆16Updated 4 months ago