OpenCSGs / Awesome-SLMs
survery of small language models
☆14Updated 8 months ago
Alternatives and similar repositories for Awesome-SLMs:
Users that are interested in Awesome-SLMs are comparing it to the libraries listed below
- Control LLM☆13Updated this week
- ☆20Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆19Updated 2 weeks ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆71Updated last week
- ☆15Updated 8 months ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆13Updated last month
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆21Updated 10 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆38Updated this week
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆16Updated 5 months ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated 10 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 4 months ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆14Updated 3 weeks ago
- ☆13Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆30Updated 10 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"☆20Updated last week
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated 11 months ago
- ☆16Updated 2 months ago
- ☆35Updated last month
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆14Updated 8 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆48Updated last month
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆28Updated 2 weeks ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last month
- Code for paper: Unified Text-to-Image Generation and Retrieval☆14Updated 8 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆12Updated 3 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- ☆36Updated 6 months ago