OpenCSGs / Awesome-SLMs
survery of small language models
☆14Updated 6 months ago
Alternatives and similar repositories for Awesome-SLMs:
Users that are interested in Awesome-SLMs are comparing it to the libraries listed below
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated 10 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆12Updated 2 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆14Updated 4 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆10Updated 3 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆33Updated 3 months ago
- ☆15Updated 7 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆37Updated last month
- ☆32Updated last month
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated 9 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆17Updated 4 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 8 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated 11 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated 10 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- ☆37Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆13Updated 7 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆26Updated 2 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 4 months ago
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation"☆11Updated last month
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆17Updated last week
- BESA is a differentiable weight pruning technique for large language models.☆14Updated 11 months ago
- ☆47Updated last year
- Code for T-MARS data filtering☆35Updated last year