OpenCSGs / Awesome-SLMs
Survey of small language models
☆14 Updated 5 months ago
Alternatives and similar repositories for Awesome-SLMs:
Users who are interested in Awesome-SLMs are comparing it to the libraries listed below
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆40 Updated 9 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs" ☆12 Updated 5 months ago
- ☆15 Updated 5 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability ☆36 Updated 2 months ago
- Representing Rule-based Chatbots with Transformers ☆19 Updated 6 months ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 Updated 10 months ago
- ☆27 Updated last month
- ☆19 Updated 2 months ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St… ☆19 Updated 8 months ago
- ☆36 Updated 4 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model ☆17 Updated 9 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation ☆10 Updated 2 months ago
- ☆29 Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆29 Updated 7 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models ☆28 Updated 3 months ago
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… ☆11 Updated last month
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆22 Updated 10 months ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning" ☆29 Updated 9 months ago
- ☆13 Updated 2 months ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL) ☆12 Updated 7 months ago
- ☆13 Updated last year
- ☆64 Updated 9 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models" ☆14 Updated 3 months ago
- BESA is a differentiable weight pruning technique for large language models. ☆14 Updated 10 months ago
- The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model" ☆22 Updated this week
- Efficient Mixture of Experts for LLM Paper List ☆26 Updated last month
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" ☆10 Updated 2 weeks ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*