OpenCSGs / Awesome-SLMsLinks
survery of small language models
☆15Updated 10 months ago
Alternatives and similar repositories for Awesome-SLMs
Users that are interested in Awesome-SLMs are comparing it to the libraries listed below
Sorting:
- Control LLM☆14Updated 2 months ago
- ☆20Updated 7 months ago
- ☆16Updated 10 months ago
- ☆17Updated 5 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆21Updated 3 months ago
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆11Updated 3 weeks ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆23Updated 3 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 3 months ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆22Updated last year
- ☆40Updated 3 weeks ago
- Unsupervised GRPO☆24Updated this week
- ☆49Updated 3 weeks ago
- ☆15Updated last month
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated 11 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆27Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆43Updated 3 weeks ago
- ☆36Updated 9 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆19Updated 6 months ago
- ☆37Updated 2 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20Updated last week
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated last year
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 8 months ago
- ☆64Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated last year
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Updated 10 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11Updated last year