zhuzilin / vllm-groupLinks
☆12Updated 7 months ago
Alternatives and similar repositories for vllm-group
Users that are interested in vllm-group are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆31Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 6 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 5 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆21Updated 9 months ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Updated last year
- Codebase for Instruction Following without Instruction Tuning☆34Updated 8 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated 2 weeks ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free☆40Updated 3 weeks ago
- ☆22Updated last year
- ☆35Updated last year
- The code and data for the paper JiuZhang3.0☆45Updated last year
- Code for "RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆20Updated 2 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated 10 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆57Updated 2 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆40Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆23Updated 10 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated 11 months ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆30Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆35Updated 9 months ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Updated last year
- ☆50Updated last year
- ☆24Updated 2 years ago
- Revisiting Mid-training in the Era of RL Scaling☆48Updated last month
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last month
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Updated 7 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆65Updated 2 years ago