xlang-ai / batch-prompting
[EMNLP 2023 Industry Track] A simple prompting approach that enables LLMs to run inference in batches.
☆74 Updated last year
Alternatives and similar repositories for batch-prompting
Users who are interested in batch-prompting are comparing it to the libraries listed below.
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆79 Updated last year
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆41 Updated 2 weeks ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆58 Updated last year
- Retrieval as Attention ☆83 Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated 2 years ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆81 Updated 11 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆55 Updated 2 weeks ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- Long Context Extension and Generalization in LLMs ☆57 Updated 9 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated last month
- ☆35 Updated last year
- Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings) ☆42 Updated last year
- Transformers at any scale ☆41 Updated last year
- Codebase for Instruction Following without Instruction Tuning ☆35 Updated 9 months ago
- ☆64 Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 Updated 10 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference ☆66 Updated 7 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ☆79 Updated 10 months ago
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text… ☆83 Updated 2 months ago
- ☆51 Updated last year
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification ☆41 Updated 2 years ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?" ☆38 Updated this week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆43 Updated last year
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆53 Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆29 Updated last year
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS … ☆59 Updated 9 months ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding ☆27 Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models" ☆100 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆145 Updated 8 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Updated last year