xlang-ai / batch-prompting
[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.
☆72Updated 10 months ago
Alternatives and similar repositories for batch-prompting:
Users that are interested in batch-prompting are comparing it to the libraries listed below
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆53Updated this week
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆58Updated 3 months ago
- ☆50Updated 2 months ago
- Retrieval as Attention☆83Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- ☆64Updated 9 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆66Updated 9 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 9 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 3 months ago
- ☆33Updated 9 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆97Updated 6 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆56Updated 3 months ago
- The Efficiency Spectrum of LLM☆52Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆77Updated 5 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 2 months ago
- Code for paper 'Data-Efficient FineTuning'☆29Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆56Updated 2 months ago
- List of papers on Self-Correction of LLMs.☆69Updated 3 weeks ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆129Updated 2 months ago
- ☆38Updated 9 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- PyTorch building blocks for OLMo☆47Updated this week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 4 months ago
- ☆78Updated 3 months ago
- ☆72Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year