xlang-ai / batch-prompting

[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.

☆76

Alternatives and similar repositories for batch-prompting

Users who are interested in batch-prompting are comparing it to the repositories listed below.

GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆80 Updated last year
RUCAIBox / BAMBOO
☆35 Updated last year
jzbjyb / ReAtt
Retrieval as Attention
☆82 Updated 2 years ago
sunyt32 / torchscale
Transformers at any scale
☆41 Updated last year
locuslab / scaling_laws_data_filtering
☆65 Updated last year
TIGER-AI-Lab / LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆108 Updated 8 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 Updated last year
princeton-pli / MeCo
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆44 Updated 3 months ago
allenai / bff
☆39 Updated last year
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆148 Updated 11 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56 Updated this week
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆39 Updated 11 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 Updated last year
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆62 Updated last year
EleutherAI / pile_dedupe
Pile Deduplication Code
☆19 Updated 2 years ago
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆31 Updated last year
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆62 Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆82 Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42 Updated last year
NormXU / Consistent-DynamicNTKRoPE
An Experiment on Dynamic NTK Scaling RoPE
☆64 Updated last year
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48 Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51 Updated 4 months ago
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆144 Updated 11 months ago
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆60 Updated last year
hadasah / btm
☆76 Updated last year
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94 Updated 8 months ago
csitfun / LogiCoT
The instructions and demonstrations for building a formal logical reasoning capable GLM
☆54 Updated last year
thunlp / Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
☆100 Updated last year
hkust-nlp / llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆142 Updated last year
thu-coai / PICL
Code for ACL2023 paper: Pre-Training to Learn in Context
☆107 Updated last year