imagination-research / sotLinks

[ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

☆182

Alternatives and similar repositories for sot

Users that are interested in sot are comparing it to the libraries listed below

Sorting:

jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆118Updated last year
withmartian / routerbench
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
☆151Updated last year
hao-ai-lab / Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆209Updated 6 months ago
FasterDecoding / BitDelta
☆204Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆93Updated last year
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆262Updated last week
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆141Updated 2 years ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆169Updated last year
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆243Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆107Updated last year
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆117Updated last month
diagram-of-thought / diagram-of-thought
Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)
☆188Updated 3 months ago
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆315Updated last year
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆226Updated 2 months ago
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆205Updated last year
microsoft / FILM
Official repo for "Make Your LLM Fully Utilize the Context"
☆261Updated last year
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆147Updated last year
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆246Updated last year
wdlctc / mini-s
☆53Updated last year
facebookresearch / LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
☆349Updated 7 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆189Updated 8 months ago
dust-tt / llama-ssp
Experiments on speculative sampling with Llama models
☆127Updated 2 years ago
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆102Updated 4 months ago
UbiquitousLearning / SLM_Survey
☆100Updated last year
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆93Updated last year
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆202Updated last year
swj0419 / detect-pretrain-code-contamination
☆78Updated last year
astramind-ai / Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆175Updated last year
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆149Updated 10 months ago
SalesforceAIResearch / LaTRO
☆124Updated 9 months ago