tim-lawson/skip-middle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tim-lawson/skip-middle)

tim-lawson / skip-middle

Learning to Skip the Middle Layers of Transformers

☆17

Alternatives and similar repositories for skip-middle

Users that are interested in skip-middle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
HEmile / a-nesi
View on GitHub
A Scalable Approximate Method for Probabilistic Neurosymbolic Inference
☆25Jan 27, 2025Updated last year
adrianjav / causal-flows
View on GitHub
CausalFlows: A library for Causal Normalizing Flows in Pytorch
☆33Apr 30, 2025Updated last year
KaiLv69 / DuoDecoding
View on GitHub
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
☆19Mar 4, 2025Updated last year
FrankYang-17 / Mavors
View on GitHub
☆16May 30, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated 2 years ago
Guinan-Su / auto-merge-llm
View on GitHub
An official repository for GPTailor
☆18Jun 29, 2025Updated last year
Leosang-lx / FlowSpec
View on GitHub
Continuous Pipelined Speculative Decoding
☆21May 25, 2026Updated last month
KevinLee1110 / dynamic-batching
View on GitHub
The official repo for the paper "Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching"
☆18Mar 17, 2025Updated last year
thu-ml / Adaptive-Sparse-Trainer
View on GitHub
Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)
☆19Jul 1, 2025Updated last year
allenai / understanding_mcqa
View on GitHub
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
☆15Aug 2, 2025Updated 11 months ago
TianjinYellow / SPAM-Optimizer
View on GitHub
☆36Mar 12, 2025Updated last year
LiuXiaoxuanPKU / Cost-Model-papers
View on GitHub
☆13Feb 22, 2023Updated 3 years ago
RnMor777 / x86-Assembly-Chess
View on GitHub
Fully working chess game implemented in the x86 Intel Assembly language
☆12Oct 3, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Anonymous1252022 / Megatron-DeepSpeed
View on GitHub
☆18Sep 22, 2024Updated last year
UNITES-Lab / C2R-MoE
View on GitHub
[NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…
☆16Feb 4, 2025Updated last year
trappmartin / AdvancedProbabilisticCircuits.jl
View on GitHub
Probabilistic Circuits in Julia
☆10Dec 27, 2023Updated 2 years ago
w4nderlust / cppn-tensorflow
View on GitHub
Very Simple and Basic Implementation of Compositional Pattern Producing Network in TensorFlow
☆11Nov 27, 2019Updated 6 years ago
chenyiqun / Agentic-RAG
View on GitHub
This is the code of a agentic rag method with dynamic workflow.
☆14Jan 22, 2026Updated 6 months ago
CubasMike / plagiarism_detection_pan2015
View on GitHub
Plagiarism Detection Approach for PAN 2015 Text Alignment task
☆11May 11, 2018Updated 8 years ago
r1fl / 8086-Assembly-Chess
View on GitHub
8086 Assembly Chess
☆11Feb 11, 2019Updated 7 years ago
christopher-beckham / annotated-conditional-diffusion
View on GitHub
☆10Aug 26, 2022Updated 3 years ago
lindermanlab / hackathons
View on GitHub
Jupyter notebooks from our weekly (or so) hackathons
☆11Dec 3, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
kyriemao / ChatRetriever
View on GitHub
☆13Apr 18, 2024Updated 2 years ago
dmis-lab / Outlier-Safe-Pre-Training
View on GitHub
[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models
☆39Nov 4, 2025Updated 8 months ago
wenge-research / CRE-SFT
View on GitHub
A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)
☆11May 8, 2025Updated last year
cambridge-mlg / SPVAE
View on GitHub
Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19)
☆12Jun 7, 2020Updated 6 years ago
cfmata / CoPT
View on GitHub
[ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings
☆10Feb 24, 2025Updated last year
cat538 / MxMoE
View on GitHub
[ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
☆30Jul 4, 2025Updated last year
iQua / llmpebase
View on GitHub
This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).
☆18Jan 16, 2025Updated last year
pevnak / SumProductTransform.jl
View on GitHub
An experimental implementation of sum-product networks with dense unitary transformations in leaves
☆13Sep 8, 2022Updated 3 years ago
ruipeterpan / failfast
View on GitHub
Artifact for "Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs" [arXiv '25]
☆20May 4, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fKunstner / noise-sgd-adam-sign
View on GitHub
☆16Apr 26, 2023Updated 3 years ago
xiaofeng1990 / tensorrt-tutorial
View on GitHub
tensorrt部署教程
☆11Aug 1, 2025Updated 11 months ago
webis-de / set-encoder
View on GitHub
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19May 23, 2025Updated last year
xinmei9322 / semicrowd
View on GitHub
Code for Semi-crowdsourced Clustering with Deep Generative Models
☆12Dec 9, 2022Updated 3 years ago
ML-KULeuven / klay
View on GitHub
Sparse Circuits on the GPU (ICLR2025)
☆26Jun 18, 2026Updated last month
LINs-lab / ERW
View on GitHub
[Preprint] Efficient Generative Model Training via Embedded Representation Warmup
☆36Oct 15, 2025Updated 9 months ago
aboustati / vargrad
View on GitHub
Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference
☆12Oct 12, 2020Updated 5 years ago