Adaxry/Unified_Layer_Skipping

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Adaxry/Unified_Layer_Skipping)

Adaxry / Unified_Layer_Skipping

☆15

Alternatives and similar repositories for Unified_Layer_Skipping

Users that are interested in Unified_Layer_Skipping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ASISys / AdaSkip
View on GitHub
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference
☆21Jan 24, 2025Updated last year
Adaxry / Post-Instruction
View on GitHub
☆21Sep 5, 2023Updated 2 years ago
Adaxry / ss_on_decoding_steps.
View on GitHub
codes for "Scheduled Sampling Based on Decoding Steps for Neural Machine Translation" (long paper of EMNLP-2022)
☆20Aug 31, 2021Updated 4 years ago
xydaytoy / BMI-NMT
View on GitHub
☆11Jul 28, 2021Updated 5 years ago
DataStates / datastates-llm
View on GitHub
LLM checkpointing for DeepSpeed/Megatron
☆26Nov 30, 2025Updated 7 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
SusCom-Lab / ZSMerge
View on GitHub
☆23Sep 24, 2025Updated 10 months ago
pppa2019 / swie_overmiss_llm4mt
View on GitHub
Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"
☆12Aug 26, 2023Updated 2 years ago
WeixiangXu / STTN
View on GitHub
☆17Oct 25, 2022Updated 3 years ago
wmt-conference / wmt23-news-systems
View on GitHub
☆14Oct 6, 2025Updated 9 months ago
Mahmoud9876 / locationProblem
View on GitHub
The Multi-Capacity and Multi-Level Localization Project tackles the complex problem of finding optimal locations for elements such as fac…
☆13Aug 19, 2025Updated 11 months ago
afzaalis / tubes-makepal-cicd
View on GitHub
☆15Aug 2, 2025Updated 11 months ago
Equationliu / Kangaroo
View on GitHub
[NeurIPS 2024] The official implementation of "Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exitin…
☆72Jun 26, 2024Updated 2 years ago
selkerdawy / FTWT
View on GitHub
Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
☆10May 25, 2022Updated 4 years ago
lt2000 / MinFlow
View on GitHub
☆12Jan 12, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Adaxry / get_aligned_BERT_emb
View on GitHub
Get the aligned BERT embedding for sequence labeling tasks
☆18Jun 6, 2019Updated 7 years ago
4kangjc / flexy
View on GitHub
a high performance server framework
☆12Dec 11, 2022Updated 3 years ago
mutonix / pyramidinfer
View on GitHub
☆47Nov 25, 2024Updated last year
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
Open-Galapagos / evolution-fine-tuning
View on GitHub
Official code, models, and dataset for "Evolution Fine-Tuning (EFT): Learning to Discover Across 371 Optimization Tasks"
☆25Jun 30, 2026Updated 3 weeks ago
Mixture-AI / Mixture-of-Depths
View on GitHub
Google DeepMind: Mixture of Depths Unofficial Implementation.
☆12May 29, 2024Updated 2 years ago
eddiegaoo / Apt-Serve
View on GitHub
☆21Jun 9, 2025Updated last year
luoyesiqiu / LibInject
View on GitHub
DLL注入工具
☆13Nov 9, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
dongwonjo / FastKV
View on GitHub
[ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…
☆32Apr 14, 2026Updated 3 months ago
facebookresearch / SIEVE
View on GitHub
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)
☆21Apr 28, 2024Updated 2 years ago
caoshiyi / artifacts
View on GitHub
☆40Nov 28, 2024Updated last year
scholltan / pytorch-playground----fixed-point-quantized
View on GitHub
Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeN…
☆13Jan 7, 2018Updated 8 years ago
deib-polimi / neptune
View on GitHub
Network- and GPU-aware management of serverless functions at the edge
☆15Mar 3, 2023Updated 3 years ago
rutaabali3 / BridgeX
View on GitHub
Aptech's E project. BridgeX is a web based platform showcasing the world's most remarkable bridges, their engineering marvels, and histor…
☆16Apr 15, 2026Updated 3 months ago
CristianCosci / Ant_Colony_Optimization_for_OSSP
View on GitHub
Open Shop Scheduling Problem resolution via Ant Colony Optimization algorithm.
☆14Mar 28, 2023Updated 3 years ago
eva-cam / EvaCAM
View on GitHub
☆14Apr 6, 2025Updated last year
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Ther-nullptr / Awesome-Transformer-Accleration
View on GitHub
Paper list for accleration of transformers
☆14Jul 1, 2023Updated 3 years ago
Scientific-Computing-Lab / STREAMer
View on GitHub
STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth
☆18Aug 21, 2023Updated 2 years ago
kaiyuhwang / MLLM-Survey
View on GitHub
The paper list of multilingual pre-trained models (Continual Updated).
☆25Jun 18, 2024Updated 2 years ago
facebookresearch / spartan
View on GitHub
Spartan is an algorithm for training sparse neural network models. This repository accompanies the paper "Spartan Differentiable Sparsity…
☆26Oct 31, 2022Updated 3 years ago
pmem / pmem.github.io
View on GitHub
The pmem.io Website
☆17Jan 20, 2026Updated 6 months ago
da-steve101 / twn_generator
View on GitHub
Generate an FPGA design for a TWN
☆11Nov 4, 2019Updated 6 years ago
HaiBooLang / TryOpenGL
View on GitHub
OpenGL 学习代码
☆15Jun 25, 2023Updated 3 years ago