GeeeekExplorer/transformers-patch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GeeeekExplorer/transformers-patch)

GeeeekExplorer / transformers-patch

patches for huggingface transformers to save memory

☆36

Alternatives and similar repositories for transformers-patch

Users that are interested in transformers-patch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
lliu606 / COSMOS
View on GitHub
☆20Feb 2, 2026Updated 5 months ago
li-plus / flash-preference
View on GitHub
Accelerate LLM preference tuning via prefix sharing with a single line of code
☆52Jul 4, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AXERA-TECH / Qwen2.5-VL-3B-Instruct.axera
View on GitHub
Demo for Qwen2.5-VL-3B-Instruct on Axera device.
☆16Sep 3, 2025Updated 10 months ago
FlexFusion / FlexFusion
View on GitHub
The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221
☆31Apr 22, 2025Updated last year
GeeeekExplorer / cupytorch
View on GitHub
A small framework mimics PyTorch using CuPy or NumPy
☆57Feb 16, 2022Updated 4 years ago
BaichuanSEED / BaichuanSEED.github.io
View on GitHub
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…
☆18Aug 28, 2024Updated last year
pku-minic / next-gen-ir-proposal
View on GitHub
Proposal for the next generation of course-oriented IR.
☆10Dec 24, 2021Updated 4 years ago
Dao-AILab / gemm-cublas
View on GitHub
☆22May 5, 2025Updated last year
AstorYH / PASB
View on GitHub
An end-to-end security evaluation framework tailored for real-world personalized agent.
☆15Feb 28, 2026Updated 4 months ago
promptfoo / crabcode
View on GitHub
Generic tmux-based workspace manager for multi-repo development. Lightning-fast dev productivity tool.
☆19Updated this week
Shawn-Guo-CN / Lossless_Text_Compression_with_Transformer
View on GitHub
This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.
☆14May 2, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Adlik / smoothquantplus
View on GitHub
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆23Mar 15, 2024Updated 2 years ago
jack-willturner / gymnastics
View on GitHub
A "gym" style toolkit for building lightweight NAS systems.
☆13Jun 13, 2022Updated 4 years ago
Oneflow-Inc / dfccl
View on GitHub
☆26Feb 17, 2025Updated last year
TheoViel / kaggle_contrails
View on GitHub
2nd Place Solution for the Google Research - Identify Contrails to Reduce Global Warming Competition
☆14Aug 15, 2023Updated 2 years ago
antgroup / DeepXTrace
View on GitHub
DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.
☆100Jan 16, 2026Updated 6 months ago
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
tigert1998 / tair-last-jedi
View on GitHub
阿里云第二届数据库大赛新手门槛队（季军）解决方案
☆10Apr 19, 2021Updated 5 years ago
chaojin0310 / Ditto
View on GitHub
Artifacts for our SIGCOMM'23 paper Ditto
☆15Oct 17, 2023Updated 2 years ago
jiegec / naiverouter
View on GitHub
A router IP written in Verilog.
☆12Dec 20, 2019Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Hacking-pch / Generador-CCS
View on GitHub
☆12Mar 25, 2020Updated 6 years ago
Infrasys-AI / aiinfra-docs
View on GitHub
☆21Nov 6, 2025Updated 8 months ago
Harry-Chen / TrivialTomasulo
View on GitHub
Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University
☆12Jun 9, 2019Updated 7 years ago
wkentaro / logboard
View on GitHub
logboard: Monitor and Compare Logs on Browser/Terminal.
☆21Sep 19, 2019Updated 6 years ago
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
PKUFlyingPig / pku-os-course
View on GitHub
Course website for Operating System course in Peking University.
☆14Nov 28, 2021Updated 4 years ago
tile-ai / tilelang-puzzles
View on GitHub
Learning TileLang with 10 puzzles!
☆338May 28, 2026Updated last month
princeton-pli / LongProc
View on GitHub
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆36Feb 26, 2026Updated 4 months ago
kuangliu / kitti-utils
View on GitHub
KITTI coordinate transition & visualization tool
☆11Jun 26, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆16Aug 19, 2024Updated last year
madsys-dev / deepseekv2-profile
View on GitHub
☆156Mar 4, 2025Updated last year
ZenithalHourlyRate / naming
View on GitHub
☆11Apr 29, 2024Updated 2 years ago
wangrunji0408 / rjrouter
View on GitHub
[AFK] Hardware router in Chisel (THU Network Joint Lab 2020)
☆14Oct 8, 2020Updated 5 years ago
alkaidpku / DQ-ToolQA
View on GitHub
☆10Nov 15, 2023Updated 2 years ago
avik-das / seam-carver
View on GitHub
Implementation of the content-aware image resizing algorithm presented in the paper "Seam carving for content-aware image resizing"
☆13Jul 22, 2019Updated 6 years ago
VimalWill / Vstream
View on GitHub
Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)
☆10Feb 2, 2024Updated 2 years ago