akhilkedia/TranformersGetStable

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/akhilkedia/TranformersGetStable)

akhilkedia / TranformersGetStable

[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"

☆11

Alternatives and similar repositories for TranformersGetStable

Users that are interested in TranformersGetStable are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
Anonymous1252022 / Megatron-DeepSpeed
View on GitHub
☆18Sep 22, 2024Updated last year
Nordskog / ClassHunter
View on GitHub
Android library for recognizing java classes and methods based on their signatures
☆10Mar 18, 2020Updated 6 years ago
phuocphn / uniq
View on GitHub
Pytorch implementation of our UniQ method, IEEE Access -- Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric …
☆11Apr 7, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tim-lawson / skip-middle
View on GitHub
Learning to Skip the Middle Layers of Transformers
☆17Aug 7, 2025Updated 11 months ago
hobinkwak / ExpectedGradients_IntegratedGradients_pytorch
View on GitHub
simple implementation of Expected Gradients and Integrated Gradients by pytorch
☆12May 11, 2022Updated 4 years ago
rezashkv / diffusion_pruning
View on GitHub
[ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.
☆15Feb 1, 2025Updated last year
akhilkedia / Android-Auto-Localization_Translate-Strings.XML
View on GitHub
Simple strings.xml files generator for android project using bing translator. It requires microsoft bing translation api key.
☆11Dec 29, 2021Updated 4 years ago
Xinsen-Zhang / lstm-chinese
View on GitHub
利用 LSTM 进行中文的文本生成. PyTorch implement
☆14Apr 30, 2019Updated 7 years ago
xwhan / pylucene-bm25
View on GitHub
Lucene open-domain QA retrieval in python
☆11Feb 18, 2021Updated 5 years ago
snap-research / AVLink
View on GitHub
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
☆17Aug 3, 2025Updated 11 months ago
EIFY / mup-vit
View on GitHub
Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more
☆12Updated this week
hannamw / gpt2-greater-than
View on GitHub
Code Release for the 2023 NeurIPS Paper How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained langua…
☆17Dec 6, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
INFINIQ-AI1 / CLIPVQDiffusion
View on GitHub
official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…
☆19Sep 5, 2024Updated last year
Qichuzyy / POA
View on GitHub
Official implementation of ECCV24 paper: POA
☆24Aug 8, 2024Updated last year
CookiePPP / podcast_rss_feeds
View on GitHub
List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.
☆31Apr 13, 2023Updated 3 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
HanbaekLyu / NNetwork
View on GitHub
Custom graph/network/multi-weighted network class based on storing list of neighbors for each nodes (as opposed to edge list) for scalabl…
☆11Jan 18, 2024Updated 2 years ago
hiranumn / IntegratedGradientsTF
View on GitHub
Tensorflow implementation of integrated gradients presented in "Axiomatic Attribution for Deep Networks". It explains connections between…
☆17Mar 11, 2019Updated 7 years ago
aiha-lab / MX-QLLM
View on GitHub
LLM Inference with Microscaling Format
☆35Nov 12, 2024Updated last year
Cc-bugwriter / CER
View on GitHub
Homework of CER
☆11Apr 12, 2021Updated 5 years ago
TianjinYellow / StableSPAM
View on GitHub
☆28Jul 2, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shaoweinuaa / HM2TFS
View on GitHub
Hyper-graph based Multi-task Feature Selection for Multi-modal Classification of Alzheimer's Disease
☆12Jun 10, 2019Updated 7 years ago
sp-uhh / gen-se-demo
View on GitHub
Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization
☆14Dec 21, 2024Updated last year
Xingyu-Zheng / BiDM
View on GitHub
(NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models
☆22Jul 16, 2026Updated last week
tianyi-lab / DisCL
View on GitHub
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18Sep 26, 2025Updated 9 months ago
xiaohan2012 / signed-local-community
View on GitHub
[WebConf 2020] Searching for polarization in signed graphs: a local spectral approach
☆10Feb 3, 2024Updated 2 years ago
bethelmelesse / UnifiedCrawl
View on GitHub
☆17Nov 26, 2024Updated last year
K-Mitsuno / hierarchical-group-sparse-regularization
View on GitHub
Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks
☆11Apr 13, 2020Updated 6 years ago
zqu1992 / ALQ
View on GitHub
☆14Oct 24, 2022Updated 3 years ago
takahirom / NestedListView
View on GitHub
☆20Dec 26, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mikel-brostrom / YooChoose_Pytorch_Geometric
View on GitHub
YooChoose challenge with GCN in Pytorch Geometric
☆13May 8, 2020Updated 6 years ago
bisakha / Connectomics
View on GitHub
☆10May 9, 2017Updated 9 years ago
CAMP-eXplain-AI / PathwayGrad
View on GitHub
CVPR 2021 | Code to reproduce the results of the paper: A Khakzar, S Baselizadeh, S Khanduja, C Rupprecht, ST Kim, N Navab, Neural Respon…
☆12Jun 23, 2021Updated 5 years ago
Roblox / SmoothCache
View on GitHub
Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48Jul 17, 2025Updated last year
wangruinju / Double-Fusion
View on GitHub
A bayesian approach to examining default mode network functional connectivity and cognitive performance in major depressive disorder
☆13Aug 23, 2019Updated 6 years ago
andrewk1 / correctandsmooth
View on GitHub
Simple correct&smooth implementation in PyTorch.
☆13Nov 8, 2022Updated 3 years ago
TianjinYellow / SPAM-Optimizer
View on GitHub
☆36Mar 12, 2025Updated last year