mayank31398/ladder-residual-inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mayank31398/ladder-residual-inference)

mayank31398 / ladder-residual-inference

☆14

Alternatives and similar repositories for ladder-residual-inference

Users that are interested in ladder-residual-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aaronwalsman / ltron-torch-eccv22
View on GitHub
☆12Jul 29, 2022Updated 4 years ago
open-lm-engine / lm-engine
View on GitHub
LM engine is a library for pretraining/finetuning LLMs
☆184Updated this week
jopetty / word-problem
View on GitHub
Experiments on the impact of depth in transformers and SSMs.
☆44Oct 23, 2025Updated 9 months ago
limenlp / SEA
View on GitHub
Official Implementation for the paper "Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base"
☆27Sep 2, 2025Updated 10 months ago
Dao-AILab / gemm-cublas
View on GitHub
☆22May 5, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
OliverSieberling / dynamic-conv1d
View on GitHub
Triton kernels for dynamic causal short convolutions.
☆24Jun 4, 2026Updated last month
cjyaras / monarch-attention
View on GitHub
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention (NeurIPS'25 Spotlight)
☆26Feb 22, 2026Updated 5 months ago
Nanami18 / Snowballed_Hallucination
View on GitHub
☆43Sep 3, 2024Updated last year
thomasahle / cce
View on GitHub
Clustered Compositional Embeddings
☆13Oct 25, 2023Updated 2 years ago
inria-thoth / csa
View on GitHub
Official Pytorch implementation of Chromatic Graph Transformers
☆10Jun 14, 2023Updated 3 years ago
brando90 / ultimate-anatome
View on GitHub
Ἀνατομή is a PyTorch library to analyze representation of neural networks
☆13Jan 31, 2024Updated 2 years ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
wtong98 / mlp-icl
View on GitHub
☆12Sep 16, 2024Updated last year
Sys-KU / DeepPlan
View on GitHub
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Aug 6, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nhatpd / iADMM
View on GitHub
iADMM for a low-rank representation optimization problem
☆13Feb 5, 2021Updated 5 years ago
eth-easl / fmengine
View on GitHub
Utilities for Training Very Large Models
☆58Sep 25, 2024Updated last year
google-deepmind / torch-oocairo
View on GitHub
Cairo lua bindings with extensions for torch
☆15Jun 12, 2016Updated 10 years ago
Bond1995 / Markov
View on GitHub
Code for experiments on transformers using Markovian data.
☆22Nov 22, 2024Updated last year
umich-sota / TF-as-SVM
View on GitHub
☆12Jan 17, 2024Updated 2 years ago
wesg52 / llm-context-neurons
View on GitHub
Find context neurons in Pythia models.
☆13Jun 13, 2023Updated 3 years ago
SamuelQZQ / StyleBank.Pytorch
View on GitHub
Deep Learning Model for Stylebank with Pytorch
☆10Nov 15, 2019Updated 6 years ago
james-oldfield / MxD
View on GitHub
[NeurIPS'25] Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
☆16May 28, 2025Updated last year
oseledets / nla2024
View on GitHub
Skoltech NLA 2024 course.
☆37Dec 10, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bigcode-project / bigcode-inference-benchmark
View on GitHub
☆19Aug 10, 2024Updated last year
canyilu / Least-Squares-Regression-for-subspace-clustering
View on GitHub
Least Squares Regression for subspace clustering
☆11May 27, 2018Updated 8 years ago
ssbuild / aigc_evals
View on GitHub
aigc evals
☆10Dec 2, 2023Updated 2 years ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
ledmaster / unified-embeddings
View on GitHub
Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems
☆15Nov 11, 2023Updated 2 years ago
borjanG / 2023-transformers
View on GitHub
Codes for the paper The emergence of clusters in self-attention dynamics.
☆17Dec 18, 2023Updated 2 years ago
a-rahimi / hessian
View on GitHub
The Hessian of tall-skinny networks is easy to invert
☆17Updated this week
fw-ai / llama-cuda-graph-example
View on GitHub
Example of applying CUDA graphs to LLaMA-v2
☆11Aug 25, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
alexzhang13 / Triton-Puzzles-Solutions
View on GitHub
Personal solutions to the Triton Puzzles
☆21Jul 18, 2024Updated 2 years ago
oh-my-ocr / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆10Nov 20, 2020Updated 5 years ago
LMCache / lmcache-agent-trace
View on GitHub
Agent application/benchmark/workload traces should be placed here.
☆15Apr 13, 2026Updated 3 months ago
johnmarktaylor91 / pytorch_feature_analysis
View on GitHub
☆12Mar 19, 2021Updated 5 years ago
perimosocordiae / graphs
View on GitHub
Graph-based learning in Python
☆17Mar 9, 2018Updated 8 years ago
FLAIROx / popjym
View on GitHub
POPGym Library in JAX
☆14Apr 15, 2024Updated 2 years ago
Anirudhaagrawal / mlsys-reading-list
View on GitHub
Impactful systems For ML papers.
☆12Aug 21, 2024Updated last year