Zcchill/Value-Residual-Learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zcchill/Value-Residual-Learning)

Zcchill / Value-Residual-Learning

☆15

Alternatives and similar repositories for Value-Residual-Learning

Users that are interested in Value-Residual-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zzjchen / Tailored-Visions
View on GitHub
☆23Sep 4, 2025Updated 10 months ago
rhubarbwu / linguistic-collapse
View on GitHub
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]
☆19Apr 14, 2025Updated last year
rhubarbwu / neural-collapse
View on GitHub
Generic library for neural collapse and several derivative works on the phenomenon.
☆18Apr 14, 2025Updated last year
LINs-lab / ELICIT
View on GitHub
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆14Mar 11, 2025Updated last year
jvlmdr / omniglot-eval
View on GitHub
☆14Dec 11, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
zwhe99 / RaSA
View on GitHub
[ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation
☆10May 19, 2025Updated last year
JJBT / RevO
View on GitHub
Few-Shot Object Detection with Transformer
☆10Jan 3, 2024Updated 2 years ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
ivanenko / ydisk_commander
View on GitHub
Yandex Disk wfx plugin for Total/Double commander
☆16Mar 15, 2019Updated 7 years ago
RobertCsordas / linear_layer_as_attention
View on GitHub
The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …
☆16Jun 11, 2025Updated last year
thunlp / SparsingLaw
View on GitHub
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32Nov 12, 2024Updated last year
maximzubkov / fft-scan
View on GitHub
Efficient PScan implementation in PyTorch
☆17Jan 2, 2024Updated 2 years ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆41Nov 11, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
qiuzh20 / RMoE
View on GitHub
Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)
☆33Aug 4, 2024Updated last year
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆20Updated this week
yanhong-lbh / text_or_pixels
View on GitHub
Codebase for EMNLP 2025 Findings paper "Text or Pixels? Evaluating Efficiency and Understanding of LLMs with Visual Text Inputs"
☆19Nov 14, 2025Updated 8 months ago
ericjang / RNN-dynamics
View on GitHub
Code and report for APMA136 Final Project
☆19May 6, 2015Updated 11 years ago
gmongaras / Cottention_Transformer
View on GitHub
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20Nov 15, 2025Updated 8 months ago
xyhan-github / neuralcollapse
View on GitHub
Code reproducing Neural Collapse phenomenon on MSE and cross-entropy loss
☆13Feb 8, 2022Updated 4 years ago
RobertCsordas / moe_layer
View on GitHub
sigma-MoE layer
☆21Jan 5, 2024Updated 2 years ago
salesforce / simplification
View on GitHub
☆23Jun 25, 2026Updated 3 weeks ago
gccnlp / Light-PEFT
View on GitHub
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AndriiShramko / Shramko-GoPro13-Control-App-for-multicamera-4dgs-3dgs-rigs
View on GitHub
Shramko GoPro13 Control App for multicamera 4dgs 3dgs rigs
☆19Jul 13, 2026Updated last week
jxiw / MambaByte
View on GitHub
[CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model
☆27Oct 12, 2024Updated last year
qhfan / RALA
View on GitHub
[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention
☆44Mar 11, 2025Updated last year
TianheL / LM-Implicit-Reasoning
View on GitHub
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆18Mar 11, 2025Updated last year
RobertCsordas / ndr
View on GitHub
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆34Jun 11, 2025Updated last year
Oliverbansk / Holistic-Robot-Pose-Estimation
View on GitHub
[ECCV 2024] PyTorch implementation of "Real-time Holistic Robot Pose Estimation with Unknown States"
☆40Jan 6, 2025Updated last year
adihaviv / nopos
View on GitHub
☆23Jul 27, 2023Updated 2 years ago
MasterVito / SvS
View on GitHub
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54Dec 13, 2025Updated 7 months ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jkminder / SALSA-CLRS
View on GitHub
Implementation of "SALSA-CLRS: A Sparse and Scalable Benchmark for Algorithmic Reasoning". SALSA-CLRS is an extension to the original clr…
☆22Nov 21, 2023Updated 2 years ago
tianyi-lab / MoE-Embedding
View on GitHub
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆92Oct 15, 2024Updated last year
Tobotis / fourier-guide
View on GitHub
An interactive guide to the Fourier-Transform.
☆27Nov 17, 2024Updated last year
google-deepmind / spectral_ssm
View on GitHub
☆35Apr 12, 2024Updated 2 years ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
LAMDA-NeSy / Self-Backtracking
View on GitHub
☆52Feb 12, 2025Updated last year
EvanZhuang / vector-icl
View on GitHub
Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)
☆24Jun 2, 2025Updated last year