jiaconghu / Transformer-DoctorView external linksLinks
Transformer Doctor: Diagnosing and Treating Vision Transformers
☆11Jan 15, 2025Updated last year
Alternatives and similar repositories for Transformer-Doctor
Users that are interested in Transformer-Doctor are comparing it to the libraries listed below
Sorting:
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago
- A Survey of Direct Preference Optimization (DPO)☆91Jul 4, 2025Updated 7 months ago
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated 2 weeks ago
- [ICLR 2024] Dynamic Neural Response Tuning☆16Nov 26, 2025Updated 2 months ago
- A lightweight and extensible toolbox for image classification and MORE☆19Dec 30, 2025Updated last month
- ☆32Oct 4, 2025Updated 4 months ago
- [IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-…☆24Jun 2, 2024Updated last year
- [TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning☆33May 17, 2024Updated last year
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆13Jul 28, 2024Updated last year
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆365Oct 22, 2025Updated 3 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆203Feb 6, 2026Updated last week
- [Pattern Recognition, 2020] Covariance Descriptors on a Gaussian Manifold and their Application to Image Set Classification☆12May 28, 2022Updated 3 years ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- ☆17Feb 3, 2026Updated last week
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆21Jul 17, 2025Updated 6 months ago
- ManifoldNet Paper Implementation for SPD(n)☆11Nov 10, 2021Updated 4 years ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆25Apr 27, 2025Updated 9 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- The dataset and codes of the paper UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-Modal Learning.☆16Sep 21, 2025Updated 4 months ago
- TiC: Exploring Vision Transformer in Convolution☆11Oct 24, 2023Updated 2 years ago
- Coming soon~☆11Jul 15, 2025Updated 6 months ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- [AAAI 2026] Official repository of the EMAformer paper: "EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecas…☆34Dec 3, 2025Updated 2 months ago
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆13Dec 3, 2024Updated last year
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 4 months ago
- The official implementation for SETA (TIP 2024).☆11Feb 17, 2025Updated 11 months ago
- This repository contains the python scripts developed as a part of the work presented in the paper "Low-latency auditory spatial attentio…☆10Sep 15, 2021Updated 4 years ago
- A PyTorch native platform for training generative AI models☆15Nov 18, 2025Updated 2 months ago
- ☆14Apr 29, 2025Updated 9 months ago
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- ☆36Jan 13, 2026Updated last month
- ☆20Oct 22, 2025Updated 3 months ago
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- This repository contains the python scripts developed as a part of the work presented in the paper "STAnet: A Spatiotemporal Attention Ne…☆15May 10, 2023Updated 2 years ago
- An annotated transformer.☆13Jul 11, 2021Updated 4 years ago
- [JBHI 2024] Self-supervised pre-training on ECG collected in the wild☆15Nov 14, 2023Updated 2 years ago
- 这项目主要收集大规模GNN(图神经网络)的相关研究☆10May 26, 2020Updated 5 years ago