Understanding the Difficulty of Training Transformers
☆47Oct 30, 2022Updated 3 years ago
Alternatives and similar repositories for admin-torch
Users that are interested in admin-torch are comparing it to the libraries listed below
- An adaptive training algorithm for residual network☆17Aug 22, 2020Updated 5 years ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 3 years ago
- Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21)☆10Jan 19, 2022Updated 4 years ago
- ☆11Apr 14, 2022Updated 3 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆21Aug 11, 2025Updated 6 months ago
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 4 months ago
- [TMLR 2024] Revisiting Random Weight Perturbation for Efficiently Improving Generalization☆12Oct 18, 2024Updated last year
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Code for: "Neural Controlled Differential Equations for Online Prediction Tasks"☆41Oct 19, 2022Updated 3 years ago
- This is the source code of PFRec☆14Dec 16, 2022Updated 3 years ago
- Community Detection Based on Structure and Content☆12Oct 12, 2018Updated 7 years ago
- Code for the SIGIR20 paper -- Measuring and Mitigating Item Under-Recommendation Bias in Personalized Ranking Systems☆16Apr 28, 2020Updated 5 years ago
- lecture materials of the ML for Physics course 2021 in Perimeter Institute☆21Mar 31, 2021Updated 4 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Dec 4, 2021Updated 4 years ago
- Official PyTorch code for the CVPR 2022 paper - Consistent Explanations by Contrastive Learning☆18Sep 11, 2022Updated 3 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)☆40Dec 3, 2021Updated 4 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- Mining GOLD Samples for Conditional GANs (NeurIPS 2019)☆18Oct 22, 2019Updated 6 years ago
- Survey-on-Implicit-Neural-Representation☆36Mar 31, 2021Updated 4 years ago
- ☆22Aug 14, 2021Updated 4 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Mar 9, 2022Updated 3 years ago
- A basic implementation of the paper Eigengame : PCA as a Nash Equilibrium☆21Jun 7, 2021Updated 4 years ago
- Early Detection of Fake News with Multi-source Weak Social Supervision☆23Jun 12, 2023Updated 2 years ago
- Self-Similarity Priors: Neural Collages as Differentiable Fractal Representations☆29Nov 26, 2022Updated 3 years ago
- This is the repository for an I/O benchmark that represents Scientific Deep Learning Workloads.☆23Dec 6, 2022Updated 3 years ago
- [ NeurIPS '22 ] ∞-AE model's implementation in JAX. Kernel-only method outperforms complicated SoTA models with a closed-form solution an…☆54Jun 8, 2023Updated 2 years ago
- This repository contains code released by DiffEqML Research☆93Mar 9, 2022Updated 3 years ago
- Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).☆29Aug 19, 2021Updated 4 years ago
- Cross-Domain Imitation Learning via Optimal Transport☆25Jun 24, 2022Updated 3 years ago
- A curated list of resources to help with computational research.☆20Jun 11, 2022Updated 3 years ago
- ☆24Mar 2, 2023Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Sep 10, 2022Updated 3 years ago
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"☆26Jun 6, 2023Updated 2 years ago
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE☆56Mar 13, 2021Updated 4 years ago
- ☆105Feb 6, 2021Updated 5 years ago
- (WSDM2020) "Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback"☆25Mar 24, 2023Updated 2 years ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆71Mar 11, 2022Updated 3 years ago
- Codebase for the paper titled "Continual learning with local module selection"☆25Nov 15, 2021Updated 4 years ago