microsoft/admin-torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/admin-torch)

microsoft / admin-torch

Understanding the Difficulty of Training Transformers

☆47

Alternatives and similar repositories for admin-torch

Users that are interested in admin-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EvanZhuang / vector-icl
View on GitHub
Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)
☆24Jun 2, 2025Updated last year
Zziwei / Item-Underrecommendation-Bias
View on GitHub
Code for the SIGIR20 paper -- Measuring and Mitigating Item Under-Recommendation Bias inPersonalized Ranking Systems
☆16Apr 28, 2020Updated 6 years ago
EvanZhuang / mixinputs
View on GitHub
Official implementation for Text Generation Beyond Discrete Token Sampling
☆26Aug 11, 2025Updated 11 months ago
EvanZhuang / knowledge_flow
View on GitHub
Official Implementation of Knowledge Flow Prompting
☆35Oct 20, 2025Updated 9 months ago
DiffEqML / tutorials
View on GitHub
☆11Apr 14, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
csinva / cookiecutter-ml-research
View on GitHub
A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪
☆19Apr 9, 2026Updated 3 months ago
violet-zct / group-conditional-DRO
View on GitHub
Group-conditional DRO to alleviate spurious correlations
☆15Jul 15, 2021Updated 5 years ago
EvanZhuang / wavspa
View on GitHub
WavSpA: Wavelet Space Attention for Enhancing Transformer's Long Sequence Learning
☆13Feb 24, 2024Updated 2 years ago
microsoft / MWSS
View on GitHub
Early Detection of Fake News with Multi-source Weak Social Supervision
☆24Jun 12, 2023Updated 3 years ago
KuNyaa / berkeleydeeprlcourse-homework-pytorch-solution
View on GitHub
Solutions for CS294-112 Fall2018 assignments in Pytorch
☆20Oct 13, 2018Updated 7 years ago
PreferredAI / seer
View on GitHub
Code of the paper "Synthesizing Aspect-Driven Recommendation Explanations from Reviews", IJCAI'20
☆10Apr 5, 2024Updated 2 years ago
TAU-MLwell / Set-Tree
View on GitHub
Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21)
☆10Jan 19, 2022Updated 4 years ago
microsoft / AMOS
View on GitHub
[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
☆26Jul 26, 2023Updated 3 years ago
jiyounglee-0523 / FourierDecoder
View on GitHub
Official repository for Fourier model that can generate periodic signals
☆10Mar 10, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
quanmingyao / SIF
View on GitHub
Efficient Neural Interaction Functions Search for Collaborative Filtering
☆18Feb 15, 2020Updated 6 years ago
xuaoxiqi / Numerical-Methods
View on GitHub
implementation of basic numerical methods
☆13Apr 21, 2021Updated 5 years ago
naver-ai / mid.metric
View on GitHub
☆30Jan 3, 2023Updated 3 years ago
microsoft / platformer-ml-game
View on GitHub
Edutainment game teaching players concepts around machine learning
☆15Feb 18, 2020Updated 6 years ago
ChajinShin / Survey-on-Implicit-Neural-Representation
View on GitHub
Survey-on-Implicit-Neural-Representation
☆36Mar 31, 2021Updated 5 years ago
xuaoxiqi / KFVS
View on GitHub
Codes for solving flow problems based on Kinetic Flux Vector Splitting (KFVS) Scheme
☆10Jun 1, 2015Updated 11 years ago
microsoft / Efficient-Large-LM-Trainer
View on GitHub
☆39Jul 25, 2024Updated 2 years ago
microsoft / aicreator
View on GitHub
aicreator for aidata
☆14May 17, 2023Updated 3 years ago
LiyuanLucasLiu / LD-Net
View on GitHub
Language Model Pruning for Sequence Labeling
☆147Feb 29, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fistyee / MixPro
View on GitHub
🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]
☆22Nov 3, 2023Updated 2 years ago
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
jambo6 / online-neural-cdes
View on GitHub
Code for: "Neural Controlled Differential Equations for Online Prediction Tasks"
☆42Oct 19, 2022Updated 3 years ago
sofiaherrero / lime-ner
View on GitHub
lime-ner: extending LIME for Named Entity Recognition
☆10Aug 15, 2018Updated 7 years ago
YassineYousfi / tiny-rf
View on GitHub
☆17Feb 9, 2026Updated 5 months ago
Ajim63 / Attention-Based-Dynamic-Graph-Learning-Framework-for-Asset-Pricing
View on GitHub
This is a tensorflow-keras implementation of our paper "Attention Based Dynamic Graph Learning Framework for Asset Pricing"
☆14Dec 13, 2021Updated 4 years ago
Charleo85 / seqrec
View on GitHub
A curator of Sequential Recommendation algorithms
☆20Oct 15, 2020Updated 5 years ago
EvanZhuang / MRI-Reconstruction-with-Sparse-Optimization
View on GitHub
Magnetic resonance imaging (MRI) images are known to be sparse. This is an implementation using non-convex penalty function that encourag…
☆19Aug 10, 2019Updated 6 years ago
commoncrawl / ia-web-commons
View on GitHub
Web archiving utility library
☆11Jul 21, 2026Updated last week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
lemmonation / fcl-nat
View on GitHub
Code for "Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation"
☆13Jul 10, 2020Updated 6 years ago
nathanbreitsch / torchmocks
View on GitHub
Test pytorch code with minimal computational overhead
☆26Jun 8, 2023Updated 3 years ago
wjx-error / ProtoSurv
View on GitHub
The implementations for NeurIPS 2024 paper "Leveraging Tumor Heterogeneity: Heterogeneous Graph Representation Learning for Cancer Surviv…
☆15Jun 11, 2025Updated last year
ermongroup / self-similarity-prior
View on GitHub
Self-Similarity Priors: Neural Collages as Differentiable Fractal Representations
☆30Nov 26, 2022Updated 3 years ago
jleinonen / geogan
View on GitHub
Style-based GAN for geophysical fields
☆13May 7, 2020Updated 6 years ago
ymoslem / MT-Evaluation
View on GitHub
Machine Translation (MT) Evaluation Scripts
☆18May 19, 2024Updated 2 years ago
mscheuerer / NeuralNetworkS2S
View on GitHub
Python code for MWR paper 'Using Artificial Neural Networks for Generating Probabilistic Subseasonal Precipitation Forecasts over Califor…
☆13Jun 5, 2020Updated 6 years ago