paulilioaica / Differential-Transformer
☆18Updated 11 months ago
Alternatives and similar repositories for Differential-Transformer
Users who are interested in Differential-Transformer are comparing it to the libraries listed below
- PyTorch implementation of Pseudo-Riemannian Graph Convolutional Networks (NeurIPS'22)☆17Updated last year
- Code for the NeurIPS 2022 Table Representation Learning Workshop paper: "Diffusion models for missing value imputation in tabular data"☆54Updated last year
- ☆35Updated last year
- "Graph Convolutions Enrich the Self-Attention in Transformers!" NeurIPS 2024☆26Updated 6 months ago
- [NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression☆121Updated last year
- Official code for "CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis", ICML 2023☆36Updated last year
- [ICML'24] Official PyTorch Implementation of TimeX++☆26Updated 10 months ago
- Attentive Co-Evolving Neural Ordinary Differential Equations☆30Updated last year
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆10Updated 9 months ago
- Official code for ICLR 2023 paper "ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond"☆35Updated 2 years ago
- Official code for "STaSy: Score-based Tabular data Synthesis", ICLR 2023☆30Updated 2 years ago
- State Space Models☆70Updated last year
- [ICLR'25 Spotlight] Revisiting Random Walks for Learning on Graphs (RWNN), in PyTorch☆15Updated 6 months ago
- An official implementation of EHRDiff [TMLR]☆27Updated last year
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍☆44Updated 10 months ago
- Official implementation of 'Advancing Graph Convolutional Networks via General Spectral Wavelets'☆32Updated 2 months ago
- Official code for Fisher information embedding for node and graph learning (ICML 2023)☆19Updated 2 years ago
- Code for the ICLR'23 paper "Temporal Dependencies in Feature Importance for Time Series Prediction"☆26Updated 2 years ago
- [ICML 2024] Recurrent Distance Filtering for Graph Representation Learning☆15Updated last year
- Implementation of Implicit Graphon Neural Representation☆12Updated 2 years ago
- Official repository for Cell Attention Networks☆14Updated last year
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines.☆40Updated last year
- The Official PyTorch Implementation of "Poisson Variational Autoencoder" (NeurIPS 2024 Spotlight Paper)☆21Updated 4 months ago
- C-Mixup for NeurIPS 2022☆73Updated last year
- ☆13Updated 4 years ago
- C-GMVAE: Gaussian Mixture VAE with Contrastive Learning for Multi-Label Classification☆54Updated 2 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆23Updated 3 years ago
- Example code of Sparse Gaussian Process Attention (ICLR 2023)☆25Updated this week