RuslanKhalitov / ChordMixer
The official implementation of the ChordMixer architecture.
☆61Updated last year
Alternatives and similar repositories for ChordMixer:
Users that are interested in ChordMixer are comparing it to the libraries listed below
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆69Updated 2 years ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Updated 2 years ago
- ☆20Updated 6 months ago
- Deep Learning Audio Course – AI Masters☆27Updated 8 months ago
- Compression schema for gradients of activations in backward pass☆44Updated last year
- GULAG: GUessing LAnGuages with neural networks☆13Updated 2 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch☆39Updated 2 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆86Updated last year
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated last year
- ☆21Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆121Updated last year
- ☆17Updated last month
- Sequence Modeling with Structured State Spaces☆61Updated 2 years ago
- Deep Generative Models course, 2021☆21Updated 3 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆22Updated 2 weeks ago
- Framework for processing and filtering datasets☆27Updated 5 months ago
- Learning to Initialize Neural Networks for Stable and Efficient Training☆138Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 7 months ago
- Simple audio AE☆12Updated 2 months ago
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more!☆51Updated 5 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆98Updated last month
- Примеры пропозалов для подачи заявки в Open.TLab☆26Updated 2 years ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆32Updated 2 years ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆26Updated 4 years ago
- Lightweight knowledge distillation pipeline☆28Updated 3 years ago
- ☆31Updated 2 years ago
- AdaCat☆49Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago