vvvm23 / TchAIkovsky
Using JAX to generate piano music as MIDI
☆39 Updated last year
Alternatives and similar repositories for TchAIkovsky
Users who are interested in TchAIkovsky compare it to the libraries listed below.
- This repository contains the implementation of **Alternators**, a novel family of generative models for time-dependent data. ☆35 Updated 2 months ago
- Implementation of Perceiver AR, DeepMind's new long-context attention network based on the Perceiver architecture, in PyTorch ☆89 Updated 2 years ago
- A JAX implementation of the continuous-time formulation of Consistency Models ☆85 Updated 2 years ago
- Explorations into the proposal from the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆101 Updated 7 months ago
- Train vision models using JAX and 🤗 transformers ☆98 Updated last week
- Latent Diffusion Language Models ☆69 Updated last year
- Diffusion models in PyTorch ☆107 Updated last month
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single-machine microbatches, in PyTorch ☆25 Updated 6 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆79 Updated last year
- ☆65 Updated 8 months ago
- ☆19 Updated 2 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆33 Updated 5 months ago
- Utilities for PyTorch distributed ☆24 Updated 5 months ago
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆124 Updated last year
- σ-GPT: A New Approach to Autoregressive Models ☆67 Updated 11 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suite ☆55 Updated last year
- ☆34 Updated 11 months ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆130 Updated last year
- Focused on fast experimentation and simplicity ☆76 Updated 7 months ago
- Implementation of GateLoop Transformer in PyTorch and JAX ☆89 Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 Updated 2 weeks ago
- Collection of autoregressive model implementations ☆86 Updated 3 months ago
- ☆82 Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 Updated last year
- Code for the paper "Don't Pay Attention" ☆48 Updated last month
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆56 Updated last year
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpo… ☆115 Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated 2 years ago
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆104 Updated 5 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆102 Updated 9 months ago