Oxen-AI / Score-Entropy-Discrete-DiffusionLinks
Modified Score-Entropy-Discrete-Diffusion to do a character level ml model and integrate with Oxen
☆14Updated last year
Alternatives and similar repositories for Score-Entropy-Discrete-Diffusion
Users that are interested in Score-Entropy-Discrete-Diffusion are comparing it to the libraries listed below
Sorting:
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆37Updated 4 months ago
- ☆23Updated 2 years ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆40Updated 4 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆19Updated 11 months ago
- A Pytorch Implementations for Various Vector Quantization Methods☆30Updated 3 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated last month
- Implementation of a Light Recurrent Unit in Pytorch☆48Updated 9 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 9 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 3 years ago
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆58Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆51Updated 3 months ago
- research impl of Native Sparse Attention (2502.11089)☆54Updated 4 months ago
- ☆32Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆28Updated this week
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆34Updated this week
- ☆44Updated 8 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆55Updated 8 months ago
- ☆10Updated 8 months ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆89Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆24Updated 10 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 8 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 5 months ago
- GPT for FACodec☆13Updated last year
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆13Updated 3 months ago