thedarkzeno / text-diffusion
☆13 Updated 2 years ago
Alternatives and similar repositories for text-diffusion
Users who are interested in text-diffusion compare it to the repositories listed below
- Latent Diffusion Language Models ☆70 Updated 2 years ago
- ☆50 Updated last year
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- ☆63 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆46 Updated 2 years ago
- Approximating the joint distribution of language models via MCTS ☆22 Updated last year
- ☆40 Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆55 Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 Updated 9 months ago
- ☆35 Updated 2 years ago
- Focused on fast experimentation and simplicity ☆80 Updated last year
- Cerule - A Tiny Mighty Vision Model ☆68 Updated 3 months ago
- Collection of autoregressive model implementations ☆85 Updated 3 weeks ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 Updated 2 years ago
- Latent Large Language Models ☆19 Updated last year
- Simple LLM inference server ☆20 Updated last year
- Text-writing denoising diffusion (and much more) ☆30 Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 Updated 11 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆61 Updated last year
- Experiments for efforts to train a new and improved t5 ☆76 Updated last year
- ☆17 Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- ☆12 Updated last year
- ☆62 Updated 6 months ago
- ☆50 Updated 3 months ago
- ☆52 Updated 2 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆61 Updated 3 years ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆103 Updated last year