thedarkzeno / text-diffusion
☆13 Updated 2 years ago
Alternatives and similar repositories for text-diffusion
Users who are interested in text-diffusion are comparing it to the repositories listed below.
- Latent Diffusion Language Models ☆70 Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆46 Updated 2 years ago
- ☆50 Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large-scale image-text dataset. ☆32 Updated 2 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆61 Updated 3 years ago
- ☆62 Updated 2 years ago
- ☆63 Updated last year
- σ-GPT: A New Approach to Autoregressive Models ☆70 Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 Updated 2 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 Updated 8 months ago
- Focused on fast experimentation and simplicity ☆80 Updated last year
- Collection of autoregressive model implementations ☆85 Updated 2 weeks ago
- ☆12 Updated last year
- Text-writing denoising diffusion (and much more) ☆30 Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration. ☆55 Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind ☆59 Updated 8 months ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆51 Updated 2 years ago
- Train vision models using JAX and 🤗 transformers ☆100 Updated last month
- An open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆110 Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model ☆68 Updated 2 months ago
- ☆40 Updated last year
- Approximating the joint distribution of language models via MCTS ☆22 Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated last year
- ☆16 Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆103 Updated last year
- ☆50 Updated 3 months ago
- ☆29 Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ☆15 Updated 2 years ago
- ☆91 Updated 3 years ago