thedarkzeno / text-diffusionLinks
☆13Updated last year
Alternatives and similar repositories for text-diffusion
Users that are interested in text-diffusion are comparing it to the libraries listed below
Sorting:
- ☆49Updated last year
- Latent Diffusion Language Models☆69Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆21Updated 10 months ago
- Text-writing denoising diffusion (and much more)☆30Updated 2 years ago
- ☆63Updated 11 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 11 months ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Collection of autoregressive model implementation☆86Updated 4 months ago
- ☆50Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- σ-GPT: A New Approach to Autoregressive Models☆67Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆59Updated 3 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 5 months ago
- ☆60Updated last month
- Focused on fast experimentation and simplicity☆75Updated 8 months ago
- ☆39Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆26Updated 2 years ago
- utilities for loading and running text embeddings with onnx☆44Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 8 months ago
- An introduction to LLM Sampling☆79Updated 8 months ago
- Generates grammer files from typescript for LLM generation☆38Updated last year
- A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using …☆15Updated 5 years ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- ☆27Updated 2 years ago
- Train vision models using JAX and 🤗 transformers☆99Updated last week
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 4 months ago