JosefAlbers / Aggressor
Ultra-minimal autoregressive diffusion model for image generation
☆16Updated 3 months ago
Alternatives and similar repositories for Aggressor:
Users who are interested in Aggressor are comparing it to the libraries listed below:
- Experiments with BitNet inference on CPU☆52Updated 9 months ago
- ☆27Updated 6 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- A JAX-like function transformation engine, but micro: microjax☆30Updated 2 months ago
- Latent Large Language Models☆17Updated 4 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆26Updated this week
- Alpha-Zero Connect Four NN trained via self play☆13Updated 3 months ago
- ☆26Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆14Updated 2 weeks ago
- Train, tune, and infer Bamba model☆76Updated this week
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 9 months ago
- Minimum Description Length probing for neural network representations☆18Updated last week
- Video+code lecture on building nanoGPT from scratch☆65Updated 7 months ago
- Collection of autoregressive model implementations☆76Updated last week
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated 9 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Implementation of the Mamba SSM with hf_integration.☆56Updated 4 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 2 weeks ago
- ☆36Updated 2 years ago
- ☆31Updated 7 months ago
- Rust bindings for CTranslate2☆14Updated last year
- Training hybrid models for dummies.☆16Updated this week
- RWKV model implementation☆37Updated last year
- ☆22Updated last year
- A pipeline for using API calls to agnostically convert unstructured data into structured training data☆29Updated 3 months ago
- ☆44Updated 6 months ago
- ☆62Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆22Updated 6 months ago