revalo / tree-diffusionLinks
Diffusion on syntax trees for program synthesis
☆470Updated last year
Alternatives and similar repositories for tree-diffusion
Users that are interested in tree-diffusion are comparing it to the libraries listed below
Sorting:
- Code for the Fractured Entangled Representation Hypothesis position paper!☆145Updated 2 months ago
- LLM verified with Monte Carlo Tree Search☆278Updated 4 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆267Updated 9 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆372Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 3 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆208Updated 8 months ago
- The history files when recording human interaction while solving ARC tasks☆114Updated last week
- Domain Specific Language for the Abstraction and Reasoning Corpus☆285Updated 9 months ago
- ☆172Updated 3 months ago
- Reverse Engineering the Abstraction and Reasoning Corpus☆291Updated 5 months ago
- Simple Transformer in Jax☆138Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆318Updated 9 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆622Updated 4 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆426Updated 3 months ago
- Visualize the intermediate output of Mistral 7B☆367Updated 6 months ago
- Bootstrapping ARC☆139Updated 8 months ago
- Our solution for the arc challenge 2024☆166Updated last month
- ☆540Updated last year
- ☆154Updated last month
- Draw more samples☆193Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated last year
- Gradient descent is cool and all, but what if we could delete it?☆104Updated last week
- a small code base for training large models☆307Updated 3 months ago
- A compositional diagramming and animation library as an eDSL in Python☆218Updated 8 months ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆349Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆177Updated 3 weeks ago
- 🧱 Modula software package☆216Updated last week
- An interactive exploration of Transformer programming.☆267Updated last year
- Long context evaluation for large language models☆220Updated 5 months ago