revalo / tree-diffusionLinks

Diffusion on syntax trees for program synthesis

☆470

Alternatives and similar repositories for tree-diffusion

Users that are interested in tree-diffusion are comparing it to the libraries listed below

Sorting:

akarshkumar0101 / fer
Code for the Fractured Entangled Representation Hypothesis position paper!
☆145Updated 2 months ago
namin / llm-verified-with-monte-carlo-tree-search
LLM verified with Monte Carlo Tree Search
☆278Updated 4 months ago
neurallambda / neurallambda
Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.
☆267Updated 9 months ago
facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆372Updated last year
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 3 months ago
adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆208Updated 8 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆114Updated last week
michaelhodel / arc-dsl
Domain Specific Language for the Abstraction and Reasoning Corpus
☆285Updated 9 months ago
iliao2345 / CompressARC
☆172Updated 3 months ago
michaelhodel / re-arc
Reverse Engineering the Abstraction and Reasoning Corpus
☆291Updated 5 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190Updated last year
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆318Updated 9 months ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆622Updated 4 months ago
google-deepmind / treescope
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
☆426Updated 3 months ago
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆367Updated 6 months ago
xu3kev / BARC
Bootstrapping ARC
☆139Updated 8 months ago
da-fr / arc-prize-2024
Our solution for the arc challenge 2024
☆166Updated last month
google-deepmind / tracr
☆540Updated last year
drubinstein / pokemonred_puffer
☆154Updated last month
rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆193Updated last year
jxmorris12 / gptzip
Losslessly encode text natively with arithmetic coding and HuggingFace Transformers
☆76Updated last year
haraschax / nograd
Gradient descent is cool and all, but what if we could delete it?
☆104Updated last week
Cerebras / gigaGPT
a small code base for training large models
☆307Updated 3 months ago
revalo / iceberg
A compositional diagramming and animation library as an eDSL in Python
☆218Updated 8 months ago
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆349Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated 3 weeks ago
modula-systems / modula
🧱 Modula software package
☆216Updated last week
srush / raspy
An interactive exploration of Transformer programming.
☆267Updated last year
magicproduct / hash-hop
Long context evaluation for large language models
☆220Updated 5 months ago