doomslide / attention-graph
☆54Updated 2 weeks ago
Alternatives and similar repositories for attention-graph:
Users that are interested in attention-graph are comparing it to the libraries listed below
- look how they massacred my boy☆63Updated 5 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 4 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆67Updated last month
- ☆20Updated 4 months ago
- Simple Transformer in Jax☆136Updated 9 months ago
- ☆97Updated 5 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 4 months ago
- smol models are fun too☆90Updated 4 months ago
- ☆38Updated 7 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated 3 weeks ago
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆75Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 2 weeks ago
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 5 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 10 months ago
- ☆85Updated 2 months ago
- ☆105Updated 3 months ago
- Repository to create traveling waves integrate special information through time☆49Updated 2 weeks ago
- Claude Deep Research config for Claude Code.☆155Updated last week
- papers.day☆92Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 5 months ago
- The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)Updated 3 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆138Updated last month
- Lego for GRPO☆25Updated last week
- Train your own SOTA deductive reasoning model☆81Updated 2 weeks ago
- Letting Claude Code develop his own MCP tools :)☆90Updated 2 weeks ago
- ☆96Updated 5 months ago
- An introduction to LLM Sampling☆77Updated 3 months ago
- ☆126Updated 7 months ago