A graph visualization of attention
☆56May 20, 2025Updated last year
Alternatives and similar repositories for attention-graph
Users that are interested in attention-graph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- It's a baby compiler. (Lean btw.)☆16May 19, 2025Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- ☆125Mar 31, 2026Updated last month
- NSA Triton Kernels written with GPT5 and Opus 4.1☆70Aug 12, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- Stream of my favorite papers and links☆44Apr 19, 2026Updated last month
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- ☆31Apr 24, 2026Updated last month
- This is sample code for Paho MQTT server with Python 2.7☆10Mar 29, 2016Updated 10 years ago
- Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.☆36Sep 26, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN☆17Jan 5, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn☆21Apr 29, 2025Updated last year
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆119Jun 26, 2025Updated 11 months ago
- Collection of resources for RL and Reasoning☆27Feb 3, 2025Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆35Mar 19, 2024Updated 2 years ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 10 months ago
- working implimention of deepseek MLA☆44Jan 8, 2025Updated last year
- NeurIPS 2026 paper: The Geometry of Consolidation — follow-up to HIDE and No-Escape.☆108May 5, 2026Updated 2 weeks ago
- Git Repo for managing the ontological logger☆12Dec 27, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MoE training for Me and You and maybe other people☆387Mar 15, 2026Updated 2 months ago
- ☆39Aug 4, 2025Updated 9 months ago
- ☆10Oct 24, 2024Updated last year
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 11 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆68Sep 30, 2024Updated last year
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- Reverse-Engineering Tool☆57May 4, 2026Updated 3 weeks ago
- 📰 Computing the information content of trained neural networks☆23Oct 8, 2021Updated 4 years ago
- ☆10Apr 10, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023)☆13May 8, 2024Updated 2 years ago
- Asus Prime Z490-A-OpenCore-Hackintosh☆12Aug 19, 2022Updated 3 years ago
- Pokedex for LLMs☆14Apr 14, 2025Updated last year
- ☆14Mar 16, 2023Updated 3 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- [NeurIPS 2024] Source code for our paper "Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models".☆13Jul 18, 2025Updated 10 months ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40Updated this week