A graph visualization of attention
☆56May 20, 2025Updated 10 months ago
Alternatives and similar repositories for attention-graph
Users that are interested in attention-graph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- It's a baby compiler. (Lean btw.)☆16May 19, 2025Updated 10 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Stream of my favorite papers and links☆44Feb 15, 2026Updated 2 months ago
- A text compressor based on the PAQ architecture.☆22Sep 12, 2025Updated 7 months ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- ☆31Dec 20, 2025Updated 3 months ago
- This is sample code for Paho MQTT server with Python 2.7☆10Mar 29, 2016Updated 10 years ago
- Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.☆36Sep 26, 2024Updated last year
- Emacs major mode for editing Futhark programs☆14May 9, 2025Updated 11 months ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- examples and guides to using Nomic Atlas☆37Apr 18, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- trio async MQTT client that wraps paho-mqtt☆12Feb 8, 2021Updated 5 years ago
- ☆115Dec 1, 2024Updated last year
- AI-Tango's Official Wii Reinforcement Learning Repository☆101Jan 6, 2026Updated 3 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- QMC Heston pricing on GPU☆12May 2, 2023Updated 2 years ago
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn☆21Apr 29, 2025Updated 11 months ago
- PDF parser using pdfminer and pytesseract for OCR support☆11Sep 19, 2019Updated 6 years ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆119Jun 26, 2025Updated 9 months ago
- Data Wrangling, Linear Models & other misc. Inferential Statistics.☆14Jul 16, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- template repo for my web library projects☆28Jan 6, 2025Updated last year
- working implimention of deepseek MLA☆44Jan 8, 2025Updated last year
- ☆21Jul 11, 2022Updated 3 years ago
- MoE training for Me and You and maybe other people☆380Mar 15, 2026Updated last month
- ☆39Aug 4, 2025Updated 8 months ago
- ☆10Oct 24, 2024Updated last year
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Mar 10, 2025Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆68Sep 30, 2024Updated last year
- Template engine for writing HTML inside *.lua(x) files, like JSX.☆15Mar 10, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- 📰 Computing the information content of trained neural networks☆23Oct 8, 2021Updated 4 years ago
- x dot com cli☆16Sep 1, 2025Updated 7 months ago
- ☆10Apr 10, 2023Updated 3 years ago
- ML from scratch in Jax☆12Aug 20, 2025Updated 7 months ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 7 years ago
- ☆14Mar 16, 2023Updated 3 years ago