A graph visualization of attention
☆57May 20, 2025Updated 10 months ago
Alternatives and similar repositories for attention-graph
Users that are interested in attention-graph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- NSA Triton Kernels written with GPT5 and Opus 4.1☆71Aug 12, 2025Updated 7 months ago
- Stream of my favorite papers and links☆44Feb 15, 2026Updated last month
- ☆40Jul 26, 2024Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆31Dec 20, 2025Updated 3 months ago
- ☆54Apr 13, 2025Updated 11 months ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- examples and guides to using Nomic Atlas☆37Apr 18, 2025Updated 11 months ago
- ☆20Mar 4, 2025Updated last year
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- ☆12Apr 30, 2023Updated 2 years ago
- PDF parser using pdfminer and pytesseract for OCR support☆11Sep 19, 2019Updated 6 years ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆35Mar 19, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆119Jun 26, 2025Updated 9 months ago
- Collection of resources for RL and Reasoning☆27Feb 3, 2025Updated last year
- Data Wrangling, Linear Models & other misc. Inferential Statistics.☆14Jul 16, 2022Updated 3 years ago
- ☆16Dec 29, 2024Updated last year
- working implimention of deepseek MLA☆45Jan 8, 2025Updated last year
- MoE training for Me and You and maybe other people☆381Mar 15, 2026Updated last week
- ☆10Oct 24, 2024Updated last year
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Mar 10, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🧮 Algebraic Positional Encodings.☆18Aug 20, 2025Updated 7 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆67Sep 30, 2024Updated last year
- 📰 Computing the information content of trained neural networks☆23Oct 8, 2021Updated 4 years ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Jul 20, 2023Updated 2 years ago
- A proselint linter for use with Phabricator's arc command line tool.☆17Jun 17, 2016Updated 9 years ago
- [NeurIPS 2024] Source code for our paper "Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models".☆13Jul 18, 2025Updated 8 months ago
- ML from scratch in Jax☆12Aug 20, 2025Updated 7 months ago
- Pi agent hook for rewinding file changes during coding sessions☆79Jan 31, 2026Updated last month
- Pokedex for LLMs☆14Apr 14, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- ComfyUI GlitchNodes☆63Mar 14, 2026Updated last week
- ☆14Oct 30, 2024Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Mar 14, 2026Updated last week
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- ☆35Feb 24, 2025Updated last year
- ☆85Jun 14, 2025Updated 9 months ago