Ki-Seki / Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
☆12 Updated 5 months ago
Alternatives and similar repositories for Awesome-Transformer-Visualization
Users who are interested in Awesome-Transformer-Visualization are comparing it to the repositories listed below
- Must-read papers and blogs about parametric knowledge mechanism in LLMs. ☆20 Updated this week
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i… ☆61 Updated last year
- ☆27 Updated 3 weeks ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆85 Updated last month
- Official implementation of the paper "Process Reward Model with Q-value Rankings" ☆57 Updated 3 months ago
- ☆28 Updated last week
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆75 Updated 4 months ago
- ☆97 Updated 2 months ago
- ☆17 Updated last month
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆93 Updated 2 months ago
- Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning". (By Xinghao Chen) ☆15 Updated 2 months ago
- NeurIPS 2024 tutorial on LLM Inference ☆43 Updated 5 months ago
- ☆82 Updated last week
- The code implementation of Symbolic-MoE ☆31 Updated 2 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆47 Updated 3 weeks ago
- e ☆33 Updated 3 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 Updated 2 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 Updated last year
- ☆16 Updated last month
- ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning ☆39 Updated 2 weeks ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆16 Updated last month
- ☆45 Updated 3 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆33 Updated 2 months ago
- ☆46 Updated last week
- Lottery Ticket Adaptation ☆39 Updated 5 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆21 Updated 10 months ago
- ☆31 Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆97 Updated 6 months ago
- ☆66 Updated 5 months ago
- ☆30 Updated 2 months ago