Ki-Seki / Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
☆13 Updated 6 months ago
Alternatives and similar repositories for Awesome-Transformer-Visualization
Users who are interested in Awesome-Transformer-Visualization are comparing it to the repositories listed below
- ☆20 Updated last month
- Must-read papers and blogs about parametric knowledge mechanism in LLMs. ☆21 Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆80 Updated 6 months ago
- ☆36 Updated last week
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability ☆11 Updated 3 months ago
- The code implementation of Symbolic-MoE ☆33 Updated 3 months ago
- PGRAG ☆51 Updated 11 months ago
- ☆46 Updated 4 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆97 Updated 3 weeks ago
- Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l… ☆24 Updated 3 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models" ☆26 Updated this week
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆18 Updated last month
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i… ☆63 Updated last year
- ☆25 Updated 2 months ago
- ☆41 Updated 6 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing ☆19 Updated 3 months ago
- RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning ☆26 Updated last month
- ☆29 Updated 2 months ago
- ☆11 Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 9 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning". ☆15 Updated 4 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows" ☆85 Updated this week
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆39 Updated 3 months ago
- ☆35 Updated 4 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA… ☆23 Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆110 Updated 8 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025) ☆27 Updated 4 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" ☆75 Updated this week
- ☆37 Updated 10 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization" ☆14 Updated last year