Ki-Seki / Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
☆9Updated 4 months ago
Alternatives and similar repositories for Awesome-Transformer-Visualization:
Users that are interested in Awesome-Transformer-Visualization are comparing it to the libraries listed below
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆16Updated 2 weeks ago
- ☆85Updated 3 weeks ago
- A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)☆12Updated last year
- ☆36Updated 7 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆64Updated last month
- This is a curated list of "Continual Learning with Pretrained Models" research.☆16Updated last week
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆56Updated 11 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆74Updated 3 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆33Updated last year
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆12Updated 7 months ago
- A curated list of recent efficient video generation methods.☆16Updated 4 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆80Updated last week
- Lottery Ticket Adaptation☆39Updated 4 months ago
- The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".☆64Updated 2 weeks ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆103Updated 3 weeks ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆191Updated this week
- Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.☆33Updated this week
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆14Updated 2 weeks ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated last month
- ☆20Updated last month
- The official github repo for the open online courses: "Dive into LLMs".☆10Updated last year
- ☆72Updated last week
- ☆12Updated 4 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆46Updated 4 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆49Updated last month
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 5 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆29Updated last year
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆59Updated 5 months ago
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆11Updated this week