FateScript / token_visualizer
Token level visualization tools for large language models
☆80Updated 4 months ago
Alternatives and similar repositories for token_visualizer
Users that are interested in token_visualizer are comparing it to the libraries listed below
Sorting:
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆240Updated 6 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆32Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- A Comprehensive Survey on Long Context Language Modeling☆142Updated last month
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆236Updated last month
- Reproducing R1 for Code with Reliable Rewards☆190Updated last week
- ☆143Updated 10 months ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆35Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- Fantastic Data Engineering for Large Language Models☆88Updated 4 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆246Updated this week
- On Memorization of Large Language Models in Logical Reasoning☆65Updated last month
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆92Updated 2 weeks ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 5 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆126Updated 11 months ago
- ☆47Updated 11 months ago
- ☆93Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆55Updated 7 months ago
- ☆27Updated 3 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆66Updated this week
- ☆184Updated last month
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆176Updated last month
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆183Updated 7 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- Reformatted Alignment☆113Updated 7 months ago
- ☆63Updated 5 months ago