FateScript / token_visualizer
Token level visualization tools for large language models
☆75Updated 2 months ago
Alternatives and similar repositories for token_visualizer:
Users that are interested in token_visualizer are comparing it to the libraries listed below
- ☆142Updated 8 months ago
- A Comprehensive Survey on Long Context Language Modeling☆86Updated last week
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 9 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆130Updated 9 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- ☆36Updated 6 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆124Updated 8 months ago
- ☆59Updated 2 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆158Updated last week
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆63Updated last month
- Reproducing R1 for Code with Reliable Rewards☆132Updated 3 weeks ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- Reformatted Alignment☆115Updated 6 months ago
- ☆37Updated 3 months ago
- ☆26Updated last month
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆35Updated 10 months ago
- ☆124Updated 3 weeks ago
- ☆122Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆30Updated 2 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆147Updated 6 months ago
- 珠算代码大模型(Abacus Code LLM)☆55Updated 6 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆237Updated 4 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆54Updated 5 months ago
- ☆45Updated 9 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆39Updated 4 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆59Updated last week
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆158Updated 9 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆60Updated 5 months ago