secsilm / chinese-tokens-in-tiktoken
Chinese tokens in tiktoken tokenizers.
☆30Updated 8 months ago
Alternatives and similar repositories for chinese-tokens-in-tiktoken:
Users that are interested in chinese-tokens-in-tiktoken are comparing it to the libraries listed below
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆40Updated this week
- Evaluation for AI apps and agent☆36Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆127Updated 7 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆78Updated 9 months ago
- Token level visualization tools for large language models☆67Updated last week
- ☆87Updated last month
- ☆36Updated 4 months ago
- 百度QA100万数据集☆48Updated last year
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆35Updated 8 months ago
- Pytorch implementation of https://arxiv.org/html/2404.07143v1☆19Updated 9 months ago
- ☆99Updated last month
- SUS-Chat: Instruction tuning done right☆48Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 7 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆194Updated 2 weeks ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆37Updated 4 months ago
- aigc evals☆10Updated last year
- ☆49Updated 2 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆70Updated last year
- ☆81Updated 9 months ago
- ☆87Updated 9 months ago
- GLM Series Edge Models☆123Updated 2 weeks ago
- Reformatted Alignment☆113Updated 3 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆71Updated last year
- 我们是第一个完全可商用的角色大模型。☆38Updated 5 months ago
- ☆33Updated 5 months ago
- FuseAI Project☆76Updated last month
- Longitudinal Evaluation of LLMs via Data Compression☆30Updated 7 months ago
- ☆17Updated 3 weeks ago
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆96Updated last year
- connecting humans and agents☆67Updated last month