secsilm / chinese-tokens-in-tiktoken
Chinese tokens in tiktoken tokenizers.
☆31Updated 9 months ago
Alternatives and similar repositories for chinese-tokens-in-tiktoken:
Users that are interested in chinese-tokens-in-tiktoken are comparing it to the libraries listed below
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆48Updated this week
- A lightweight script for processing HTML page to markdown format with support for code blocks☆78Updated 10 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆207Updated last month
- ☆36Updated 5 months ago
- ☆91Updated 2 months ago
- ☆100Updated 2 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Updated last year
- Token level visualization tools for large language models☆74Updated last month
- Evaluation for AI apps and agent☆36Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 8 months ago
- Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completio…☆78Updated 2 months ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆36Updated 9 months ago
- ☆30Updated 11 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Updated 4 months ago
- Longitudinal Evaluation of LLMs via Data Compression☆30Updated 8 months ago
- ☆17Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 8 months ago
- 我们是第一个完全可商用的角色大模型。☆39Updated 6 months ago
- Reformatted Alignment☆114Updated 4 months ago
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆88Updated 5 months ago
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆95Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆20Updated last week
- Evaling and unaligning Chinese LLM censorship☆55Updated 4 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆37Updated 5 months ago
- GLM Series Edge Models☆130Updated this week
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆40Updated 7 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆129Updated 2 months ago