Chinese tokens in tiktoken tokenizers.
☆32May 15, 2024Updated last year
Alternatives and similar repositories for chinese-tokens-in-tiktoken
Users that are interested in chinese-tokens-in-tiktoken are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆11Sep 6, 2024Updated last year
- ☆19May 3, 2025Updated 10 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆20Oct 22, 2025Updated 5 months ago
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- Description for MV-MATH☆15Jul 20, 2025Updated 8 months ago
- A single Layer CNN on MIST, get an acurray of 97.24%☆11Jun 12, 2015Updated 10 years ago
- ☆25Dec 8, 2025Updated 3 months ago
- ☆11Apr 23, 2023Updated 2 years ago
- ☆10Jun 28, 2022Updated 3 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ☆40Dec 16, 2025Updated 3 months ago
- RWKV Wiki website (archived, please visit official wiki)☆11Mar 26, 2023Updated 2 years ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 8 months ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…☆12Aug 21, 2022Updated 3 years ago
- A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022☆11Feb 1, 2023Updated 3 years ago
- 清华大学宿舍洗衣机空闲提醒小程序☆14Feb 4, 2021Updated 5 years ago
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models☆11Sep 19, 2025Updated 6 months ago
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆20Jun 7, 2025Updated 9 months ago
- Some test samples for CPG execution logic.☆20Apr 13, 2024Updated last year
- ☆10Aug 3, 2023Updated 2 years ago
- Generates and optimizes Haiku system and user prompts for classification☆15Oct 27, 2025Updated 4 months ago
- Pocket Flow: A minimalist LLM framework. Let Agents build Agents!☆30Apr 11, 2025Updated 11 months ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated 10 months ago
- ☆16Mar 30, 2024Updated last year
- [ISSTA'24] A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing☆12Jan 7, 2025Updated last year
- Approximate randomization testing.☆19Apr 17, 2020Updated 5 years ago
- tree-sitter grammar for the CodeQL language☆34Aug 29, 2025Updated 6 months ago
- CCKS 2022 通用信息抽取☆13May 1, 2022Updated 3 years ago
- A static code analysis tool☆18Mar 17, 2025Updated last year
- Some preliminary explorations of Mamba's context scaling.☆13Dec 18, 2024Updated last year
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing☆13Feb 9, 2025Updated last year