An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation
☆13Sep 9, 2024Updated last year
Alternatives and similar repositories for efficient_bpe
Users that are interested in efficient_bpe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago
- ☆19Dec 17, 2025Updated 5 months ago
- ☆17Feb 6, 2025Updated last year
- An educational Chinese-to-English NMT project featuring the classic encoder-decoder Transformer and a configurable modern decoder-only GP…☆20Jun 7, 2026Updated last week
- 校园音乐征集投票系统 A system for electing annual school music☆10Jun 8, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 5 months ago
- Taurix OS kernel. Taurix 系统内核,操作系统原理实(xjb)践(写)☆12Dec 20, 2020Updated 5 years ago
- 量化交易网站,软工三大作业迭代三,团队项目☆11Mar 8, 2018Updated 8 years ago
- Single-step image generation at 306 FPS. Drifting vs Diffusion head-to-head on CIFAR-10.☆44Feb 13, 2026Updated 4 months ago
- NewsApp包含客户端源码、服务端源码、数据库文件。 基于Miscrosoft人工智能项目ProjectOxford中的Recognition Emotion做的, 主要是基于用户的面部表情来推送不同类别的新闻。 Emotion API可以参考:https://www.p…☆10Mar 2, 2016Updated 10 years ago
- ☆16Jan 6, 2025Updated last year
- The UnisonAI Multi-Agent Framework built on custom workflow which allows ai agents to talk together and provides a flexible and extensibl…☆23Feb 24, 2026Updated 3 months ago
- 基于 BPE 实现的中文分词。优化:预处理,并行计算,多字词,多词表☆14May 14, 2022Updated 4 years ago
- 基于MFCC特征构建单核GMM的0-9独立词语音识别,MFCC,GMM,sklearn,Isolated word recognition。☆10Nov 18, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 同济大学计科机器学习大作业☆10Mar 22, 2025Updated last year
- ☆22Nov 4, 2024Updated last year
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆20Oct 13, 2025Updated 8 months ago
- Fine-tuning embedding models.☆14Nov 25, 2024Updated last year
- 这是一个可通过网页远程登录管理、可接入讯飞星火、ChatGPT等大语言模型的微信聊天机器人,使用微信网页版协议。☆16Feb 20, 2024Updated 2 years ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆18Mar 26, 2025Updated last year
- Real time faster whisper gradio☆25Aug 17, 2025Updated 9 months ago
- 监控哔哩哔哩直播间数据,实时保存至数据库,并在内置网页上查看精致的可视化统计图表。☆13Jan 4, 2022Updated 4 years ago
- JAX port of FLUX.1 models using flax.nnx☆23Sep 28, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 2024-2025下半学年人工智能导论(拔尖班)☆17Jun 16, 2025Updated 11 months ago
- By leveraging Bocha AI Search API , your AI applications can now access high-quality, up-to-date knowledge from billions of web pages and…☆21Feb 9, 2025Updated last year
- python爬取股市数据,并对各个行业股票行情、财务数据进行重构分析☆11Jul 26, 2020Updated 5 years ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆17Apr 12, 2024Updated 2 years ago
- P5js sketches (Processing for JavaScript)☆19Jan 21, 2026Updated 4 months ago
- ☆23Jun 6, 2025Updated last year
- Python class for creating and optimizing quadratic and cubic Bezier curves and path smoothing implementation.☆46Jun 19, 2021Updated 4 years ago
- Jarvis made by Kaushik Shresth Reverse Engineered by Likhi☆16Feb 16, 2025Updated last year
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆30Apr 8, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 这是一个大学生互联网+的大创项目:“一点到家”——云滇家政平台助力乡村振兴,系统前台:微信小程序,后端springboot,数据库mysql。属于一个非常值得推荐的项目,系统源码简单宜读,干净简洁、注释详细,可二次开发。创意满满,贴近生活,缓解就业压力,为农民增收致富,促进…☆14Jun 17, 2023Updated 2 years ago
- 「城语」APP基于A级景区、历史古迹、文物保护单位等基础数据,利用先进的大模型能力实现智能化的Citywalk 路线规划,包括设计一条路线、生成路线攻略、生成景点的推荐理由等三大核心功能;利用大模型减少了人工编辑和推荐的工作量,并可以根据游客的需求进行个性化定制,提升了游客…☆19Feb 20, 2024Updated 2 years ago
- Aurora forecasts created from solar wind data (OVATION Prime 2010)☆20Apr 11, 2025Updated last year
- Integrating Large Weather Models with Data Assimilation☆24Jun 2, 2024Updated 2 years ago
- CVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Dive…☆16Jan 23, 2020Updated 6 years ago
- 一款很棒的书摘软件 微信小程序 中山大学软件创新大赛十强参赛项目☆16May 3, 2018Updated 8 years ago
- 大学整理项目一:一个旅游踩点项目,踩点即对一个个事先有记录的有意思的旅行停驻点进行拜访游玩并留下你的足,这些停驻点我们称之为关注点。在该系统中还可以自己规划行程,事先计划好要前往的关注点 ,路线然后按照系统上的路线规划进行旅游,在旅游中可以写一些文字,发一些图片,整个行程完…☆10Apr 27, 2018Updated 8 years ago