Converted the Jina Tokenizer regex pattern to Python.
☆26Aug 26, 2024Updated last year
Alternatives and similar repositories for regex-tokenizer
Users that are interested in regex-tokenizer are comparing it to the libraries listed below.
- 🔨🔨🔨 Tool for building model training datasets☆20Nov 1, 2024Updated last year
- 🔨🔨🔨 (mmplot) Used to draw graphs of multiple metrics, such as accuracy and speed, across multiple deep learning models.☆87Aug 22, 2024Updated last year
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆17Sep 20, 2025Updated 7 months ago
- ☆17Apr 11, 2025Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- An RGB to Spectrum Conversion for Reflectances - Smits (1999)☆14Jan 26, 2020Updated 6 years ago
- ☆12Jan 31, 2026Updated 3 months ago
- ☆28May 19, 2024Updated last year
- Builds function calling, the most important capability of an Agent, from scratch☆18Oct 16, 2024Updated last year
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- ☆43Feb 27, 2026Updated 2 months ago
- Just a template for quickly creating a python library.☆10Apr 27, 2026Updated last week
- 🌏 Teddy is a tiny but scalable HTTP server based on Java NIO, inspired by Netty.☆11Dec 26, 2019Updated 6 years ago
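- Uses tensorflow/serving to deploy a single model, multiple models, and multiple versions of the same model, runs model predictions, and monitors the service with Prometheus.☆11Feb 24, 2021Updated 5 years ago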
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆17Aug 19, 2024Updated last year
- Enterprise event extraction☆13May 20, 2021Updated 4 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 5 months ago
- If you can read ~200 lines of Python, you understand MCP.☆62Mar 12, 2026Updated last month
- Assorted data preparation and processing scripts for Unit Minions, such as OpenAI processing and format conversion.☆14Apr 21, 2023Updated 3 years ago
- Contrast Subgraph Mining from Coherent Cores☆13Feb 20, 2018Updated 8 years ago
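- This project is designed for content creators, new media operators, and developers, and aims to simplify the video production workflow. Articles, scripts, and even existing video copy can be quickly converted into ready-to-publish short videos for platforms such as Douyin, Bilibili, YouTube Shorts, and Xiaohongshu. video-processing, automation, tik…☆80Jan 19, 2026Updated 3 months ago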
- A collection of Chinese stop words, continuously improved; pushes and contributions are welcome☆16Jun 12, 2023Updated 2 years ago
- Generating Training Data Made Easy☆43Jul 3, 2020Updated 5 years ago
- Textin xParse web integration - React☆189Feb 24, 2026Updated 2 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- Chinese event extraction☆11Feb 26, 2021Updated 5 years ago
- ☆16Dec 8, 2021Updated 4 years ago
- ☆105Sep 24, 2023Updated 2 years ago
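- Converts OCR and other models from Paddle to ONNX format, then loads these ONNX models with DJL, a Java deep learning framework, to try inference and prediction.☆14Nov 15, 2022Updated 3 years ago
- BERT text classification, EMA+AD☆19May 19, 2020Updated 5 years ago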
- A graph database based on Python; a lightweight graph database☆19Dec 13, 2020Updated 5 years ago
- ☆16May 31, 2024Updated last year
- ☆37Jan 31, 2023Updated 3 years ago
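- A browser userscript that uses the DeepL translation website to machine-translate Ren'Py projects into multiple languages quickly and in bulk.☆20Aug 1, 2025Updated 9 months ago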
- ☆32Jan 13, 2025Updated last year
- 2021 Tencent Advertising Algorithm Competition, Track 2, 5th-place solution☆19May 22, 2022Updated 3 years ago
- ☆13May 9, 2019Updated 6 years ago