Converted the Jina Tokenizer regex pattern to Python.
☆26Aug 26, 2024Updated last year
Alternatives and similar repositories for regex-tokenizer
Users that are interested in regex-tokenizer are comparing it to the libraries listed below.
- 🔨🔨🔨Tool for building model training datasets☆20Nov 1, 2024Updated last year
- 💡💡💡Awesome computer vision app in Gradio☆56May 17, 2024Updated 2 years ago
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 8 months ago
- A Mac clipboard management application☆48Apr 13, 2026Updated 2 months ago
- PyTorch code for JSTSP2021 paper "Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network"☆12Nov 21, 2020Updated 5 years ago
- ☆12Jan 31, 2026Updated 4 months ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- New-word discovery on the novel 斗破苍穹 (Battle Through the Heavens)☆13May 12, 2022Updated 4 years ago
- ☆43Feb 27, 2026Updated 3 months ago
- 🌏 Teddy is a tiny but scalable HTTP server based on Java NIO, inspired by Netty.☆11Dec 26, 2019Updated 6 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- A full-stack TypeScript SaaS boilerplate with Chat, Auth (LangGraph, Supabase), Payments (Stripe), and AI Credits☆18May 23, 2025Updated last year
- Enterprise event extraction☆13May 20, 2021Updated 5 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- Assorted data preparation and processing scripts for Unit Minions, such as OpenAI processing and format conversion.☆14Apr 21, 2023Updated 3 years ago
- Generating Training Data Made Easy☆43Jul 3, 2020Updated 5 years ago
- Textin xParse web front-end integration - React☆191Feb 24, 2026Updated 3 months ago
- Chinese event extraction☆11Feb 26, 2021Updated 5 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- ☆105Sep 24, 2023Updated 2 years ago
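- Converts OCR and other models from Paddle to ONNX format, then loads these ONNX models with DJL, a Java deep learning framework, to try out inference.☆14Nov 15, 2022Updated 3 years ago
- PFCC community blog☆14Updated this week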
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- ☆16May 31, 2024Updated 2 years ago
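- A browser script that uses the DeepL translation website for fast, bulk machine translation when localizing Ren'Py projects into multiple languages.☆20Aug 1, 2025Updated 10 months ago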
- RATT: A Thought Structure for Coherent and Correct LLM Reasoning☆15Jul 11, 2024Updated last year
- ☆16Apr 13, 2023Updated 3 years ago
- A code optimization agent based on LangChain 1.0 and DeepAgents☆26Dec 28, 2025Updated 5 months ago
- Chinese short-text semantic matching☆19Aug 1, 2019Updated 6 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Transfer Learning on Dogs vs Cats dataset using PyTorch C++ API☆12Aug 16, 2019Updated 6 years ago
- Third-place solution and code for few-shot cross-category transfer event extraction in the financial domain☆17Dec 23, 2020Updated 5 years ago
- ☆85Apr 3, 2025Updated last year
- Unleashing Reasoning in Medical Large Language Models☆12Mar 19, 2025Updated last year
- ☆16Sep 17, 2021Updated 4 years ago
- Harmonizing N8N, NocoDB, One-API, and Fastchat to forge an accessible and intuitive AI flows integration platform. ⚡ Integrating N8N, NocoDB, One-AP…☆12Jan 5, 2024Updated 2 years ago
- ☆21Aug 2, 2021Updated 4 years ago
- BettaFish-skill: adapted from BettaFish (微舆), a multi-agent public opinion analysis assistant that can be used in Skills-capable agents such as Claude Code, OpenClaw, and Cursor for easier use☆55Mar 7, 2026Updated 3 months ago