Converted the Jina Tokenizer regex pattern to python.
☆26Aug 26, 2024Updated last year
Alternatives and similar repositories for regex-tokenizer
Users that are interested in regex-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔨🔨🔨(mmplot)used to draw graphs of multiple index parameters such as algorithm accuracy and speed of multiple deep learning models.☆87Aug 22, 2024Updated last year
- A proxy for Google Bard LLM☆10Nov 2, 2023Updated 2 years ago
- ☆28Nov 6, 2024Updated last year
- 从零构建了Agent中最重要的功能-function call☆18Oct 16, 2024Updated last year
- 斗破苍穹小说的新词发现☆13May 12, 2022Updated 3 years ago
- If you can read ~200 lines of Python, you understand MCP.☆40Mar 12, 2026Updated last week
- Extract text data from documents using OCR (optical character recognition) technology and NER (named entity recognition).☆10May 11, 2023Updated 2 years ago
- Just a template for quickly creating a python library.☆10Mar 16, 2026Updated last week
- 🌏 Teddy is a tiny but scalable http server based on Java NIO, inspired by netty.☆11Dec 26, 2019Updated 6 years ago
- The source code for BUTTERFLY COUNTING IN BIPARTITE NETWORKS☆12May 29, 2019Updated 6 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆15Jun 18, 2021Updated 4 years ago
- 企业事件抽取☆13May 20, 2021Updated 4 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.☆12Apr 27, 2024Updated last year
- Live stock sentiment dashboard of Dow Jones stock, showing the sentiment, stocks, industries and their respective allocation in the Dow J…☆13Dec 19, 2025Updated 3 months ago
- Unit Minions 的各种数据准备、处理脚本,诸如 OpenAI 处理、格式转换等等。☆14Apr 21, 2023Updated 2 years ago
- Contrast Subgraph Mining from Coherent Cores☆13Feb 20, 2018Updated 8 years ago
- ICDM19 - Anomaly Detection / Outlier Detection for Mixed data☆14Dec 6, 2020Updated 5 years ago
- 中文停用词汇总,持续完善中,欢迎push共建☆16Jun 12, 2023Updated 2 years ago
- Textin xParse Web 端集成 - React☆186Feb 24, 2026Updated last month
- 中文事件抽取☆11Feb 26, 2021Updated 5 years ago
- PFCC 社区博客☆14Mar 16, 2026Updated last week
- ssh code☆12May 8, 2017Updated 8 years ago
- ☆16May 31, 2024Updated last year
- ☆16Feb 28, 2023Updated 3 years ago
- ☆13May 9, 2019Updated 6 years ago
- RATT: A Thought Structure for Coherent and Correct LLM Reasoning☆16Jul 11, 2024Updated last year
- ☆71Oct 23, 2025Updated 5 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- Transfer Learning on Dogs vs Cats dataset using PyTorch C+ API☆12Aug 16, 2019Updated 6 years ago
- 面向金融领域的小样本跨类迁移事件抽取 第三名 方案及代码☆17Dec 23, 2020Updated 5 years ago
- Harmonizing N8N, NocoDB, One-API, and Fastchat to forge an accessible and intuitive AI flows integration platform. ⚡ 融合N8N、NocoDB、One-AP…☆11Jan 5, 2024Updated 2 years ago
- Unleashing Reasoning in Medical Large Language Models☆12Mar 19, 2025Updated last year
- ☆84Apr 3, 2025Updated 11 months ago
- A mini soft renderer.☆13Dec 17, 2023Updated 2 years ago
- Download Huggingface repositories without the need to install dependencies☆22Jul 30, 2025Updated 7 months ago
- botsonar analyse open api☆20May 21, 2019Updated 6 years ago