The Jina Tokenizer regex pattern, converted to Python.
☆26Jun 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for regex-tokenizer
Users who are interested in regex-tokenizer are comparing it to the libraries listed below.
- paper-read-notes☆13Sep 26, 2024Updated last year
- 🔨🔨🔨 (mmplot) A tool for drawing graphs of multiple metrics, such as accuracy and speed, across multiple deep learning models.☆87Aug 22, 2024Updated last year
- 💡💡💡 Awesome computer vision app in Gradio☆57May 17, 2024Updated 2 years ago
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 9 months ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆26Jan 25, 2025Updated last year
- ☆17Apr 11, 2025Updated last year
- ☆28Nov 6, 2024Updated last year
- ☆12Jan 31, 2026Updated 5 months ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- New word discovery for the novel Battle Through the Heavens (斗破苍穹)☆13May 12, 2022Updated 4 years ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆58Nov 3, 2023Updated 2 years ago
- SQL parser and converter☆11Jan 5, 2024Updated 2 years ago
- Multi-Server PIR (CCSW'14)☆11Dec 2, 2020Updated 5 years ago
- Knowledge-Sharing Hub using RAG Q&A techniques with LLMs (Llama2 and ChatGPT)☆11Nov 29, 2023Updated 2 years ago
- Just a template for quickly creating a Python library.☆10Updated this week
- 🌏 Teddy is a tiny but scalable http server based on Java NIO, inspired by netty.☆11Dec 26, 2019Updated 6 years ago
- The source code for BUTTERFLY COUNTING IN BIPARTITE NETWORKS☆12May 29, 2019Updated 7 years ago
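- Deploying a single model, multiple models, and multiple versions of the same model with tensorflow/serving, running model predictions, and monitoring the service with Prometheus.☆11Feb 24, 2021Updated 5 years ago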
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆15Jun 18, 2021Updated 5 years ago
- BERT TensorRT model acceleration and deployment☆10Apr 1, 2022Updated 4 years ago
- A full stack typescript SAAS boilerplate with Chat, Auth (Langgraph, supabase), Payments (stripe), and AI Credits☆18May 23, 2025Updated last year
- llama2.c with Chinese support☆12Jun 5, 2024Updated 2 years ago
- If you can read ~200 lines of Python, you understand MCP.☆69Mar 12, 2026Updated 3 months ago
- Various data preparation and processing scripts for Unit Minions, such as OpenAI processing, format conversion, and so on.☆14Apr 21, 2023Updated 3 years ago
- Contrast Subgraph Mining from Coherent Cores☆13Feb 20, 2018Updated 8 years ago
- ☆18Aug 19, 2024Updated last year
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 3 years ago
- Chinese event extraction☆11Feb 26, 2021Updated 5 years ago
- ☆16Dec 8, 2021Updated 4 years ago
- ☆105Sep 24, 2023Updated 2 years ago
- Converts OCR and other models from Paddle to ONNX format, then loads these ONNX models with DJL, a Java deep learning framework, to try out inference.☆14Nov 15, 2022Updated 3 years ago
- BERT text classification, EMA+AD☆19May 19, 2020Updated 6 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- ssh code☆12May 8, 2017Updated 9 years ago
- ☆16May 31, 2024Updated 2 years ago
- ☆37Jan 31, 2023Updated 3 years ago
- A browser script that uses the DeepL translation website to quickly machine-translate Ren'Py projects into multiple languages in bulk.☆20Aug 1, 2025Updated 11 months ago
- ☆32Jan 13, 2025Updated last year