Converted the Jina Tokenizer regex pattern to python.
☆26Aug 26, 2024Updated last year
Alternatives and similar repositories for regex-tokenizer
Users that are interested in regex-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- paper-read-notes☆13Sep 26, 2024Updated last year
- 🔨🔨🔨Tool for making model training data set☆20Nov 1, 2024Updated last year
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- 💡💡💡awesome compute vision app in gradio☆55May 17, 2024Updated 2 years ago
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A proxy for Google Bard LLM☆10Nov 2, 2023Updated 2 years ago
- A Mac clipboard management application☆48Apr 13, 2026Updated last month
- ☆17Apr 11, 2025Updated last year
- PyTorch code for JSTSP2021 paper "Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network""☆12Nov 21, 2020Updated 5 years ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- An RGB to Spectrum Conversion for Reflectances - Smits (1999)☆14Jan 26, 2020Updated 6 years ago
- OCR离线图片文字识别命令行windows程序,以JSON字符 串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。☆11Feb 8, 2024Updated 2 years ago
- ☆27Nov 6, 2024Updated last year
- ☆12Jan 31, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆28May 19, 2024Updated 2 years ago
- An efficient CNN for spectral reconstruction from RGB images☆13May 10, 2018Updated 8 years ago
- A bouncy View.☆11Aug 11, 2017Updated 8 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- 斗破苍穹小说的新词发现☆13May 12, 2022Updated 4 years ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆58Nov 3, 2023Updated 2 years ago
- 多租户分销系统,开箱即用。基于Guns后台框架,动态分销配置调整,内置多种分销策略,文档注释齐全,能够快速上手并与其他模块对接或者自行拓展。如果帮助到你,请Star项目给予作者支持!☆11Feb 22, 2023Updated 3 years ago
- ☆43Feb 27, 2026Updated 2 months ago
- Knowledge-Sharing Hub using RAG Q&A techniques with LLMs (Llama2 and ChatGPT)☆12Nov 29, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Extract text data from documents using OCR (optical character recognition) technology and NER (named entity recognition).☆10May 11, 2023Updated 3 years ago
- Just a template for quickly creating a python library.☆10May 18, 2026Updated last week
- 常用的arcpy方法☆12Dec 9, 2018Updated 7 years ago
- 🌏 Teddy is a tiny but scalable http server based on Java NIO, inspired by netty.☆11Dec 26, 2019Updated 6 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆15Jun 18, 2021Updated 4 years ago
- A full stack typescript SAAS boilerplate with Chat, Auth (Langgraph, supabase), Payments (stripe), and AI Credits☆18May 23, 2025Updated last year
- 企业事件抽取☆13May 20, 2021Updated 5 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆24Dec 2, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unit Minions 的各种数据准备、处理脚本,诸如 OpenAI 处理、格式转换等等。☆14Apr 21, 2023Updated 3 years ago
- Contrast Subgraph Mining from Coherent Cores☆13Feb 20, 2018Updated 8 years ago
- ☆18Aug 19, 2024Updated last year
- Generating Training Data Made Easy☆43Jul 3, 2020Updated 5 years ago
- Textin xParse Web 端集成 - React☆190Feb 24, 2026Updated 3 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- 中文事件抽取☆11Feb 26, 2021Updated 5 years ago