isLinXu / regex-tokenizer
Converted the Jina Tokenizer regex pattern to Python.
☆24Updated 2 months ago
Related projects
Alternatives and complementary repositories for regex-tokenizer
- A Python Package to Access World-Class Generative Models☆125Updated 5 months ago
- ☆60Updated 2 months ago
- dify's rag patch module☆48Updated this week
- ☆15Updated 5 months ago
- A Chinese-native retrieval-augmented generation (RAG) evaluation benchmark☆100Updated 7 months ago
- Analysis of Chinese and English layouts☆130Updated last month
- As the name suggests: a hand-built RAG☆111Updated 8 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆70Updated last year
- doc2x docs☆30Updated this week
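- gpt_server is an open-source framework for production-grade deployment of LLMs or embedding models.☆123Updated this week
- To try textin document parsing, visit https://cc.co/16YSIy☆59Updated last week
- A table recognition algorithm derived from PP-Structure; the model is converted to ONNX and uses ONNXRuntime as the inference engine, making it simple to deploy with no memory-leak issues.☆79Updated last week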
- ☆105Updated last year
- A light proxy solution for HuggingFace hub.☆44Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆137Updated 2 months ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆45Updated 5 months ago
- SmartSearch: Building a quick conversation-based search engine with LLMs.☆43Updated 6 months ago
- Demonstrates the remarkable effect of vllm on Chinese large language models☆31Updated last year
- 360LayoutAnaylsis, a series of document analysis models and datasets developed by 360 AI Research Institute☆242Updated 2 months ago
- SUS-Chat: Instruction tuning done right☆47Updated 10 months ago
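- An AI agent for real-time trending-topic and public-opinion analysis on the BiliBili website☆16Updated this week
- ChatGPT WebUI using gradio. Provides a simple, easy-to-use web UI for LLM chat and retrieval-based knowledge Q&A (RAG)☆93Updated 3 months ago
- The first Chinese version of the llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language human-computer interaction)☆89Updated last year
- A hand-built Agent demo based on ReAct☆105Updated 6 months ago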
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆25Updated 6 months ago
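- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM fine-tuning.☆30Updated 3 months ago
- Evaluation for AI apps and agents☆35Updated 10 months ago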
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆17Updated 2 weeks ago
- The world's first Chinese-optimized version of StableVicuna.☆65Updated last year