jackfsuia / LLM-Data-Cleaner
用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, moonshot, PaddleOCR, OpenAI, Llava.
☆12Updated 5 months ago
Alternatives and similar repositories for LLM-Data-Cleaner:
Users that are interested in LLM-Data-Cleaner are comparing it to the libraries listed below
- ☆22Updated 4 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆46Updated 5 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆60Updated 7 months ago
- ☆21Updated 5 months ago
- ☆78Updated 9 months ago
- 我们是第一个完全可商用的角色大模型。☆39Updated 6 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆58Updated 3 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 5 months ago
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Updated 3 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆61Updated 7 months ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆32Updated 4 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- Our 2nd-gen LMM☆32Updated 9 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆111Updated 3 months ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆25Updated last month
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆113Updated 3 months ago
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆58Updated 2 months ago
- ☆56Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- ☆15Updated 8 months ago
- ☆78Updated 9 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆48Updated this week
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆139Updated 8 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆21Updated 7 months ago
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆110Updated this week
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆45Updated last month
- ☆25Updated 4 months ago
- ☆36Updated 4 months ago
- ☆35Updated 4 months ago