Liwu-di / PaperCrawlerUtil
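A toolkit that covers fetching URLs, extracting elements, extracting files, and more. It also includes a ready-made utility for downloading papers via scihub, along with PDF-to-DOC conversion, text translation, proxy connection retrieval (including fetching proxy links through an API), PDF merging, extracting selected pages from a PDF, and CSV and XLS file processing.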
☆17Updated last year
Alternatives and similar repositories for PaperCrawlerUtil:
Users that are interested in PaperCrawlerUtil are comparing it to the libraries listed below
- aigc evals☆10Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆34Updated 5 months ago
- A Multi-Modal Dataset of Chinese Governmental Documents☆31Updated 4 years ago
- Layout analysis + OCR☆11Updated 3 years ago
- AGM: an AI gene map model that explores the inner workings of AI models and GPT/LLM large models from the perspective of token-weight particles.☆28Updated last year
- Automatically tracks hot GitHub repos and updates daily☆47Updated this week
- ☆13Updated last year
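- Assorted data preparation and processing scripts for Unit Minions, such as OpenAI processing and format conversion.☆14Updated last year
- A survey of large language model training and serving☆37Updated last year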
- Large-scale exact string matching tool☆16Updated 3 weeks ago
- share data, prompt data, pretraining data☆36Updated last year
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆113Updated last year
- Real-time video understanding and interaction through text, audio, image, and video with a large multi-modal model. A framework for real-time video understanding and interaction using large multi-modal models, via text…☆23Updated last year
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"☆14Updated last year
- Reproduction of the paper "PDFTriage: Question Answering over Long, Structured Documents"☆40Updated last year
- Let AI live in a small world by using LangChain☆17Updated last year
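- Pinyin-to-Chinese-character conversion for words and sentences, pinyin segmentation, pinyin completion, and Chinese input in pygame☆15Updated 5 years ago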
- Information-oriented Metric (IOM)☆11Updated 4 years ago
- This repo performs mnbvc text quality classification using fastText☆16Updated last year
- ☆10Updated last year
- ☆41Updated last year
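- This project extracts key information from medical record files and displays the extracted content in a streamlit front end. Currently supported file types: images, PDF files, and Word files☆23Updated 2 years ago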
- ☆26Updated 5 months ago
- ☆58Updated this week
- lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.☆12Updated 4 years ago
- Evaluation for AI apps and agents☆36Updated last year
- ⛏️This is the storage for my Slides, Reports and Papers.☆11Updated 5 months ago
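- Based on an e-commerce shopping-guide bot: natural language understanding (NLU), text error correction, and word-sense disambiguation☆12Updated 4 years ago
- Architecture diagram of an AGI module library☆75Updated last year
- LinChance Fine-tuning System: a model fine-tuning Web UI built with Streamlit and LLaMA-Factory☆14Updated last year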