Liwu-di / PaperCrawlerUtil
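A toolkit for fetching links, extracting page elements, extracting files, and more. It also includes ready-made utilities for retrieving papers through Sci-Hub, converting PDF to DOC, translating text, obtaining proxy connections (including fetching proxy links through an API), merging PDF files, extracting selected pages from a PDF, and processing CSV and XLS files.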
☆18Updated last year
Alternatives and similar repositories for PaperCrawlerUtil:
Users that are interested in PaperCrawlerUtil are comparing it to the libraries listed below
- aigc evals☆10Updated last year
- Large-scale exact string matching tool☆17Updated last month
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- A demo of a PDF parser (including OCR and object detection tools)☆34Updated 6 months ago
- A framework for real-time video understanding and interaction through text, audio, image, and video with a large multimodal model…☆23Updated last year
- A survey of large language model training and serving☆37Updated last year
- A minimal LLM sales agent framework for fast deployment and benchmarking of sales agents. Supports OpenAI models, Claude, HuggingFace models, Gem…☆17Updated 7 months ago
- Layout analysis + OCR☆11Updated 3 years ago
- AGM (阿格姆): an AI gene map model that explores the inner workings of AI models and GPT/LLM large models from the perspective of token-weight particles.☆28Updated last year
- ⛏️ Storage for my slides, reports, and papers.☆11Updated 5 months ago
- LinChance Fine-tuning System: a model fine-tuning Web UI built with Streamlit and LLaMA-Factory☆14Updated last year
- Automatically tracks trending GitHub repos, updated daily☆48Updated this week
- This repository mainly collects reading notes on top-conference papers relevant to NLP algorithm engineers (text matching volume)☆13Updated 2 years ago
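- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the 合合 recognition service as initial training data, so that the model better fits a specific scenario of 10,000 images and text recognition accuracy improves.☆9Updated 4 months ago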
- Source code for the official website of the Zotero Chinese community☆10Updated last week
- Python client designed specifically for large-scale requests to the OpenAI interface☆21Updated last year
- share data, prompt data, pretraining data☆36Updated last year
- ☆11Updated 10 months ago
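- Pinyin-to-Chinese-character conversion for words and sentences, pinyin segmentation, pinyin completion, and Chinese input in pygame☆15Updated 5 years ago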
- lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.☆12Updated 4 years ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆65Updated 9 months ago
- QGEval: A Benchmark for Question Generation Evaluation☆14Updated 5 months ago
- autonomous agent with access to a tool library☆34Updated 3 weeks ago
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆113Updated last year
- UnitEval is a benchmarking and evaluation tool for AutoDev Coder.☆12Updated last year
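- TPO is a framework for optimizing LLM output text. Rather than adjusting model parameters directly, it "fine-tunes the model" through iterative feedback and prompt optimization, aligning the model with human preferences during inference to produce better results. This project provides a user-friendly WebUI for loading models, optimizing the base model in real time, and displaying the best results.☆10Updated 2 months ago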
- Fast instruction tuning with Llama2☆11Updated last year
- This repo performs MNBVC text quality classification using fastText☆16Updated last year
- A native-Chinese, level-graded benchmark for coding ability☆13Updated last year
- Reverse-engineered Kimi API☆17Updated 9 months ago