Liwu-di / PaperCrawlerUtil
一套工具组,包括访问链接, 获取元素,抽取文件等等 也有已经实现好通过scihub获取论文的小工具,还有对于pdf转doc,文本翻译,代理连接获取以及通过api获取代理链接, PDF文件合并,PDF文件截取某些页,CSV,xls文件处理等
☆16Updated 11 months ago
Alternatives and similar repositories for PaperCrawlerUtil:
Users that are interested in PaperCrawlerUtil are comparing it to the libraries listed below
- Large-scale exact string matching tool☆15Updated 2 months ago
- 大语言模型训练和服务调研☆35Updated last year
- Based on the Langchain framework, a retrieval and generative chatbot. 基于langchain实现的检索式和生成式问答☆22Updated 2 weeks ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆11Updated last year
- 基于浏览器端,通过JavaScript的小红书爬虫☆13Updated last year
- aigc evals☆10Updated last year
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆14Updated 4 months ago
- kimi-chat 测试数据☆7Updated last year
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆110Updated last year
- Luann allows you to create a LLM agent,which has complete memory module (long-term memory, short-term memory) and knowledge module(Variou…☆18Updated 3 weeks ago
- MNBVC项目-ShareGPT语料清洗☆12Updated last year
- Question Answering dataset generator of Document Visual in English and Chinese☆24Updated last year
- Evaluation for AI apps and agent☆36Updated last year
- Reasoning by Communicating with Agents☆24Updated 3 months ago
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- Speed up your OpenAI requests by balancing prompts to multiple API keys.☆55Updated last year
- fastertransformer for codegeex model☆64Updated last year
- Python client designed specifically for large-scale requests to the openai interface☆21Updated 11 months ago
- 用于微调LLM的中文指令数据集☆27Updated last year
- ☆42Updated last month
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆23Updated 6 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆78Updated 9 months ago
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆22Updated last year
- this repo is mnbvc text quality classification using fastText☆15Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆51Updated this week
- A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。☆40Updated 3 years ago
- Evaluate the Opinion Leadership of LLMs in the Werewolf Game☆9Updated 5 months ago
- ☆13Updated last year
- ☆27Updated last year