cxcscmu / Craw4LLMLinks
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
☆650Updated 11 months ago
Alternatives and similar repositories for Craw4LLM
Users that are interested in Craw4LLM are comparing it to the libraries listed below
Sorting:
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆512Updated 8 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆292Updated 6 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more☆381Updated last month
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆489Updated 5 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆294Updated 8 months ago
- ☆301Updated last year
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆328Updated last year
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆359Updated 6 months ago
- ☆247Updated 8 months ago
- LLM-Powered Semi-Structured Table Question Answering☆292Updated last week
- ☆872Updated 3 months ago
- PatentWriterAgent Demo☆441Updated 3 months ago
- ☆261Updated 9 months ago
- python package to parse pdfs with different parsers☆245Updated 4 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated last year
- Semantic Search on Wikipedia with Upstash Vector☆473Updated last month
- A General-Purpose AI Agent ✨☆411Updated this week
- A complete 7-layer intelligent memory system for AI Agents with multi-modal memory fusion also support context_engineering☆136Updated 7 months ago
- ☆606Updated last year
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆363Updated last week
- Your first AI prompt engineer☆411Updated 7 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆385Updated 7 months ago
- ☆522Updated 10 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆165Updated 11 months ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆263Updated 11 months ago
- A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SER…☆542Updated 10 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆626Updated 3 weeks ago
- Mermaid AI Diagram Generator☆41Updated 8 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)☆420Updated this week
- ☆79Updated 9 months ago