cxcscmu / Craw4LLM
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
☆631 Updated 5 months ago
Alternatives and similar repositories for Craw4LLM
Users who are interested in Craw4LLM are comparing it to the libraries listed below.
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths ☆493 Updated 2 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization" ☆282 Updated 6 months ago
- A simple agent framework that's capable of browser use + MCP + auto instrument + plan + deep research + more ☆302 Updated 2 months ago
- ☆257 Updated 11 months ago
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆454 Updated 3 months ago
- OpenAI DeepResearch alternative: an AI-driven research system that performs comprehensive, iterative research on any topic using multiple… ☆619 Updated last month
- Mentis: A powerful multi-agent orchestration framework built on LangGraph. ☆266 Updated 2 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what… ☆317 Updated 5 months ago
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c… ☆332 Updated this week
- MultiAgentPPT is an intelligent presentation generation system that integrates the A2A (Agent2Agent) + MCP (Model Context Protocol) + ADK (Agent Development Kit) architecture, supporting, via multi-agent collaboration and streaming concurrency mechanisms… ☆953 Updated this week
- ☆232 Updated last month
- Fogsight is an AI agent and animation engine powered by Large Language Models. ☆758 Updated last week
- 🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you… ☆241 Updated 8 months ago
- ☆504 Updated 5 months ago
- A General-Purpose AI Agent ✨ ☆385 Updated last week
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs ☆457 Updated 6 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi… ☆368 Updated 3 weeks ago
- ☆585 Updated 9 months ago
- Lemon AI is the first Full-stack, Open-source, Agentic AI framework, offering a fully local alternative to platforms like Manus & Genspar… ☆579 Updated 2 weeks ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel… ☆121 Updated 5 months ago
- Your first AI prompt engineer ☆400 Updated last month
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s… ☆81 Updated 7 months ago
- ☆252 Updated 2 months ago
- Query and Summarize your chat messages. ☆1,008 Updated 7 months ago
- AI agent that is compatible with multiple LLM models ☆158 Updated this week
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content ☆155 Updated 5 months ago
- Python package to parse PDFs with different parsers ☆199 Updated 7 months ago
- Recursive RAG with R1 reasoning ☆325 Updated 2 months ago
- Large Language Model in Action ☆334 Updated 6 months ago
- Mermaid AI Diagram Generator ☆41 Updated 2 months ago