zjrwtx / SFT-data-builder
利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data
☆139Updated 3 months ago
Alternatives and similar repositories for SFT-data-builder:
Users that are interested in SFT-data-builder are comparing it to the libraries listed below
- 顾名思义:手搓的RAG☆120Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆125Updated this week
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 轻松构建智能、具备反思能力、可协作的多模态AI Agent。☆131Updated 2 months ago
- 基于ReAct手搓一个Agent Demo☆115Updated 10 months ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆192Updated 4 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆248Updated last month
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆192Updated 3 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆183Updated 4 months ago
- 🌐 WebWalker: Benchmarking LLMs in Web Traversal☆335Updated 3 weeks ago
- ☆225Updated 9 months ago
- ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。☆548Updated last month
- ☆107Updated 6 months ago
- ☆62Updated 5 months ago
- ☆96Updated 9 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆115Updated 10 months ago
- ✨🦋 illufly 是自我进化的 Agent 框架: 基于自我进化,快速创造价值☆54Updated this week
- A Python Package to Access World-Class Generative Models☆126Updated 8 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆232Updated 3 weeks ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆45Updated last month
- 企业级RAG系统从入门到精通☆334Updated this week
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆174Updated 6 months ago
- ☆405Updated 2 weeks ago
- ☆199Updated 2 months ago
- 我们是第一个完全可商用的角色大模型。☆39Updated 6 months ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated last week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆126Updated this week
- ☆59Updated 11 months ago