ppo算法实现
☆39Jun 5, 2024Updated last year
Alternatives and similar repositories for RLHF_PPO
Users that are interested in RLHF_PPO are comparing it to the libraries listed below
Sorting:
- 长文本相似度模型☆21Nov 24, 2023Updated 2 years ago
- ☆11Updated this week
- Pytorch DDP Traning Demo☆30Oct 20, 2024Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆37Nov 20, 2024Updated last year
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- ☆26Feb 28, 2026Updated last week
- ☆42Mar 6, 2025Updated last year
- ☆28Dec 4, 2025Updated 3 months ago
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 3 weeks ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- dify 知识库检索工具☆13Apr 3, 2025Updated 11 months ago
- 🤖AI Agents for Financial Trading💰: LLM-Driven Stock Prediction & Investment Recommendation System☆13Apr 14, 2025Updated 10 months ago
- A small framework to benchmark forecasting models via backtesting☆13Nov 25, 2023Updated 2 years ago
- 参考《上海交通大学生存手册》开源☆16Sep 25, 2024Updated last year
- A universal skills runtime framework SDK for building, deploying, and executing modular capabilities across diverse environments.☆27Mar 3, 2026Updated last week
- An SSH plugin for Dify☆13Jan 16, 2026Updated last month
- A Claude Code skill for structured, spec-driven development with phase-by-phase workflow and living documentation☆26Feb 16, 2026Updated 3 weeks ago
- ☆10Dec 29, 2023Updated 2 years ago
- 知予人工智能:从学习者到研究者☆13Jan 20, 2025Updated last year
- A plugin for OpenCode. Make your coding agent learn and grow with every task.☆36Jan 31, 2026Updated last month
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp…☆16Mar 11, 2025Updated 11 months ago
- Use the knowledge graph generated by GraphRAG as the external knowledge base for the Dify workflow.☆21Jun 4, 2025Updated 9 months ago
- ☆28Jun 27, 2025Updated 8 months ago
- ☆12Jun 28, 2024Updated last year
- ☆41Apr 11, 2025Updated 10 months ago
- LangReact 是一个配置化的 Planning Agent 应用开发工具,通过配置、插件,能快速为你的 GPT 应用提供 Planning 功能。☆12Apr 23, 2024Updated last year
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 3 years ago
- This is a fork from Ryan Carson's AI Dev Tasks repository, with some code cleanup and refactoring to enable support for PostgreSQL databa…☆15Sep 8, 2025Updated 6 months ago
- Java implementation for the Agent2Agent Protocol (A2A - https://github.com/google/A2A), enabling interaction between AI agents through a …☆11Apr 21, 2025Updated 10 months ago
- Python Telegraph api.☆15Mar 22, 2025Updated 11 months ago
- Evaluation for AI apps and agent☆44Jan 18, 2024Updated 2 years ago
- A feishu bot daily push arxiv latest articles.☆10Nov 28, 2021Updated 4 years ago
- SQL Query generator using Qwen2-1.5B instruct☆12Jun 12, 2025Updated 8 months ago
- A collection of concise notes on things I learn every day. Covering various topics, from tech to general knowledge. Quick insights for da…☆18Updated this week
- 使用大语言模型自动翻译视频字幕,并采用反思策略优化字幕,最后通过chattts合成语音并合并到原视频中。☆11Aug 1, 2024Updated last year
- ☆11Mar 30, 2025Updated 11 months ago