☆22Apr 22, 2025Updated last year
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 古诗词知识图谱☆16Mar 1, 2022Updated 4 years ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 11 months ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆13Apr 25, 2024Updated 2 years ago
- A small open source 3D agent simulator based on LLM.☆70Dec 1, 2024Updated last year
- ☆14Apr 4, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A lightweight and real Python sandbox supporting a SAFE and FINITE subset of Python☆31Aug 3, 2025Updated 10 months ago
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- ☆10Jul 11, 2022Updated 3 years ago
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆18Sep 15, 2023Updated 2 years ago
- ☆10May 28, 2023Updated 3 years ago
- 一个低成本、易于上手的多模态大模型学习项目。基于Qwen3-0.6B和CLIP构建,使用LLaVA架构和LoRA微调,在消费级16G显卡上数小时即可完成训练☆51Sep 15, 2025Updated 9 months ago
- ☆17Jul 10, 2023Updated 2 years ago
- ☆22May 22, 2024Updated 2 years ago
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- ✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"☆16Nov 8, 2024Updated last year
- 知识库、大语言模型、医疗知识库构建、基于大语言模型的知识库☆29Jun 13, 2023Updated 3 years ago
- vocal generation network☆14Mar 21, 2023Updated 3 years ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆14Sep 23, 2024Updated last year
- 👽 基于大模型的知识库问答 | Large model-based knowledge base Q&A.☆30May 21, 2023Updated 3 years ago
- The source code and models for our paper PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction☆14Jan 30, 2023Updated 3 years ago
- ☆14Mar 1, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆16Feb 23, 2025Updated last year
- [NeurIPS 2024] Official implementation of "ClavaDDPM:Multi-relational Data Synthesis with Cluster-guided Diffusion Models"☆20Oct 27, 2024Updated last year
- ☆14Oct 31, 2022Updated 3 years ago
- RWKV-RAG个人版☆27Aug 6, 2025Updated 10 months ago
- 复旦大学nlp实验室入门小实验nlp-beginner☆27Jan 22, 2022Updated 4 years ago
- Source code for the NeurIPS 2023 paper: "CSOT: Curriculum and Structure-Aware Optimal Transport for Learning with Noisy Labels"☆19Dec 11, 2023Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated 2 years ago
- 一个基于多模态大模型的图表解析器☆44Mar 28, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated last year
- 本项目对ChatGLM3-6B通过多种方式微调,使模型具备落地潜质(包括但不限于客服、聊天、游戏)☆33Mar 11, 2024Updated 2 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- 强化学习常见算法的实现,Q-Learning/DQN/PG/AC/DDPG/PPO/SAC☆26Feb 17, 2022Updated 4 years ago
- deepspeed+trainer简单高效实现多卡微调大模型☆133May 27, 2023Updated 3 years ago
- 基于Phi3模型结构,使用常见的中文预料从零训练的小参数量LLM。包括了tokenizer训练、模型预训练、指令微调和直接偏好优化等流程。☆26Jun 23, 2024Updated last year
- 用最简单的代码带你实现基于大模型的本地知识库问答系统☆34Sep 5, 2023Updated 2 years ago