☆22Apr 22, 2025Updated last year
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 11 months ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆13Apr 25, 2024Updated 2 years ago
- A small open source 3D agent simulator based on LLM.☆70Dec 1, 2024Updated last year
- GLCONet: Learning Multisource Perception Representation for Camouflaged Object Detection (TNNLS, 2024)☆16Jul 10, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆32Oct 1, 2023Updated 2 years ago
- ☆15Apr 4, 2025Updated last year
- ☆10Jul 11, 2022Updated 3 years ago
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆17Sep 15, 2023Updated 2 years ago
- 一个低成本、易于上手的多模态大模型学习项目。基于Qwen3-0.6B和CLIP构建,使用LLaVA架构和LoRA微调,在消费级16G显卡上数小时即可完成训练☆50Sep 15, 2025Updated 8 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆50Aug 7, 2025Updated 9 months ago
- ☆22May 22, 2024Updated 2 years ago
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- this is dataset about network traffic☆21Mar 5, 2021Updated 5 years ago
- 知识库、大语言模型、医疗知识库构建、基于大语言模型的知识库☆29Jun 13, 2023Updated 2 years ago
- vocal generation network☆13Mar 21, 2023Updated 3 years ago
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Oct 13, 2025Updated 7 months ago
- 👽 基于大模型的知识库问答 | Large model-based knowledge base Q&A.☆30May 21, 2023Updated 3 years ago
- The code of EGMA framework.☆18Jun 14, 2024Updated last year
- ☆39Nov 20, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The source code and models for our paper PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction☆14Jan 30, 2023Updated 3 years ago
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆16Feb 23, 2025Updated last year
- RWKV-RAG个人版☆27Aug 6, 2025Updated 9 months ago
- LaTeX template for Dalian Maritime University Graduation Thesis☆18May 27, 2013Updated 13 years ago
- 🔥 [ECCV2024] Official Implementation of "Learning Camouflaged Object Detection from Noisy Pseudo Label"☆23Dec 16, 2025Updated 5 months ago
- 复旦大学nlp实验室入门小实验nlp-beginner☆27Jan 22, 2022Updated 4 years ago
- Source code for the NeurIPS 2023 paper: "CSOT: Curriculum and Structure-Aware Optimal Transport for Learning with Noisy Labels"☆19Dec 11, 2023Updated 2 years ago
- This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated 2 years ago
- 一个基于多模态大模型的图表解析器☆44Mar 28, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 离线部署大模型,构建一个可以上传本地知识库进行RAG问答且可以自行调用工具的Agent。☆41Apr 23, 2024Updated 2 years ago
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated last year
- Official code for the paper "Meta Soft Label Generation for Noisy Labels" accepted at ICPR 2020.☆21Oct 12, 2020Updated 5 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- 强化学习常见算法的实现,Q-Learning/DQN/PG/AC/DDPG/PPO/SAC☆26Feb 17, 2022Updated 4 years ago
- deepspeed+trainer简单高效实现多卡微调大模型☆133May 27, 2023Updated 3 years ago