☆23Apr 22, 2025Updated 10 months ago
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
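- A record of two years of course materials from a software engineering program at a third-tier undergraduate college. Onward, young man!☆10Dec 10, 2022Updated 3 years ago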
- Basic floating-point components for RISC-V processors☆11Aug 13, 2017Updated 8 years ago
- GLCONet: Learning Multisource Perception Representation for Camouflaged Object Detection (TNNLS, 2024)☆16Jul 10, 2025Updated 7 months ago
- ☆10Jul 11, 2022Updated 3 years ago
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- ☆11Oct 24, 2024Updated last year
- Annotations written while studying the LevelDB source code; continuously updated.☆12Feb 23, 2023Updated 3 years ago
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- This is an unofficial implementation of the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated last year
- ✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"☆16Nov 8, 2024Updated last year
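- Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification-layer mode and an LLM + LoRA + classification-layer mode; uses transformers for modular design and training, making it easy to adjust or replace components as needed.☆19Sep 1, 2025Updated 6 months ago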
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 8 months ago
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆16Sep 15, 2023Updated 2 years ago
- Code for the SofT-GRPO algorithm on the LLM soft-thinking reasoning pattern.☆41Jan 2, 2026Updated 2 months ago
- ☆17Jul 10, 2023Updated 2 years ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆15Sep 23, 2024Updated last year
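- Inspired by self-instruct: beyond general-purpose LLMs, small domain-specific LLMs can be built for customized results, using GPT to generate questions and answers as training data.☆18May 12, 2023Updated 2 years ago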
- [NeurIPS 2024] Official implementation of "ClavaDDPM:Multi-relational Data Synthesis with Cluster-guided Diffusion Models"☆18Oct 27, 2024Updated last year
- ☆14Oct 31, 2022Updated 3 years ago
- The source code and models for our paper PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction☆14Jan 30, 2023Updated 3 years ago
- Digital logic course materials☆12Dec 28, 2017Updated 8 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated last year
- My solutions for the labs of MIT-6.004-computation-Construction spring 20. The materials of the course can be found here: //6004.mit.edu/web/spri…☆14Mar 29, 2020Updated 5 years ago
- Time-HD-Lib: A Library for High-Dimensional Time Series Forecasting☆50Jan 26, 2026Updated last month
- This is the official GitHub repo for the in-review paper "MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Seg…☆25May 6, 2025Updated 10 months ago
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 10 months ago
- The code of EGMA framework.☆19Jun 14, 2024Updated last year
- ☆22May 22, 2024Updated last year
- ☆18Dec 2, 2017Updated 8 years ago
- 🔥 [ECCV2024] Official Implementation of "Learning Camouflaged Object Detection from Noisy Pseudo Label"☆22Dec 16, 2025Updated 2 months ago
- Large-model question answering based on an external knowledge base☆24Mar 6, 2024Updated 2 years ago
- ☆27Dec 22, 2022Updated 3 years ago
- Pseudo-labeling for tabular data☆24Feb 11, 2026Updated 3 weeks ago
- Official code for the paper "Meta Soft Label Generation for Noisy Labels" accepted at ICPR 2020.☆21Oct 12, 2020Updated 5 years ago
- ☆23Nov 29, 2024Updated last year
- Camouflaged Object Detection☆22Jun 27, 2025Updated 8 months ago
- GNC is Not C. It is intended to be a better and more effective C language.☆19Jan 15, 2022Updated 4 years ago