☆22Apr 22, 2025Updated 11 months ago
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 3 months ago
- Basic floating-point components for RISC-V processors☆12Aug 13, 2017Updated 8 years ago
- A Ray-based LLM server compatible with OpenAI API☆12Mar 12, 2024Updated 2 years ago
- ☆11Oct 24, 2024Updated last year
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Code for KDD 2025 paper "FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification"☆33Jun 20, 2025Updated 9 months ago
- A small open source 3D agent simulator based on LLM.☆69Dec 1, 2024Updated last year
- ☆31Oct 1, 2023Updated 2 years ago
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- ☆15Apr 4, 2025Updated last year
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- 数字逻辑课程资料☆12Dec 28, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆17Sep 15, 2023Updated 2 years ago
- ☆10May 28, 2023Updated 2 years ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- My solution for labs of MIT-6.004-computation-Construction spring 20.The materials of the course can be found here//6004.mit.edu/web/spri…☆14Mar 29, 2020Updated 6 years ago
- ☆32Jan 24, 2025Updated last year
- ☆17Jul 10, 2023Updated 2 years ago
- ☆22May 22, 2024Updated last year
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 7 months ago
- ✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"☆16Nov 8, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 知识库、大语言模型、医疗知识库构建、基于大语言模型的知识库☆29Jun 13, 2023Updated 2 years ago
- vocal generation network☆13Mar 21, 2023Updated 3 years ago
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆14Sep 23, 2024Updated last year
- 👽 基于大模型的知识库问答 | Large model-based knowledge base Q&A.☆30May 21, 2023Updated 2 years ago
- ☆23Nov 29, 2024Updated last year
- This is official github repo for InReview paper "MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Seg…☆30May 6, 2025Updated 11 months ago
- 基于外挂知识库的大模型问答☆23Mar 6, 2024Updated 2 years ago
- ☆14Mar 1, 2023Updated 3 years ago
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆16Feb 23, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- LaTeX template for Dalian Maritime University Graduation Thesis☆18May 27, 2013Updated 12 years ago
- RWKV-RAG个人版☆27Aug 6, 2025Updated 8 months ago
- 复旦大学nlp实验室入门小实验nlp-beginner☆27Jan 22, 2022Updated 4 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- 一个基于多模态大模型的图表解析器☆44Mar 28, 2025Updated last year
- Time-HD-Lib: A Library for High-Dimensional Time Series Forecasting☆52Jan 26, 2026Updated 2 months ago
- ☆24Mar 16, 2023Updated 3 years ago