☆22Apr 22, 2025Updated 11 months ago
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below.
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- Knowledge graph of classical Chinese poetry☆16Mar 1, 2022Updated 4 years ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 9 months ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- Code for KDD 2025 paper "FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification"☆32Jun 20, 2025Updated 9 months ago
- A small open source 3D agent simulator based on LLM.☆69Dec 1, 2024Updated last year
- ☆31Oct 1, 2023Updated 2 years ago
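- Inspired by self-instruct: beyond general-purpose LLMs, small LLMs for vertical domains can achieve customized results, using questions and answers obtained from GPT as training data☆18May 12, 2023Updated 2 years ago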
- ☆15Apr 4, 2025Updated 11 months ago
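- A low-cost, easy-to-start learning project for multimodal large models. Built on Qwen3-0.6B and CLIP, using the LLaVA architecture and LoRA fine-tuning; training completes in a few hours on a consumer-grade 16 GB GPU☆43Sep 15, 2025Updated 6 months ago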
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- ☆10Jul 11, 2022Updated 3 years ago
- ☆10May 28, 2023Updated 2 years ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆45Aug 7, 2025Updated 7 months ago
- ☆17Jul 10, 2023Updated 2 years ago
- ☆22May 22, 2024Updated last year
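- Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode; uses transformers for modular design and training, making it easy to adjust or replace components as needed.☆20Sep 1, 2025Updated 6 months ago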
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- ✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"☆16Nov 8, 2024Updated last year
- A dataset of network traffic☆21Mar 5, 2021Updated 5 years ago
- Knowledge bases, large language models, medical knowledge base construction, LLM-based knowledge bases☆29Jun 13, 2023Updated 2 years ago
- MetaSearch: an implementation of LLM deep research (deepsearch) functionality☆34Aug 21, 2025Updated 7 months ago
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Oct 13, 2025Updated 5 months ago
- 👽 Large model-based knowledge base Q&A.☆30May 21, 2023Updated 2 years ago
- Large-model question answering based on an external knowledge base☆23Mar 6, 2024Updated 2 years ago
- The source code and models for our paper PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction☆14Jan 30, 2023Updated 3 years ago
- ☆14Mar 1, 2023Updated 3 years ago
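- National first-prize solution for Track 1 of the 2024 Baidu Commercial AI Technology Innovation Competition: large-model-based ad retrieval☆17Feb 23, 2025Updated last year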
- ☆14Oct 31, 2022Updated 3 years ago
- RWKV-RAG Personal Edition☆27Aug 6, 2025Updated 7 months ago
- Source code for the NeurIPS 2023 paper: "CSOT: Curriculum and Structure-Aware Optimal Transport for Learning with Noisy Labels"☆19Dec 11, 2023Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated last year
- Time-HD-Lib: A Library for High-Dimensional Time Series Forecasting☆51Jan 26, 2026Updated 2 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆84Updated this week
- Go Genius Squad☆53Dec 15, 2023Updated 2 years ago
- Official code for the paper "Meta Soft Label Generation for Noisy Labels" accepted at ICPR 2020.☆21Oct 12, 2020Updated 5 years ago