☆13Mar 16, 2025Updated last year
Alternatives and similar repositories for GRPO-R1
Users that are interested in GRPO-R1 are comparing it to the libraries listed below
Sorting:
- 2 players chinese chess game with PyGame☆14Oct 25, 2022Updated 3 years ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Jun 16, 2025Updated 9 months ago
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated 2 months ago
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 10 months ago
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆34Jul 7, 2024Updated last year
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆37Aug 5, 2024Updated last year
- 放弃幻想、时刻准备、随时面试☆14Dec 17, 2025Updated 3 months ago
- Real-time envrionment reconstruction based on ORB_SLAM2 with XTION (RGBD sensor)☆35May 28, 2016Updated 9 years ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- 大模型API企业网关,公司内部API管理,分发聚和系统,支持将多种大模型转换成统一的OpenAI兼容接口,尤其对国内开源模型deepseek,qwen,kimi,glm提供特别支持 可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发),长期更新,支…☆40Sep 12, 2025Updated 6 months ago
- ☆13Feb 21, 2025Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- BLOOM 模型的指令微调☆24Jun 15, 2023Updated 2 years ago
- Mod Menu for non-jailbroken devices☆26Nov 12, 2024Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆118Feb 19, 2024Updated 2 years ago
- ☆12Sep 29, 2024Updated last year
- It is a simple demo of chatDB workflow in dify.☆24Dec 7, 2024Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting☆40Mar 2, 2025Updated last year
- Dify DSL collection收集Dify工作流文件DSL,这里很多文件并不是本人原创,而是收集而来,感谢原作者。目前我是初学github,后面会加入大量原创内容☆26Jul 13, 2025Updated 8 months ago
- Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition☆18Oct 6, 2024Updated last year
- A query predictor pipeline and service to predict resource usages of Presto queries☆15May 2, 2023Updated 2 years ago
- ☆11Updated this week
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆22Apr 26, 2025Updated 10 months ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆24Feb 10, 2025Updated last year
- OpenHIS医院系统(信创版)集十大核心模块于一体,涵盖目录管理、基础数据配置、个性化设置、门诊/住院全流程管理、药房药库智能管控、精细化耗材管理、财务核算体系、医保合规对接及多维报表分析等功能模块,共计372项标准化功能。☆15Feb 5, 2026Updated last month
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- TianGong-AI-Unstructure☆71Feb 4, 2026Updated last month
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024☆23Jul 25, 2024Updated last year
- AI写作小工具方案:让2个智能体合作写出真正可用的图文并茂的帖子(微信公众号,小红书,博客)。1,写作智能体,2,知识库智能体。☆21Jun 8, 2025Updated 9 months ago
- KDD2024-WhoIsWho-Top3☆16Jun 17, 2024Updated last year
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated last year
- 使用bert进行中文方面级情感识别。☆25Jun 26, 2023Updated 2 years ago
- Prompt Injection Attacks against GPT-4, Gemini, Azure, Azure with Jailbreak☆29Oct 8, 2024Updated last year
- QGEval: A Benchmark for Question Generation Evaluation☆19Nov 7, 2024Updated last year
- 为 AstrBot 提供一种 Deepresearch 方案☆29Aug 5, 2025Updated 7 months ago