Decoupled Gradient Policy Optimization (DGPO) - Official Implementation
☆37Mar 5, 2026Updated this week
Alternatives and similar repositories for DGPO-RL
Users that are interested in DGPO-RL are comparing it to the libraries listed below
Sorting:
- Mass-Adaptive Soft Policy Optimization (MASPO) - Official Implementation☆45Feb 23, 2026Updated last week
- 喂饭级MCP开发教程,让你从0到1开发调试一个MCP服务。☆24Jun 26, 2025Updated 8 months ago
- ☆10Aug 19, 2021Updated 4 years ago
- This repository provides the PyTorch implementation of the paper: Anomaly Discovery in Semantic Segmentation via Distillation Comparison …☆15Apr 18, 2023Updated 2 years ago
- 一个基于Langchain和ChatGPT的AI图表生成与数据可视化平台☆18Dec 10, 2025Updated 2 months ago
- a project about Personalization recommendation(UserCF,itemCF,LFM,Personal Rank)☆18Sep 20, 2020Updated 5 years ago
- ☆28Feb 7, 2025Updated last year
- build a neural machine translator using seq2seq, attention mechanism.☆19Oct 8, 2022Updated 3 years ago
- StoneSkipping model for detecting Chinese camouflaged spam☆20May 8, 2020Updated 5 years ago
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Jan 11, 2018Updated 8 years ago
- 华中科技大学课程作业:华中科技大学电信系微机原理实验代码☆20May 16, 2021Updated 4 years ago
- Project website of TE141K.☆17Mar 24, 2020Updated 5 years ago
- [WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"☆27Apr 6, 2021Updated 4 years ago
- 北大软微 2020级CS 研一期末考试 算法☆24Jan 24, 2021Updated 5 years ago
- Focal CTC for End-To-End OMR task with Class Imbalance, SangCTC (Part I)☆22Updated this week
- Scoreboard for ONNX Backend Compatibility☆29Jan 24, 2026Updated last month
- ☆26Feb 5, 2024Updated 2 years ago
- ☆24Mar 23, 2018Updated 7 years ago
- AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead …☆50Oct 14, 2025Updated 4 months ago
- 北京大学软件与微电子学院关键软件方向课程资料、作业等汇总(操作系统与虚拟化、深度学习技术与应用等)☆34Sep 8, 2024Updated last year
- ☆31Jun 18, 2021Updated 4 years ago
- MXNet implementation of CapsNet☆29Nov 29, 2017Updated 8 years ago
- Client-side poster maker using HTML5, CSS3, and Angular☆33Jan 3, 2023Updated 3 years ago
- This is a repo including all projects and labs in my Artificial Intelligence course (DATA130008.01) in School of Data Science @Fudan Univ…☆30Jul 1, 2018Updated 7 years ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆56Apr 13, 2025Updated 10 months ago
- ☆42Jun 7, 2023Updated 2 years ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆49Mar 18, 2025Updated 11 months ago
- 🦜通过演示 LangChain 最具有 代表性的应用范例,带你快速上手 LangChain 各个使用场景。(包含完整代码和数据集)☆53Nov 14, 2023Updated 2 years ago
- ☆46May 6, 2021Updated 4 years ago
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Oct 3, 2023Updated 2 years ago
- ☆85May 23, 2025Updated 9 months ago
- A list of awesome papers on AI-generated Image Detection.☆102Oct 29, 2025Updated 4 months ago
- PyTorch implementation of EigenGAN☆65May 26, 2021Updated 4 years ago
- [AAAI-2020, Oral] Diversity Transfer Network for Few-Shot Learning☆62Jan 4, 2021Updated 5 years ago
- The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Sep 28, 2019Updated 6 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- Official implementation of our ICML 2023 paper "LinSATNet: The Positive Linear Satisfiability Neural Networks".☆82Apr 12, 2024Updated last year
- A tutorial on the PyTorch-based ocropus components.☆73Apr 18, 2020Updated 5 years ago
- ☆73Oct 29, 2020Updated 5 years ago