简单易理解的代码,用于在qwen上使用grpo加强数学能力
☆57May 14, 2025Updated last year
Alternatives and similar repositories for qwen_grpo_gsm8k
Users that are interested in qwen_grpo_gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The AI Algorithm Proposed by The Master's Degree Thesis of Shengyuan Yan of Wuhan University School of Computer Science☆14May 11, 2022Updated 4 years ago
- ☆17Dec 21, 2024Updated last year
- A Multimodal Detection and Tracking System based on DJI Payload SDK and Mobile SDK.☆19Mar 3, 2024Updated 2 years ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- ☆24Sep 2, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- White-box Fairness Testing through Adversarial Sampling☆14Apr 16, 2021Updated 5 years ago
- 一个基于 GitHub Actions 的自动化工具,每天早上自动追踪和分析 arXiv 最新论文,并通过邮件发送分析报告。该工具使用 DeepSeek AI 进行论文分析和总结。☆22Jun 20, 2025Updated 11 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- [Official] NeurIPS 2023, "Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection"☆21Jan 3, 2024Updated 2 years ago
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech…☆16Oct 8, 2024Updated last year
- a simple repo for some references about thermal infrared object detection☆16Jul 31, 2022Updated 3 years ago
- Collection of latest papers and materials in the area of RLVR!☆107May 11, 2026Updated last week
- [NeurIPS2023] How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception☆31Jan 20, 2024Updated 2 years ago
- Java面试总结☆19May 11, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12May 5, 2020Updated 6 years ago
- 中文语料:大量人工标注样本,非常有价值 !!!☆11Aug 15, 2019Updated 6 years ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated last month
- Semantic Lidar Odometry☆12May 1, 2020Updated 6 years ago
- Table2answer: Read the database and answer without SQL https://arxiv.org/abs/1902.04260☆14May 11, 2021Updated 5 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- ☆28Apr 26, 2023Updated 3 years ago
- 这是一个利用Spring Cloud,Dubbo,Thrift三个微服务框架整合开发的IM社交系统,并用到了Netty即时通讯技术,Tensorflow深度学习框架与Haar+Adaboost人脸识别技术,每个模块都可以被完整的被拿来直接使用,适合对微服务,即时通信感兴趣的…☆11Nov 16, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于 rasa 1.x 版本搭建的中文天气查询 demo | A simple & micro Chinese Weatherbot based on rasa framework☆12Aug 14, 2019Updated 6 years ago
- Official code of "Discover and Mitigate Unknown Biases with Debiasing Alternate Networks" (ECCV 2022)☆24Feb 15, 2023Updated 3 years ago
- 首届电子商务AI算法大赛TOP2开源代码☆13Aug 31, 2021Updated 4 years ago
- 日期时间实体识别☆11Sep 10, 2020Updated 5 years ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆60Aug 15, 2025Updated 9 months ago
- ☆12May 14, 2026Updated last week
- Selection-based Question Answering☆14Feb 7, 2018Updated 8 years ago
- [RAL 2024] Triplet-Graph: Global Metric Localization Based on Semantic Triplet Graph for Autonomous Vehicles☆10Mar 23, 2024Updated 2 years ago
- 基于乐鑫 ESP32/ESP32-S2/S3 开发的小型无人机解决方案、基于北京理工大学自动化学院OLDX多旋翼开发平台(OLDX-FC)、基于正点原子ATK-F405☆21Apr 22, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Compensate the position of the spinning point cloud frame while the laser scanner is moving.☆11Apr 30, 2019Updated 7 years ago
- ☆44Apr 16, 2026Updated last month
- Deep Introspective SLAM: Deep Reinforcement Learning based Approach to Avoid Tracking Failure in Visual SLAM☆11Jul 31, 2021Updated 4 years ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- 面向智能家居场景的嵌入式 RPC 框架☆14May 5, 2024Updated 2 years ago
- Autonomous navigation simulation of an agricultural robot during soil fertilization in open fields using ROS and Gazebo.☆11Apr 8, 2025Updated last year
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆41Oct 29, 2025Updated 6 months ago