liuchen6667 / qwen_grpo_gsm8kView external linksLinks
简单易理解的代码,用于在qwen上使用grpo加强数学能力
☆48May 14, 2025Updated 9 months ago
Alternatives and similar repositories for qwen_grpo_gsm8k
Users that are interested in qwen_grpo_gsm8k are comparing it to the libraries listed below
Sorting:
- Agent that converts natural language queries into SQL and provides response and query created☆55May 28, 2025Updated 8 months ago
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- The AI Algorithm Proposed by The Master's Degree Thesis of Shengyuan Yan of Wuhan University School of Computer Science☆14May 11, 2022Updated 3 years ago
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆15Jan 7, 2026Updated last month
- RLCar Gazebo v2☆12Jun 28, 2024Updated last year
- ☆13May 11, 2022Updated 3 years ago
- Documentation at☆14Mar 27, 2025Updated 10 months ago
- ☆11Aug 9, 2018Updated 7 years ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Jan 26, 2026Updated 3 weeks ago
- sgbm立体匹配算法以及生成点云☆12Jan 29, 2021Updated 5 years ago
- Autonomous navigation simulation of an agricultural robot during soil fertilization in open fields using ROS and Gazebo.☆10Apr 8, 2025Updated 10 months ago
- Final Project of ME5413 Autonomous Mobile Robotics @ NUS☆10Oct 13, 2023Updated 2 years ago
- 🔥(ECCV 2024 Oral) RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation☆47Sep 2, 2025Updated 5 months ago
- ☆10Apr 8, 2024Updated last year
- The codes for ECCV'22: Learning to Train a Point Cloud Reconstruction Network without Matching☆10Nov 16, 2022Updated 3 years ago
- HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation☆13Sep 13, 2024Updated last year
- 这个仓库用于在ros2 humble中整合九轴IMU和车轮Odom的数据,其中的参数是经过多次实验后自行调整的。☆14Jan 25, 2025Updated last year
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 3 months ago
- RAL-2024, A key-frame based LiDAR global localization method.☆10Mar 23, 2024Updated last year
- This ROS node includes C++ implementations for extracting OpenStreetMaps(OSM), performing planning using latitude/longitude or 2D relativ…☆14Feb 22, 2025Updated 11 months ago
- Localize the car in a static map with a particle filter.☆12Apr 2, 2025Updated 10 months ago
- Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The proje…☆10Jul 20, 2023Updated 2 years ago
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- Semantic Lidar Odometry☆12May 1, 2020Updated 5 years ago
- Extended Implementation of FastLGS☆16Dec 17, 2024Updated last year
- [ECCV 2024] Online Continuous Generalized Category Discovery☆14Oct 6, 2024Updated last year
- ☆10Sep 23, 2021Updated 4 years ago
- Generalizable Stable Points Segmentation for 3D LiDAR Scan-to-Map Long-Term Localization☆17Jun 3, 2024Updated last year
- Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)☆10Dec 15, 2024Updated last year
- [WACV 2025-Oral Presentation] Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging☆12Mar 31, 2025Updated 10 months ago
- 基于乐鑫 ESP32/ESP32-S2/S3 开发的小型无人机解决方案、基于北京理工大学自动化学院OLDX多旋翼开发平台(OLDX-FC)、基于正点原子ATK-F405☆21Apr 22, 2023Updated 2 years ago
- [ECCV 2024] R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection☆18Jan 6, 2025Updated last year
- Deep Introspective SLAM: Deep Reinforcement Learning based Approach to Avoid Tracking Failure in Visual SLAM☆11Jul 31, 2021Updated 4 years ago
- 使用ROS2+RL 的循迹小车☆12Aug 30, 2024Updated last year
- Source code to execute signal injection attacks against CCD image sensors☆11Aug 26, 2021Updated 4 years ago
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-…☆16Mar 23, 2025Updated 10 months ago
- ☆12Jun 27, 2022Updated 3 years ago
- Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression☆12Mar 17, 2025Updated 11 months ago