简单易理解的代码,用于在qwen上使用grpo加强数学能力
☆54May 14, 2025Updated 10 months ago
Alternatives and similar repositories for qwen_grpo_gsm8k
Users that are interested in qwen_grpo_gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The AI Algorithm Proposed by The Master's Degree Thesis of Shengyuan Yan of Wuhan University School of Computer Science☆14May 11, 2022Updated 3 years ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- A Multimodal Detection and Tracking System based on DJI Payload SDK and Mobile SDK.☆18Mar 3, 2024Updated 2 years ago
- ☆16Apr 20, 2018Updated 7 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MSTI☆16Mar 6, 2024Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- The personal migration of the ROS Humble version of ego-planner.☆39Sep 19, 2024Updated last year
- 使用paddlepaddle框架完成对中文类型垃圾邮件进行分类☆13Feb 27, 2022Updated 4 years ago
- ☆17Jul 6, 2023Updated 2 years ago
- ☆11Dec 24, 2024Updated last year
- 2025ICASSP☆16Jun 23, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).☆12Feb 9, 2025Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- Code for the paper "Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews" (ACL 2021)☆18Mar 8, 2022Updated 4 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆28Feb 27, 2025Updated last year
- FunASR安卓端侧离线版本2pass 全模式☆15Sep 4, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- The official codes for paper "Deep hash learning for remote sensing image retrieval"☆21Nov 16, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Oct 19, 2024Updated last year
- [Official] NeurIPS 2023, "Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection"☆21Jan 3, 2024Updated 2 years ago
- a simple repo for some references about thermal infrared object detection☆16Jul 31, 2022Updated 3 years ago
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- 京东/淘宝客服对话数据公开,seq2seq生成模型设计对话系统获第二名☆44Dec 8, 2022Updated 3 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆19Sep 27, 2024Updated last year
- 基于 Android Studio 与 Java 的 Android 端游戏应用,是一个结合 RPG 与 GalGame 模式的解密攻略类游戏, 包含背包系统、地图系统、交易系统、存档系统等。☆21Mar 11, 2024Updated 2 years ago
- ☆25Dec 13, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Repository for the IJGIS paper "Zero-shot urban function inference with street view images through prompting a pre-trained vision-languag…☆27May 25, 2024Updated last year
- A streaming whisper server for on-prem transcription☆23Aug 15, 2024Updated last year
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12May 5, 2020Updated 5 years ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated 2 weeks ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- ☆27Feb 26, 2023Updated 3 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago