简单易理解的代码,用于在qwen上使用grpo加强数学能力
☆54May 14, 2025Updated 11 months ago
Alternatives and similar repositories for qwen_grpo_gsm8k
Users that are interested in qwen_grpo_gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The AI Algorithm Proposed by The Master's Degree Thesis of Shengyuan Yan of Wuhan University School of Computer Science☆14May 11, 2022Updated 3 years ago
- Source code, datasets and models of the paper "Efficient White-box Fairness Testing through Gradient Search" by Lingfeng Zhang, Yueling Z…☆11Jul 24, 2021Updated 4 years ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆53Jan 25, 2026Updated 3 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- An algebra for Matrix Expressions written in Maude☆24Oct 23, 2012Updated 13 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆24Sep 2, 2023Updated 2 years ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- White-box Fairness Testing through Adversarial Sampling☆14Apr 16, 2021Updated 5 years ago
- ☆48Aug 31, 2025Updated 8 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 10 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆33May 9, 2025Updated 11 months ago
- Official TensorFlow implementation of Federated Learning of Generative Image Priors for MRI Reconstruction (FedGIMP)☆15Apr 3, 2022Updated 4 years ago
- 对llava官方代码的一些学习笔记☆29Oct 11, 2024Updated last year
- 2025ICASSP☆16Jun 23, 2025Updated 10 months ago
- Agent that converts natural language queries into SQL and provides response and query created☆62May 28, 2025Updated 11 months ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- Collection of papers, tools, datasets for fairness of LLM☆18Oct 7, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).☆12Feb 9, 2025Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- 恋上算法,Java版算法面试题解大全集☆18May 17, 2020Updated 5 years ago
- Model for SCALE-Net: Scalable Vehicle Trajectory Prediction Network under Random Number of Interacting Vehicles via Edge-enhanced Graph C…☆12Apr 20, 2020Updated 6 years ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆28Feb 27, 2025Updated last year
- ☆22Jul 16, 2024Updated last year
- Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICL…☆29May 12, 2025Updated 11 months ago
- ☆16Nov 9, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- [Official] NeurIPS 2023, "Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection"☆21Jan 3, 2024Updated 2 years ago
- a simple repo for some references about thermal infrared object detection☆16Jul 31, 2022Updated 3 years ago
- ☆14Jun 3, 2022Updated 3 years ago
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- Collection of latest papers and materials in the area of RLVR!☆95Updated this week
- MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception☆30Sep 15, 2025Updated 7 months ago