PyTorch implementation of GRPO.
☆15Apr 21, 2025Updated 11 months ago
Alternatives and similar repositories for GRPO-PyTorch
Users that are interested in GRPO-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rucio K8s tutorial☆11Sep 26, 2025Updated 6 months ago
- Aho-Corasick automation for large-scale multi-pattern matching. Available for C/C++, Python, and Java on Linux, macOS, and Windows.☆14Oct 29, 2024Updated last year
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp…☆16Mar 11, 2025Updated last year
- ☆16Mar 18, 2025Updated last year
- Quickly uploads an image/audio combination to Youtube☆27Jun 10, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Geographical Graph Attention Networks: Spatial Deep Learning Models for Spatial Prediction and Exploratory Spatial Data Analysis☆17Jul 28, 2025Updated 7 months ago
- A direct Convolution Neural Network implementation in pure C++, with MNIST dataset.☆13Feb 11, 2015Updated 11 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a multimodal model for obtaining images and text similarities and…☆14Jan 25, 2022Updated 4 years ago
- Electric Vehicle Market Segmentation Analysis in India☆16May 14, 2025Updated 10 months ago
- Classify documents using Python based on SVM and TF-IDF.☆15Nov 19, 2019Updated 6 years ago
- Our solution to ML Talent Match hackathon☆10Mar 22, 2024Updated 2 years ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Updated this week
- Image captioning using CNN and RNN☆11Mar 24, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Word2Vec in pure Python☆19Jun 13, 2018Updated 7 years ago
- ☆25May 13, 2019Updated 6 years ago
- A simple implementation of a convolutional neural network from scratch in C++☆14Jul 15, 2019Updated 6 years ago
- ☆40Jul 1, 2025Updated 8 months ago
- Solving Problems with Applied Deep Learning (ITS-530)☆27Feb 5, 2026Updated last month
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Jul 29, 2023Updated 2 years ago
- Make triton easier☆50Jun 12, 2024Updated last year
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- THUD Dataset Overview☆27May 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repo would give multi-task keypoint detect code based yolov8. The landmarks or keypoints with different classes and numbers can be …☆12Feb 28, 2023Updated 3 years ago
- Anchor Assignment and Sampling Heuristics in Deep Object Detection: A Review☆11Aug 2, 2022Updated 3 years ago
- ☆10Dec 11, 2015Updated 10 years ago
- Генерация расписания занятий для студентов ИТМО программы Искусственный интеллект.☆21Sep 17, 2024Updated last year
- E-Commerce Website A/B testing: Recommend which of two landing pages to keep based on A/B testing☆24Dec 21, 2017Updated 8 years ago
- ☆13Nov 25, 2022Updated 3 years ago
- Codes for SHINE published in EMNLP 2021.☆41Jul 1, 2022Updated 3 years ago
- ☆10Apr 22, 2025Updated 11 months ago
- TurtleBot3 ROS Packages☆20Jan 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 🎭 Проекты, которые я выполняю самостоятельно. Датасеты беру из открытых источников.☆24Nov 29, 2022Updated 3 years ago
- Verilog Snippets for partial fulfilment of CS-F342 Computer Architecture,BITS Pilani☆17Nov 17, 2017Updated 8 years ago
- NetRunner is a simple neural network visualizer created in Unity3D.☆24Feb 19, 2025Updated last year
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated last month
- My personal resume compiled with LaTeX☆27Apr 13, 2025Updated 11 months ago
- A fast implementation of the Goemans-Williamson scheme for the prize-collecting Steiner tree / forest problem.☆65Oct 17, 2024Updated last year