kyegomez/OpenR1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/OpenR1)

kyegomez / OpenR1

An open source implementation of R1

☆30

Alternatives and similar repositories for OpenR1

Users that are interested in OpenR1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

The-Swarm-Corporation / Brainwave
View on GitHub
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14Oct 6, 2025Updated 9 months ago
The-Swarm-Corporation / Mamba-R1
View on GitHub
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25Oct 13, 2025Updated 9 months ago
swyang50066 / rl-stock-trading
View on GitHub
WATERMELON: Multi-Agent Reinforcement Learning Based Algorithmic Stock Trading System with GUI Application
☆18Sep 8, 2022Updated 3 years ago
andrew-silva / mlx-rlhf
View on GitHub
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
☆37Jun 21, 2024Updated 2 years ago
ryanhlewis / GPT-Auto-Scraper
View on GitHub
A GPT-powered AI auto scraper for websites. AI Web Scraping made easy.
☆13Jun 26, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kyegomez / OpenStrawberry
View on GitHub
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆30Updated this week
sanowl / CoRAG
View on GitHub
this is based on the paper Chain-of-Retrieval Augmented Generation
☆15Mar 29, 2025Updated last year
AShar97 / ML-RL-in-Finance
View on GitHub
Machine Learning and Reinforcement Learning in Finance Specialization (MOOC) Assignments
☆12Nov 4, 2021Updated 4 years ago
HuazhangHu / PortScan
View on GitHub
基于Flask的TCP/UDP协议端口与服务自动化扫描
☆10Aug 9, 2021Updated 4 years ago
shenduldh / CosyVoice-Lightning
View on GitHub
Lightning-responsive CosyVoice streaming API based on FastAPI.
☆28Apr 27, 2026Updated 2 months ago
senyka0 / binance-options-arbitrage
View on GitHub
Script for trade arbitrage opportunities between European-style options and Perpetual futures, with notifications in telegram
☆11Jun 10, 2023Updated 3 years ago
leimao / ONNX-Python-Examples
View on GitHub
ONNX Python Examples
☆16Sep 13, 2022Updated 3 years ago
edenartlab / flux-trainer
View on GitHub
Eden Flux LoRA trainer and full-finetuning
☆23Mar 21, 2025Updated last year
keinsell / Hftish
View on GitHub
Alpaca-based Order Book Inbalace Algorithm.
☆12Jul 23, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lemurproject / ClueWeb22
View on GitHub
☆17Dec 11, 2024Updated last year
eigenpi / VHDL-Examples-from-Pong-Chu-Book
View on GitHub
This repository contains all the needed source files for several examples from Pong Chu's book: "Pong P. Chu, FPGA Prototyping by VHDL Ex…
☆11Apr 2, 2022Updated 4 years ago
phymhan / S2D2
View on GitHub
☆16Jun 17, 2026Updated last month
The-Swarm-Corporation / Brain2Qwerty
View on GitHub
An implementation of the paper Brain2Qwerty that translates brain EEG data into text for reading people's brains. There was no code so we…
☆25Feb 9, 2025Updated last year
olivierjeunen / pessimism-recsys-2021
View on GitHub
Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.
☆11Dec 15, 2022Updated 3 years ago
yliuhz / PMAW
View on GitHub
Source code of the paper "Prediction of Molecular Absorption Wavelength Using Deep Neural Networks"
☆10May 29, 2022Updated 4 years ago
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆19Updated this week
idiom-bytes / flaskGPT
View on GitHub
Waffer-thin FlaskGPT on Vercel.
☆12Jun 1, 2023Updated 3 years ago
lancopku / DCKD
View on GitHub
Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)
☆16Sep 6, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
FudanNLP / Irl_gen
View on GitHub
This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…
☆34Nov 29, 2018Updated 7 years ago
meilerlab / probabilities_design
View on GitHub
☆15Mar 28, 2025Updated last year
kongjiellx / octupus-tool-call
View on GitHub
☆64May 4, 2025Updated last year
Team-SKI / Publications
View on GitHub
Supporting Information of Publications
☆14Mar 24, 2019Updated 7 years ago
inJeans / qnn
View on GitHub
An implementation of a quantum neural network built using pyquil.
☆11Jun 7, 2019Updated 7 years ago
ShaojieJiang / tldr
View on GitHub
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10Aug 11, 2023Updated 2 years ago
epicrispr-biotechnologies / evolutionary_monte_carlo_search
View on GitHub
Implementation of Evolutionary and Metropolis Hastings Monte Carlo for text based (e.g. nucleotide/peptide) sequences
☆13Mar 7, 2024Updated 2 years ago
piyushkhanna7 / VolTAGE
View on GitHub
☆10Nov 16, 2021Updated 4 years ago
davidekim / parametric_barrels
View on GitHub
Utility scripts to generate and evaluate parametrically guided beta barrel protein backbone structures.
☆13Nov 14, 2025Updated 8 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
spawnaga / FlexTrader
View on GitHub
A multi-task deep reinforcement learning model for trading futures contracts using the Interactive Brokers API and TensorFlow
☆15Feb 8, 2023Updated 3 years ago
JasonYuchen / pydolphindb
View on GitHub
A C++ Boosted DolphinDB Python API
☆11Jun 29, 2019Updated 7 years ago
luka-group / FaviComp
View on GitHub
[EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation
☆15Aug 20, 2025Updated 11 months ago
kivenyangming / OpencvSocket
View on GitHub
这是一个使用opencv读取视频并使用socket进行传输视频画面的脚本文件，相较于调用ffmpeg传输节约了90%的数据量
☆11May 14, 2024Updated 2 years ago
HKAIR-Lab / HK-O1aw
View on GitHub
☆43Nov 1, 2024Updated last year
mayunxi / mpp_rtsp_play_QT
View on GitHub
☆11Mar 30, 2020Updated 6 years ago
mnoukhov / async_rlhf
View on GitHub
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
☆68Mar 5, 2026Updated 4 months ago