Complete Reinforcement Learning Toolkit for Large Language Models!
☆21Aug 2, 2025Updated 10 months ago
Alternatives and similar repositories for Q-Flow
Users that are interested in Q-Flow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 26, 2019Updated 6 years ago
- Edge-weighted online bipartite matching (JACM 2022)☆12Jun 18, 2023Updated 2 years ago
- code submission to NeurIPS2019☆13Aug 9, 2023Updated 2 years ago
- ☆13Jan 22, 2025Updated last year
- Contains the code relative to the paper Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning https://arxiv.org/abs…☆21Mar 3, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- Hybrid Linear UCB Multi-arm Bandit library☆14Oct 5, 2016Updated 9 years ago
- ☆20Nov 3, 2024Updated last year
- cosy voice serverless demo☆10Nov 14, 2024Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- ☆19Apr 19, 2024Updated 2 years ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆38Mar 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆32Dec 11, 2025Updated 6 months ago
- Code for "Evaluating Spatial Understanding of Large Language Models" TMLR 2024.☆16Feb 22, 2024Updated 2 years ago
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆16Feb 14, 2024Updated 2 years ago
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆13Aug 9, 2022Updated 3 years ago
- Official code of the MSF model for GZSSAR (ICIG 2023)☆13Jan 3, 2026Updated 5 months ago
- Fast Approximate Quadratic Assignment for (Brain) Graph Matching☆16Aug 23, 2016Updated 9 years ago
- Codes for NIPS 2019 Paper: Rethinking Kernel Methods for Node Representation Learning on Graphs☆35Feb 20, 2020Updated 6 years ago
- ☆14Apr 1, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- "Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling"☆11Aug 16, 2022Updated 3 years ago
- Codebase of the paper "Aligning Protein Conformation Ensemble Generation with Physical Feedback" (ICML 2025)☆18Jul 6, 2025Updated 11 months ago
- The implementation of "SKT-Hang: Hanging Everyday Objects via Object-Agnostic Semantic Keypoint Trajectory Generation"☆10Jun 2, 2024Updated 2 years ago
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Apr 26, 2026Updated last month
- 仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【问答篇】☆21Mar 22, 2023Updated 3 years ago
- ☆11Apr 10, 2023Updated 3 years ago
- Generate Game Character for animation (SSD)☆36Mar 16, 2025Updated last year
- A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)☆10May 20, 2024Updated 2 years ago
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆19Dec 12, 2024Updated last year
- Geometric Algebra Flow Matching (GAFL) for Protein Backbone Generation☆19May 5, 2026Updated last month
- ☆19Apr 5, 2025Updated last year
- ☆14Aug 21, 2025Updated 9 months ago
- ☆24Jan 30, 2025Updated last year
- A tool for detecting anomalies in time series data☆11Dec 1, 2022Updated 3 years ago
- Getting started docs, examples, tutorials, and use cases.☆12Jun 15, 2021Updated 4 years ago