☆13Sep 12, 2024Updated last year
Alternatives and similar repositories for GRPO-bandits
Users that are interested in GRPO-bandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆71Jul 28, 2024Updated last year
- Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"☆14Apr 25, 2025Updated 11 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Orchestrate sandboxed agents that run in the cloud while you work. Fully open source☆52Updated this week
- Personalized Story Evaluation Model☆18Nov 27, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- High population city simulation in Unity ECS☆12Jul 20, 2018Updated 7 years ago
- A flexible terminal grid for multi-agent AI workflows☆30Updated this week
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- Data Benchmarking☆23May 24, 2024Updated last year
- An open source implementation of a vehicle controller using Unity's ECS.☆15Mar 19, 2019Updated 7 years ago
- appstore and google play ranking and review crawler☆11Jan 22, 2014Updated 12 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Aug 21, 2024Updated last year
- ☆20Apr 7, 2024Updated 2 years ago
- Resource Based Weapon System template for Godot 4.3. Tutorial included!☆12Jul 26, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A pixel-style rougelike RPG dungeon game in unity3d. Weapon Forging will be the primary element .☆14Dec 19, 2016Updated 9 years ago
- Guified is a GUI library for LÖVE (Love2D) that simplifies window management and UI element creation. It allows developers to create inte…☆16Feb 28, 2026Updated last month
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆50Jan 25, 2026Updated 2 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- Personalized Graph-based Retrieval for LLMs Benchmark☆34Feb 16, 2025Updated last year
- This high-frequency trading (HFT) bot is designed for low-latency trading in the EUR/USD currency pair. Utilizing advanced C++ techniques…☆13Jul 9, 2024Updated last year
- Datastructure for data science☆23Apr 12, 2024Updated 2 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- Open Source Virtual Assistant Framework☆13Sep 4, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Nautical Object Detection using Detectron2 for Instance Segmentation Trained on Nautical Objects (buoys, ships and land).☆21Apr 5, 2022Updated 4 years ago
- A programming language for parallel and asynchronous computation that makes it easy to orchestrate AI agents☆73Apr 10, 2026Updated last week
- Boost.Asio C++ Network Programming Cookbook by Dmytro Radchuk☆15May 14, 2017Updated 8 years ago
- Order Book Imbalance trading strategy☆10Nov 21, 2022Updated 3 years ago
- Quality Shapes Extraction from very large Knowledge Graphs☆13Nov 15, 2025Updated 5 months ago
- ☆10Oct 30, 2023Updated 2 years ago
- Script for trade arbitrage opportunities between European-style options and Perpetual futures, with notifications in telegram☆11Jun 10, 2023Updated 2 years ago
- Alpaca-based Order Book Inbalace Algorithm.☆12Jul 23, 2020Updated 5 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆11Aug 3, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Algo options trading using machine learning.☆15Jul 16, 2021Updated 4 years ago
- Personalized knowledge graph summarization based on historical queries☆14Jun 17, 2020Updated 5 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.☆16Nov 7, 2022Updated 3 years ago
- 还是要多练☆12Jul 31, 2020Updated 5 years ago
- Lattice // Salt // Facet☆78Mar 2, 2026Updated last month
- ☆12Oct 17, 2022Updated 3 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago