☆13Sep 12, 2024Updated last year
Alternatives and similar repositories for GRPO-bandits
Users that are interested in GRPO-bandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆71Jul 28, 2024Updated last year
- Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"☆14May 17, 2026Updated last week
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Personalized Story Evaluation Model☆17Nov 27, 2023Updated 2 years ago
- Orchestrate sandboxed agents that run in the cloud while you work. Fully open source☆65Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- High population city simulation in Unity ECS☆12Jul 20, 2018Updated 7 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- Data Benchmarking☆25May 24, 2024Updated 2 years ago
- A flexible terminal grid for multi-agent AI workflows☆38May 18, 2026Updated last week
- An open source implementation of a vehicle controller using Unity's ECS.☆15Mar 19, 2019Updated 7 years ago
- appstore and google play ranking and review crawler☆11Jan 22, 2014Updated 12 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Aug 21, 2024Updated last year
- ☆20Apr 7, 2024Updated 2 years ago
- Resource Based Weapon System template for Godot 4.3. Tutorial included!☆12Jul 26, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A pixel-style rougelike RPG dungeon game in unity3d. Weapon Forging will be the primary element .☆14Dec 19, 2016Updated 9 years ago
- Guified is a GUI library for LÖVE (Love2D) that simplifies window management and UI element creation. It allows developers to create inte…☆16Apr 15, 2026Updated last month
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆54Jan 25, 2026Updated 4 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- Personalized Graph-based Retrieval for LLMs Benchmark☆34Feb 16, 2025Updated last year
- This high-frequency trading (HFT) bot is designed for low-latency trading in the EUR/USD currency pair. Utilizing advanced C++ techniques…☆14Jul 9, 2024Updated last year
- Datastructure for data science☆23Apr 12, 2024Updated 2 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- Open Source Virtual Assistant Framework☆13Sep 4, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Nautical Object Detection using Detectron2 for Instance Segmentation Trained on Nautical Objects (buoys, ships and land).☆22Apr 5, 2022Updated 4 years ago
- Boost.Asio C++ Network Programming Cookbook by Dmytro Radchuk☆15May 14, 2017Updated 9 years ago
- Order Book Imbalance trading strategy☆11Nov 21, 2022Updated 3 years ago
- [CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA☆28Nov 25, 2025Updated 6 months ago
- Quality Shapes Extraction from very large Knowledge Graphs☆13Nov 15, 2025Updated 6 months ago
- ☆10Oct 30, 2023Updated 2 years ago
- Script for trade arbitrage opportunities between European-style options and Perpetual futures, with notifications in telegram☆11Jun 10, 2023Updated 2 years ago
- Alpaca-based Order Book Inbalace Algorithm.☆12Jul 23, 2020Updated 5 years ago
- Algo options trading using machine learning.☆15Jul 16, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Personalized knowledge graph summarization based on historical queries☆14Jun 17, 2020Updated 5 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.☆16Nov 7, 2022Updated 3 years ago
- Lattice // Salt // Facet☆92Mar 2, 2026Updated 2 months ago
- 还是要多练☆12Jul 31, 2020Updated 5 years ago
- ☆12Oct 17, 2022Updated 3 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago
- Create Meshes from Depth Maps in Unity☆10Feb 13, 2023Updated 3 years ago