☆13Sep 12, 2024Updated last year
Alternatives and similar repositories for GRPO-bandits
Users that are interested in GRPO-bandits are comparing it to the libraries listed below.
- ☆71Jul 28, 2024Updated last year
- Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"☆14May 17, 2026Updated last month
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Personalized Story Evaluation Model☆17Nov 27, 2023Updated 2 years ago
- Orchestrate sandboxed agents that run in the cloud while you work. Fully open source☆75Updated this week
- High population city simulation in Unity ECS☆12Jul 20, 2018Updated 7 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated 2 years ago
- Data Benchmarking☆25May 24, 2024Updated 2 years ago
- A flexible terminal grid for multi-agent AI workflows☆39Jun 1, 2026Updated 2 weeks ago
- An open source implementation of a vehicle controller using Unity's ECS.☆15Mar 19, 2019Updated 7 years ago
- App Store and Google Play ranking and review crawler☆11Jan 22, 2014Updated 12 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆27Aug 21, 2024Updated last year
- ☆20Apr 7, 2024Updated 2 years ago
- Resource Based Weapon System template for Godot 4.3. Tutorial included!☆12Jul 26, 2025Updated 10 months ago
- A pixel-style roguelike RPG dungeon game in Unity3D. Weapon forging will be the primary element.☆14Dec 19, 2016Updated 9 years ago
- Open-source AI agent CLI built in Zig. A pure Zig rewrite of OpenClaw — one ~5MB binary, zero Node.js. Agentic loop, SSE streaming, tool …☆37Jun 5, 2026Updated 2 weeks ago
- Guified is a GUI library for LÖVE (Love2D) that simplifies window management and UI element creation. It allows developers to create inte…☆16Apr 15, 2026Updated 2 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆57Jun 8, 2026Updated last week
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- Personalized Graph-based Retrieval for LLMs Benchmark☆34Feb 16, 2025Updated last year
- This high-frequency trading (HFT) bot is designed for low-latency trading in the EUR/USD currency pair. Utilizing advanced C++ techniques…☆14Jul 9, 2024Updated last year
- Datastructure for data science☆23Apr 12, 2024Updated 2 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- Open Source Virtual Assistant Framework☆13Sep 4, 2025Updated 9 months ago
- Nautical Object Detection using Detectron2 for Instance Segmentation Trained on Nautical Objects (buoys, ships and land).☆22Apr 5, 2022Updated 4 years ago
- Boost.Asio C++ Network Programming Cookbook by Dmytro Radchuk☆16May 14, 2017Updated 9 years ago
- Order Book Imbalance trading strategy☆11Nov 21, 2022Updated 3 years ago
- [CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA☆28Nov 25, 2025Updated 6 months ago
- Quality Shapes Extraction from very large Knowledge Graphs☆13Nov 15, 2025Updated 7 months ago
- ☆10Oct 30, 2023Updated 2 years ago
- Script for trade arbitrage opportunities between European-style options and Perpetual futures, with notifications in telegram☆11Jun 10, 2023Updated 3 years ago
- Alpaca-based Order Book Imbalance Algorithm.☆12Jul 23, 2020Updated 5 years ago
- Algo options trading using machine learning.☆15Jul 16, 2021Updated 4 years ago
- Personalized knowledge graph summarization based on historical queries☆14Jun 17, 2020Updated 6 years ago
- Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.☆16Nov 7, 2022Updated 3 years ago
- Still need to practice more☆12Jul 31, 2020Updated 5 years ago
- Lattice // Salt // Facet☆93Jun 2, 2026Updated 2 weeks ago
- ☆12Oct 17, 2022Updated 3 years ago
- Source code for our paper "Pessimistic Decision-Making for Recommender Systems" published at ACM TORS, and RecSys 2021.☆11Dec 15, 2022Updated 3 years ago