This repo has scripts to compare various powerful RL methods
☆33Feb 23, 2026Updated last week
Alternatives and similar repositories for Tiny-RL
Users that are interested in Tiny-RL are comparing it to the libraries listed below
Sorting:
- ☆25Aug 19, 2025Updated 6 months ago
- CHIP-8 emulator for UEFI☆12Jun 12, 2017Updated 8 years ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents.☆15Oct 23, 2025Updated 4 months ago
- Risky Object Localization (ROL) in a Driving Scene Dataset☆15Dec 24, 2023Updated 2 years ago
- Large-scale text embedding model☆38Sep 6, 2025Updated 5 months ago
- Solutions to LeetCode Problems in Java, Python, Go, Rust, and TypeScript☆11Feb 8, 2026Updated 3 weeks ago
- The GPT-4 function calls used in everchanging quest for the HF game jam☆10Jul 9, 2023Updated 2 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- Multiclass and multilabel classification of ECG signals using various deep learning models.☆11Nov 22, 2020Updated 5 years ago
- Wave - The Software as a Service Starter Kit, designed to help you build the SAAS of your dreams 🚀 💰☆12Jan 30, 2026Updated last month
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Tensorflow tf.metrics tutorial☆12Aug 30, 2018Updated 7 years ago
- A Frida MCP server to enable autonomous AI assistance for Android instrumentation☆33Feb 8, 2026Updated 3 weeks ago
- ZJU毛概资料汇总☆10Mar 16, 2024Updated last year
- ☆26Jan 4, 2026Updated last month
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆29Feb 9, 2026Updated 3 weeks ago
- High performance, enterprise grade, scalable Nostr relay☆25Feb 2, 2026Updated last month
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 6 months ago
- Scripts to install, build and deploy headless Pharo server applications using command line tools☆11May 29, 2020Updated 5 years ago
- 每日新闻.☆13Oct 16, 2025Updated 4 months ago
- Quick example which shows how to get user country name by IP-address using PHP☆11Sep 8, 2014Updated 11 years ago
- ☆10Feb 13, 2022Updated 4 years ago
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models☆13Dec 23, 2024Updated last year
- A Markov Chain Generator In Scala☆13Apr 1, 2013Updated 12 years ago
- Distributed Multi-Object Tracking Under Limited Field of View Sensors.☆19Oct 8, 2021Updated 4 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- ☆18Feb 25, 2025Updated last year
- Python package for fitting item response theory models using pystan☆14Apr 7, 2025Updated 10 months ago
- V is an AI Personal Trainer, built with NVIDIA and LangChain tools.☆15Jun 25, 2024Updated last year
- 抖音、B站各大平台直播互动游戏,随时更新,需要的小伙伴可以加Q群:476900886,大家一起讨论!☆18Jul 5, 2022Updated 3 years ago
- Reproducible and flexible LLM evaluations for scientific reasoning.☆26Jul 23, 2025Updated 7 months ago
- ☆28Dec 17, 2025Updated 2 months ago
- Ruby gem for taking responsive screenshots☆21Sep 16, 2019Updated 6 years ago
- Proof assistant for Typographical Number Theory☆16Dec 15, 2015Updated 10 years ago
- ☆15Aug 25, 2020Updated 5 years ago
- ☆20Oct 14, 2023Updated 2 years ago
- Wrapper for various third party ACH services☆12Feb 13, 2025Updated last year
- 自定义实现基于netty的rpc框架☆14Nov 8, 2025Updated 3 months ago
- ☆15Sep 11, 2024Updated last year