This repo has scripts to compare various powerful RL methods
☆42Feb 23, 2026Updated 2 months ago
Alternatives and similar repositories for Tiny-RL
Users that are interested in Tiny-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- OllamaFX is a native, lightweight, and professional JavaFX desktop client for Ollama. Run Llama 3, Mistral, and Phi-3 locally with maximu…☆67Mar 6, 2026Updated 2 months ago
- Point Cloud Annotation Tool. Built with PPTK and PyQt.☆15May 18, 2023Updated 3 years ago
- HUHEMS is a full-stack exam management system for Haramaya University. It supports admin-managed exams and question banks, student exam a…☆61Mar 30, 2026Updated last month
- ☆15Jun 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenClaw Daily News (with Ollama + Telegram Quick Setup Guide) | 每日新聞兼快速安裝指南☆42Updated this week
- Efficient Multi-Vehicle Trajectory Planning via Centralized Searching Decentralized Optimization☆27Jan 16, 2025Updated last year
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- 时间有限,保持专注,戴上信息降噪眼镜。 只在重要时,主动通知你☆53Dec 26, 2025Updated 4 months ago
- [CVPR 2025] Implementation of "Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models"☆40Apr 28, 2025Updated last year
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆36Nov 13, 2025Updated 6 months ago
- ☆76Apr 16, 2026Updated last month
- Andrej Karpathy's microGPT transliterated to Rust☆36Feb 28, 2026Updated 2 months ago
- An AlphaZero engine for Saiblo Connect4, featuring a pure Python implementation of key KataGo techniques.☆18Apr 21, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- PostgreSQL SKILLs for AI Agent☆33Feb 5, 2026Updated 3 months ago
- [NeurIPS 2025] Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via In-the-wild Cascading Flow Optimization☆19Oct 22, 2025Updated 7 months ago
- 深度学习初学者理论与实践学习的资料总结☆13Apr 19, 2019Updated 7 years ago
- Deep Learning for Energy Efficient Beamforming in MU-MISO Networks: A GAT-based Approach☆15Apr 22, 2023Updated 3 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- lightweight and scalable whole-body teleoperation framework for humanoid robots☆102May 15, 2026Updated last week
- CarlaDataCollector is a lightweight framework for efficient data collection in Carla simulation environment.☆14Jan 9, 2024Updated 2 years ago
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆47Feb 8, 2026Updated 3 months ago
- a toy mock server based on anyproxy☆20Mar 18, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆59May 11, 2026Updated last week
- practical claude code commands and subagents☆70Apr 14, 2026Updated last month
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆53Jan 25, 2026Updated 3 months ago
- Latex template for Amit Yadav's CV☆26Oct 23, 2021Updated 4 years ago
- 大模型(LLMs)微调训练 快速入门指南☆37Feb 26, 2026Updated 2 months ago
- Argy: Command-line parsing library for modern C++ — simple, intuitive, and header-only with no dependencies.☆34Aug 31, 2025Updated 8 months ago
- 🍪 青龙助手:自动同步网站Cookie到青龙面板的Chrome扩展,支持多网站配置和完整的环境变量管理。[qnloft出品]☆62Dec 31, 2025Updated 4 months ago
- Convert images to fit Commodore 64 graphic modes☆45Jan 9, 2026Updated 4 months ago
- FactoryTest is used for manufacturing tests on Android devices. Includes: WiFi, Bluetooth, Ethernet, Mobile Network and more tests. Based…☆29Aug 24, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SSD检测过程的Tensorflow实现☆28Jun 7, 2018Updated 7 years ago
- [Spotlight ICLR 2023 paper] Continual evaluation for lifelong learning with neural networks, identifying the stability gap.☆35Apr 2, 2023Updated 3 years ago
- The most atomic way to train and inference a GPT in pure, dependency-free JavaScript. This repository covers the complete algorithm. Ever…☆92Feb 13, 2026Updated 3 months ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆26Aug 24, 2025Updated 8 months ago
- awsome ai tools☆12Apr 21, 2023Updated 3 years ago
- 45+ production-ready tutorials on data science, MLOps, and AI tools. All code is executable and adaptable for real projects.☆24Apr 7, 2026Updated last month
- The Custom Gridworld and Environment Demo of Ship Route Planning with Reinforcement Learning. The reinforcement learning based on Qlearni…☆35Sep 27, 2022Updated 3 years ago