OpenPipe Reinforcement Learning Experiments
☆32Mar 14, 2025Updated last year
Alternatives and similar repositories for rl-experiments
Users that are interested in rl-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated 2 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- ☆18May 12, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- unsloth-5090-multiple☆63May 21, 2025Updated last year
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- We study toy models of skill learning.☆33Feb 3, 2026Updated 4 months ago
- Delta: LLM conversation branching☆14Dec 30, 2024Updated last year
- ☆12Mar 3, 2022Updated 4 years ago
- Rust-native GPU kernel authoring framework: write GPU compute kernels in Rust, compile to PTX. The Triton equivalent for the Rust ecosyst…☆33May 18, 2026Updated 3 weeks ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆15Mar 29, 2025Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆18Sep 5, 2025Updated 9 months ago
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 7 months ago
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.☆47Aug 13, 2025Updated 9 months ago
- Interactive AI Tutor that not just responds in text but engages with with students by "performing actions" on the interactive activity.☆16Oct 13, 2024Updated last year
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 10 months ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆13Aug 16, 2023Updated 2 years ago
- A VSCode extension to display relationships between files in a codebase, overlaid on a circle packing diagram of the file structure.☆14Jan 8, 2023Updated 3 years ago
- ☆53Feb 10, 2025Updated last year
- Various LLM Benchmarks☆26Feb 20, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Apr 1, 2024Updated 2 years ago
- Code snippets and reproductions from JustAByte☆48Apr 6, 2026Updated 2 months ago
- Annotates GCode files with human readable descriptions of commands☆12Dec 9, 2024Updated last year
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆27Jan 26, 2024Updated 2 years ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- Mojo Miji | A guide to Mojo programming language from a Pythonista's perspective | Mojo 秘籍☆31Updated this week
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆28Dec 25, 2025Updated 5 months ago
- Pytorch implementation of the paper: Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement.☆10Oct 17, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- LLM powered local Search Engine☆31Apr 30, 2026Updated last month
- Automatically generated and up-to-date datasets for Cobalt.☆10May 16, 2020Updated 6 years ago
- Task manager for macOS fully implemented in SwiftUI☆10Apr 28, 2020Updated 6 years ago
- ☆76Mar 23, 2026Updated 2 months ago
- Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and conte…☆27Nov 4, 2025Updated 7 months ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆16May 14, 2025Updated last year