OpenPipe Reinforcement Learning Experiments
☆32Mar 14, 2025Updated last year
Alternatives and similar repositories for rl-experiments
Users that are interested in rl-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train your own SOTA deductive reasoning model☆110Mar 6, 2025Updated last year
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated 2 months ago
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- Visualize any repo or codebase into diagram or animation☆23Oct 14, 2024Updated last year
- ☆19Aug 19, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- ☆29Oct 25, 2025Updated 6 months ago
- unsloth-5090-multiple☆63May 21, 2025Updated 11 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- Technical Analysis Library using Pandas (Modin for speedup) (Python)☆11Jun 24, 2019Updated 6 years ago
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 10 months ago
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 6 months ago
- Interactive AI Tutor that not just responds in text but engages with with students by "performing actions" on the interactive activity.☆16Oct 13, 2024Updated last year
- ☆13Apr 10, 2026Updated last month
- Ibis is a Hands-Free Interactive Web Page. Using the latest generative AI, it can be Any Page.☆21Oct 30, 2024Updated last year
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- ☆53Feb 10, 2025Updated last year
- ☆16Mar 22, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Asimov helps you build high performance LLM apps, written in Rust 🦀☆11Jun 28, 2024Updated last year
- Various LLM Benchmarks☆26Feb 20, 2026Updated 2 months ago
- ☆16Apr 1, 2024Updated 2 years ago
- Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning☆12Jan 21, 2024Updated 2 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆27Feb 19, 2026Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- Exploring how optimizations for GEMMs work☆33Feb 28, 2026Updated 2 months ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆28Dec 25, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19May 16, 2024Updated 2 years ago
- XETBook, a free version of Bembo☆16Apr 25, 2026Updated 3 weeks ago
- LLM powered local Search Engine☆31Apr 30, 2026Updated 2 weeks ago
- Automatically generated and up-to-date datasets for Cobalt.☆10May 16, 2020Updated 6 years ago
- Task manager for macOS fully implemented in SwiftUI☆10Apr 28, 2020Updated 6 years ago
- ☆74Mar 23, 2026Updated last month
- Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and conte…☆27Nov 4, 2025Updated 6 months ago