OpenPipe Reinforcement Learning Experiments
☆32Mar 14, 2025Updated last year
Alternatives and similar repositories for rl-experiments
Users that are interested in rl-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train your own SOTA deductive reasoning model☆109Mar 6, 2025Updated last year
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated 3 weeks ago
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- Visualize any repo or codebase into diagram or animation☆23Oct 14, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆51Oct 1, 2025Updated 6 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- unsloth-5090-multiple☆62May 21, 2025Updated 10 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- We study toy models of skill learning.☆33Feb 3, 2026Updated 2 months ago
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- ☆12Mar 3, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- a transformer implemented primarily using einops and trained on the tinystories dataset☆13Jun 21, 2024Updated last year
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 9 months ago
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 5 months ago
- Various LLM Benchmarks☆24Feb 20, 2026Updated last month
- ☆13Mar 29, 2026Updated last week
- Interactive AI Tutor that not just responds in text but engages with with students by "performing actions" on the interactive activity.☆16Oct 13, 2024Updated last year
- Ibis is a Hands-Free Interactive Web Page. Using the latest generative AI, it can be Any Page.☆21Oct 30, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A VSCode extension to display relationships between files in a codebase, overlaid on a circle packing diagram of the file structure.☆14Jan 8, 2023Updated 3 years ago
- ☆53Feb 10, 2025Updated last year
- Asimov helps you build high performance LLM apps, written in Rust 🦀☆11Jun 28, 2024Updated last year
- Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal☆51Updated this week
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆107Feb 15, 2026Updated last month
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆27Jan 26, 2024Updated 2 years ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated 3 months ago
- PyTorch implementation for Neural Additive Models☆25Dec 2, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Mojo Miji | A guide to Mojo programming language from a Pythonista's perspective | Mojo 秘籍☆27Mar 7, 2026Updated last month
- Pytorch implementation of the paper: Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement.☆10Oct 17, 2020Updated 5 years ago
- XETBook, a free version of Bembo☆15Apr 8, 2019Updated 7 years ago
- LLM powered local Search Engine☆31Updated this week
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild☆22Apr 23, 2024Updated last year
- Task manager for macOS fully implemented in SwiftUI☆10Apr 28, 2020Updated 5 years ago
- Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and conte…☆27Nov 4, 2025Updated 5 months ago