OpenPipe Reinforcement Learning Experiments
☆34Mar 14, 2025Updated last year
Alternatives and similar repositories for rl-experiments
Users that are interested in rl-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated 3 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- ☆51Oct 1, 2025Updated 8 months ago
- ☆19Aug 19, 2025Updated 10 months ago
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Unofficial PyTorch implementation of Neural Additive Models (NAM) by Agarwal, et al.☆14May 30, 2021Updated 5 years ago
- unsloth-5090-multiple☆63May 21, 2025Updated last year
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- We study toy models of skill learning.☆34Feb 3, 2026Updated 4 months ago
- Rust-native GPU kernel authoring framework: write GPU compute kernels in Rust, compile to PTX. The Triton equivalent for the Rust ecosyst…☆35Jun 12, 2026Updated 2 weeks ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆15Mar 29, 2025Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 7 months ago
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.☆47Aug 13, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Interactive AI Tutor that not just responds in text but engages with with students by "performing actions" on the interactive activity.☆16Oct 13, 2024Updated last year
- ☆13Apr 10, 2026Updated 2 months ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆13Aug 16, 2023Updated 2 years ago
- A VSCode extension to display relationships between files in a codebase, overlaid on a circle packing diagram of the file structure.☆14Jan 8, 2023Updated 3 years ago
- ☆53Feb 10, 2025Updated last year
- Asimov helps you build high performance LLM apps, written in Rust 🦀☆11Jun 28, 2024Updated 2 years ago
- ☆16Mar 22, 2025Updated last year
- Various LLM Benchmarks☆26Feb 20, 2026Updated 4 months ago
- Code snippets and reproductions from JustAByte☆48Apr 6, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning☆12Jan 21, 2024Updated 2 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆28Feb 19, 2026Updated 4 months ago
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆27Jan 26, 2024Updated 2 years ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- PyTorch implementation for Neural Additive Models☆25Dec 2, 2020Updated 5 years ago
- ☆18Nov 7, 2022Updated 3 years ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆29Dec 25, 2025Updated 6 months ago
- Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)☆10Apr 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- XETBook, a free version of Bembo☆16Apr 25, 2026Updated 2 months ago
- LLM powered local Search Engine☆31Apr 30, 2026Updated last month
- Task manager for macOS fully implemented in SwiftUI☆10Apr 28, 2020Updated 6 years ago
- ☆78Mar 23, 2026Updated 3 months ago
- Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and conte…☆28Nov 4, 2025Updated 7 months ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Rules-based file management for macOS☆14Jun 6, 2026Updated 3 weeks ago