OpenPipe Reinforcement Learning Experiments
☆32Mar 14, 2025Updated 11 months ago
Alternatives and similar repositories for rl-experiments
Users that are interested in rl-experiments are comparing it to the libraries listed below
Sorting:
- Train your own SOTA deductive reasoning model☆107Mar 6, 2025Updated 11 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆45Mar 24, 2025Updated 11 months ago
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- ☆18Aug 19, 2025Updated 6 months ago
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆30Nov 8, 2025Updated 3 months ago
- unsloth-5090-multiple☆60May 21, 2025Updated 9 months ago
- Ibis is a Hands-Free Interactive Web Page. Using the latest generative AI, it can be Any Page.☆21Oct 30, 2024Updated last year
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 5 months ago
- ☆17May 12, 2023Updated 2 years ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated 2 months ago
- Collection of LLM completions for reasoning-gym task datasets☆30Jul 4, 2025Updated 7 months ago
- Visualize any repo or codebase into diagram or animation☆20Oct 14, 2024Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated 11 months ago
- ☆51Oct 1, 2025Updated 4 months ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆14Jul 28, 2025Updated 6 months ago
- ☆24Jan 22, 2025Updated last year
- Quantized text-audio foundation model from Boson AI☆43Aug 13, 2025Updated 6 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆23Apr 1, 2025Updated 10 months ago
- A local RAG pipeline that passed a Japanese corporate exam☆24May 7, 2025Updated 9 months ago
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆33Nov 30, 2023Updated 2 years ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- ☆27Oct 22, 2024Updated last year
- Oobabooga Text-Gen Web UI extension: get web content, add to context☆23Jun 1, 2024Updated last year
- ☆26Sep 12, 2019Updated 6 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆67Aug 21, 2024Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆41Jan 27, 2026Updated last month
- The ArchiveWeb.page Site☆32Nov 7, 2025Updated 3 months ago
- A QT GUI for large language models☆39Dec 27, 2023Updated 2 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆94Feb 15, 2026Updated last week
- Codes and datasets for adaptive spline fitting method SHAPES☆10Sep 27, 2024Updated last year
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.☆47Aug 13, 2025Updated 6 months ago
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆28Jan 26, 2024Updated 2 years ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Sep 22, 2024Updated last year
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆42Apr 22, 2025Updated 10 months ago
- MQTT interface for Bluetti power stations☆16Jun 21, 2025Updated 8 months ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- Some implementations from the paper robust risk aware reinforcement learning☆36Dec 15, 2021Updated 4 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆23Feb 19, 2026Updated last week