Pavankunchala / Reinforcement-learning-with-verifable-rewards-LearningsLinks
RLVR Testing and Training
☆23Updated 5 months ago
Alternatives and similar repositories for Reinforcement-learning-with-verifable-rewards-Learnings
Users that are interested in Reinforcement-learning-with-verifable-rewards-Learnings are comparing it to the libraries listed below
Sorting:
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆85Updated 4 months ago
- ☆31Updated 10 months ago
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Updated 2 months ago
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…☆25Updated 7 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 5 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- ☆158Updated 9 months ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 10 months ago
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- ☆439Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44Updated 8 months ago
- ☆43Updated 2 months ago
- Exploring retrieval systems for language models☆14Updated 9 months ago
- unsloth-5090-multiple☆60Updated 8 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆38Updated last month
- Train transformer language models with reinforcement learning.☆19Updated 11 months ago
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆38Updated 2 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆60Updated 7 months ago
- ☆39Updated last year
- ☆95Updated last week
- rudradb-opin-examples is for example implementations of the pip install rudradb-opin☆29Updated 4 months ago
- Modified Beam Search with periodical restart☆12Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Updated last year
- ☆29Updated 9 months ago
- A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.☆99Updated last week
- Official Repository for Task-Circuit Quantization☆24Updated 7 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆103Updated 7 months ago