☆160Apr 17, 2025Updated 10 months ago
Alternatives and similar repositories for ReZero
Users that are interested in ReZero are comparing it to the libraries listed below
Sorting:
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆683Mar 22, 2025Updated 11 months ago
- ☆10Feb 14, 2025Updated last year
- ☆20Mar 25, 2025Updated 11 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆79Jul 31, 2025Updated 7 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆48Dec 25, 2025Updated 2 months ago
- Your personal and private AI☆55Apr 3, 2025Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- ☆30Oct 4, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23May 6, 2025Updated 9 months ago
- ☆69Jan 18, 2026Updated last month
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆66May 5, 2025Updated 9 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Feb 7, 2026Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- CrewAI-Agentic-Jira: Enhance your Jira workflows with intelligent agent-driven automation. Powered by the CrewAI framework, this project …☆21Feb 3, 2025Updated last year
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- ☆47Apr 29, 2025Updated 10 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 9 months ago
- ☆67Mar 30, 2025Updated 11 months ago
- ☆25Aug 19, 2025Updated 6 months ago
- ☆16Sep 17, 2024Updated last year
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- ☆28Apr 8, 2025Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆68Aug 21, 2024Updated last year
- Kyutai with an "eye"☆238Mar 26, 2025Updated 11 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated last month
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆105Oct 31, 2024Updated last year
- ☆132May 8, 2025Updated 9 months ago
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- ☆26Jan 4, 2026Updated last month