☆160Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for ReZero
Users that are interested in ReZero are comparing it to the libraries listed below
- ☆20Mar 25, 2025Updated 11 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆686Mar 22, 2025Updated last year
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale. A powerful web crawling library that respects robots.txt and rate…☆19Apr 3, 2025Updated 11 months ago
- ☆30Oct 4, 2024Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- ⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀☆64Jan 22, 2026Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- ☆26Jan 14, 2025Updated last year
- ☆58Aug 19, 2025Updated 7 months ago
- Your personal and private AI☆54Apr 3, 2025Updated 11 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆90Mar 16, 2026Updated last week
- ☆67Mar 30, 2025Updated 11 months ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆81Jul 31, 2025Updated 7 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆124Dec 30, 2025Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 9 months ago
- MCP Client Implemented to FastAPI☆11Feb 26, 2025Updated last year
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆31Mar 27, 2025Updated 11 months ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 11 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆106Oct 31, 2024Updated last year
- Fine-tuning llama2 using the chain-of-thought approach☆10Nov 18, 2023Updated 2 years ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- CrewAI-Agentic-Jira: Enhance your Jira workflows with intelligent agent-driven automation. Powered by the CrewAI framework, this project …☆22Feb 3, 2025Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23May 6, 2025Updated 10 months ago
- ☆28Jan 4, 2026Updated 2 months ago
- Optimizing inference proxy for LLMs☆3,381Jan 28, 2026Updated last month
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆55Dec 25, 2025Updated 2 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,349May 16, 2025Updated 10 months ago
- PropRAG: A novel Retrieval-Augmented Generation framework leveraging context-rich propositions and LLM-free beam search over proposition …☆33Nov 9, 2025Updated 4 months ago
- a survey on deep research☆47Sep 9, 2025Updated 6 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆68Aug 21, 2024Updated last year
- Automated LLM novelist☆46Apr 11, 2024Updated last year