Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22Mar 12, 2025Updated 11 months ago
Alternatives and similar repositories for DistRL-LLM
Users that are interested in DistRL-LLM are comparing it to the libraries listed below
Sorting:
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 3 months ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- A tool for determining if a pickleball is in or out of bounds☆14Feb 3, 2025Updated last year
- Home Assistant integration for Tylo Helo sauna heaters via RS485☆20Feb 16, 2026Updated last week
- 增加了indextts2的简单的界面与api调用方式☆20Oct 27, 2025Updated 4 months ago
- ☆12Jun 19, 2024Updated last year
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models☆39Jul 19, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Estimates fatigue loads in wind turbines from SCADA data based on supervised learning.☆10Sep 11, 2018Updated 7 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 9 months ago
- A Kubernetes operator for managing Prefect servers and work pools☆17Updated this week
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on t…☆19Nov 26, 2025Updated 3 months ago
- ☆12May 30, 2025Updated 9 months ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 3 years ago
- ☆12Jun 20, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Doccano annotation server together with a Spacy backend☆11Apr 5, 2023Updated 2 years ago
- Arxiv + Notion Sync☆20May 12, 2025Updated 9 months ago
- Face++ 是一款基于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为用户提供一份专业、详尽且富有洞察力的面相分析报告☆21Jul 14, 2025Updated 7 months ago
- 开源扫雷网是专业玩家建设的扫雷排名网站。在这里,你可以上传扫雷录像参与全球排名;也希望有开发能力的雷友可以发挥专业能力,为网站贡献代码、增加功能。Open minesweeper website is a community-built ranking website fo…☆11Updated this week
- 日期时间实体识别☆11Sep 10, 2020Updated 5 years ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Connects MQTT (ie. zigbee2mqtt) to Ai, using a openai -compatible- api interface☆23Dec 30, 2025Updated 2 months ago
- ☆10Apr 16, 2021Updated 4 years ago
- Docker images for various AI tools.☆13Jun 12, 2023Updated 2 years ago
- ☆11Apr 23, 2023Updated 2 years ago
- redbox.wiki media and pages☆16Updated this week
- ☆11Sep 19, 2025Updated 5 months ago
- A free multi-purpose and open source Color Picker Software.☆17Jan 31, 2020Updated 6 years ago
- LLM CLI Interface - Extremely Convenient and Fast☆12Sep 22, 2025Updated 5 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- RAG Hallucination Detecting By LRP.☆11Mar 31, 2025Updated 11 months ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated last year