Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22Mar 12, 2025Updated last year
Alternatives and similar repositories for DistRL-LLM
Users that are interested in DistRL-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Community Eventing and Scripting examples☆19Aug 11, 2025Updated 7 months ago
- ☆19Mar 16, 2025Updated last year
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 3 years ago
- API and CLI tool to fetch and query Chome DevTools heap snapshots (Python & Playwright)☆16May 16, 2024Updated last year
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An HTTP proxy that naively injects NTLM data for the current user into outgoing requests☆14Nov 14, 2018Updated 7 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- Face++ 是一款基于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为用户提供一份专业、详尽且富有洞察力的面相分析报告☆22Jul 14, 2025Updated 8 months ago
- Link Python logic with Svelte interfaces for simple demos☆13Jan 9, 2025Updated last year
- Google Chrome Extension for recording Google Meet transcripts☆12Aug 6, 2020Updated 5 years ago
- ☆18May 6, 2023Updated 2 years ago
- A jupyter client for your terminal☆24Jan 3, 2026Updated 3 months ago
- 增加了indextts2的简单的界面与api调用方式☆27Oct 27, 2025Updated 5 months ago
- Arxiv + Notion Sync☆20May 12, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- One Line To Build Zero-Data Classifiers in Minutes☆65Sep 25, 2024Updated last year
- Happy Hacking With Claude!!!☆24Oct 27, 2025Updated 5 months ago
- A framework for creating message-driven training systems with PyTorch☆21Oct 7, 2025Updated 6 months ago
- ☆16Jan 2, 2020Updated 6 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆16Jun 13, 2017Updated 8 years ago
- Offical Code For "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- Example agents for the Dreadnode platform☆30Dec 19, 2025Updated 3 months ago
- Triton backend for managing the model state tensors automatically in sequence batcher☆16Feb 12, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 自动生成短视频,文章自动成片,多模态混剪,数字人,声音克隆☆13Jun 25, 2024Updated last year
- ☆18Apr 15, 2024Updated last year
- Minimal workflows☆21Mar 19, 2024Updated 2 years ago
- ☆15Mar 12, 2022Updated 4 years ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated 11 months ago
- Kerberos ticket delegation and impersonation for Batch/CI/CD environments☆20Mar 21, 2026Updated 3 weeks ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- MCP server for ROS to control robots via topics, services, and actions.☆31Aug 19, 2025Updated 7 months ago
- Small, simple agent task environments for training and evaluation☆19Nov 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 📚 OpenAI API 完整功能演示项目,包含: • ChatGPT/GPT-4 对话 • DALL-E 图像生成 • Whisper 语音转换 • 文本嵌入搜索 • RAG 知识库系统 • Assistants API 应用 • 提示词工程最佳实践 🔥 特点: •…☆23Nov 10, 2025Updated 5 months ago
- A free multi-purpose and open source Color Picker Software.☆17Jan 31, 2020Updated 6 years ago
- Forensic Reconstruction of Severely Degraded License Plates, Electronic Imaging, 2019.☆18Apr 27, 2022Updated 3 years ago
- ChromaDB Data Pipes 🖇️ - The easiest way to get data into and out of ChromaDB☆20Oct 22, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 5 months ago
- 天池Better Synth多模态大模型数据合成挑战赛-打赢baseline 就算成功方案☆28Oct 30, 2025Updated 5 months ago