Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22Mar 12, 2025Updated last year
Alternatives and similar repositories for DistRL-LLM
Users that are interested in DistRL-LLM are comparing it to the libraries listed below.
- Community Eventing and Scripting examples☆19Aug 11, 2025Updated 10 months ago
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]☆14Apr 3, 2026Updated 2 months ago
- ☆19Mar 16, 2025Updated last year
- API and CLI tool to fetch and query Chrome DevTools heap snapshots (Python & Playwright)☆16May 16, 2024Updated 2 years ago
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- An HTTP proxy that naively injects NTLM data for the current user into outgoing requests☆14Nov 14, 2018Updated 7 years ago
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
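- Built on index-tts-vllm, implements and provides an API service for simulated streaming audio synthesis, plus client test scripts☆25Sep 2, 2025Updated 9 months ago
- Face++ is an innovative AI face-reading app developed for the Android platform. It combines traditional Chinese physiognomy theory (such as the "three courts and five eyes" and the "twelve palaces") with modern artificial intelligence to give users a professional, detailed, and insightful face-reading report☆22Jul 14, 2025Updated 11 months ago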
- Link Python logic with Svelte interfaces for simple demos☆14Jan 9, 2025Updated last year
- ☆18May 6, 2023Updated 3 years ago
- TRITONCACHE implementation of a Redis cache☆17Jun 9, 2026Updated 3 weeks ago
- Just a simple Android app that uses Rokid's CXR-M SDK to upload/sideload an APK onto your Rokid glasses over Wi-Fi. It might be hard to g…☆56Apr 9, 2026Updated 2 months ago
- Happy Hacking With Claude!!!☆25Oct 27, 2025Updated 8 months ago
- Arxiv + Notion Sync☆20May 12, 2025Updated last year
- A jupyter client for your terminal☆27Jan 3, 2026Updated 5 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆65Sep 25, 2024Updated last year
- A conversational UI for chatbots using the llama.cpp server☆15May 26, 2025Updated last year
- DSPy: The framework for programming with foundation models☆13Aug 24, 2023Updated 2 years ago
- A framework for creating message-driven training systems with PyTorch☆21Oct 7, 2025Updated 8 months ago
- ☆15Jan 15, 2024Updated 2 years ago
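- Backend for PPT-to-digital-human conversion☆20Apr 9, 2025Updated last year
- An open-source multimodal AI search project that combines large language models (LLMs), multi-source search engines, and a multi-agent architecture to deliver a next-generation intelligent question-answering search experience☆17Mar 26, 2025Updated last year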
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- Triton backend for managing the model state tensors automatically in sequence batcher☆17Feb 12, 2024Updated 2 years ago
- Automatic short-video generation, automatic article-to-video conversion, multimodal mixed editing, digital humans, voice cloning☆13Jun 25, 2024Updated 2 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- Example agents for the Dreadnode platform☆33Dec 19, 2025Updated 6 months ago
- Easily sort and organise your image collection.☆10Jan 5, 2024Updated 2 years ago
- AI research lab🔬: implementations of AI papers and theoretical research: InstructGPT, llama, transformers, diffusion models, RLHF, etc..…☆18Jun 9, 2026Updated 3 weeks ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- Minimal workflows☆22Mar 19, 2024Updated 2 years ago
- ☆15Mar 12, 2022Updated 4 years ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated last year
- Kerberos ticket delegation and impersonation for Batch/CI/CD environments☆20May 27, 2026Updated last month
- Small, simple agent task environments for training and evaluation☆20Nov 1, 2024Updated last year
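- 📚 A complete feature demo project for the OpenAI API, including: • ChatGPT/GPT-4 chat • DALL-E image generation • Whisper speech conversion • Text embedding search • RAG knowledge base system • Assistants API applications • Prompt engineering best practices 🔥 Features: •…☆22Nov 10, 2025Updated 7 months ago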
- AI Robustness Evaluation System☆51Updated this week