青稞Talk
☆201Jan 21, 2026Updated 2 months ago
Alternatives and similar repositories for qingketalk
Users that are interested in qingketalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Apr 17, 2024Updated last year
- ☆62Apr 3, 2026Updated last week
- ☆14Nov 3, 2025Updated 5 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆103Aug 25, 2025Updated 7 months ago
- This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.☆89Apr 7, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆95Jan 16, 2026Updated 2 months ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆53Jul 25, 2024Updated last year
- Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆28Apr 4, 2025Updated last year
- Vortex: A Flexible and Efficient Sparse Attention Framework☆51Updated this week
- A set of examples based on verl for end-to-end RL training recipes.☆243Updated this week
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning☆36Oct 22, 2025Updated 5 months ago
- 分层解耦的深度学习推理引擎☆79Feb 17, 2025Updated last year
- ☆28Feb 2, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆25Oct 17, 2025Updated 5 months ago
- ☆61Feb 5, 2026Updated 2 months ago
- ☆28Jul 11, 2021Updated 4 years ago
- Ray Framework (https://github.com/ray-project/ray) on Kubernetes☆13Oct 12, 2018Updated 7 years ago
- A Sober Look at Language Model Reasoning☆94Nov 18, 2025Updated 4 months ago
- ☆29Mar 13, 2026Updated last month
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated 11 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆478May 17, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Feb 2, 2021Updated 5 years ago
- ☆13Mar 5, 2025Updated last year
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated 11 months ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆298Nov 7, 2025Updated 5 months ago
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆98Mar 4, 2026Updated last month
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆54Jul 15, 2025Updated 9 months ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆76Mar 29, 2026Updated 2 weeks ago
- ☆13Mar 24, 2024Updated 2 years ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆3,063Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆452Oct 23, 2025Updated 5 months ago
- Ring attention implementation with flash attention☆1,006Sep 10, 2025Updated 7 months ago
- FastThresholdClustering is an efficient vector clustering algorithm based on FAISS, particularly suitable for large-scale vector data clu…☆30Dec 17, 2024Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated 2 years ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆5,011Updated this week
- Efficient Mixture of Experts for LLM Paper List☆175Sep 28, 2025Updated 6 months ago