☆62Mar 23, 2026Updated last week
Alternatives and similar repositories for llm_trainer
Users that are interested in llm_trainer are comparing it to the libraries listed below.
- Implements an LLM in PyTorch, with support for MoE and RoPE☆61Mar 18, 2026Updated last week
- Building a large language model from scratch: a complete walkthrough from pre-training to RLHF☆2,553Mar 19, 2026Updated last week
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆50Aug 20, 2025Updated 7 months ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- A Sample Code Project for ASP.NET 5 with Dapr☆13Apr 18, 2021Updated 4 years ago
- ☆24Aug 21, 2025Updated 7 months ago
- ☆24Dec 11, 2025Updated 3 months ago
- MeloTTS demo on Axera☆12Nov 18, 2025Updated 4 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 5 months ago
- Java implementation of the BERT tokenizer, with support for emitting ONNX tensors for ONNX model inference☆13Sep 4, 2023Updated 2 years ago
- ☆12Sep 25, 2021Updated 4 years ago
- [COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"☆55Oct 6, 2025Updated 5 months ago
- Implementation of the network model from "Multiway Attention Networks for Modeling Sentence Pairs"; usable for question answering and sentence-level logical inference☆11Apr 13, 2020Updated 5 years ago
- ☆15Jun 22, 2025Updated 9 months ago
- ☆25Mar 8, 2026Updated 3 weeks ago
- The implementation of Text Classification with Negative Supervision (ACL, 2020)☆10Oct 8, 2020Updated 5 years ago
- ☆10Jan 12, 2024Updated 2 years ago
- Tokenizing Chinese corpora with Sentencepiece☆13Nov 30, 2023Updated 2 years ago
- Run AI models locally and offline on your machine☆11May 8, 2025Updated 10 months ago
- Experimental syslog template mining module☆11Aug 29, 2016Updated 9 years ago
- Repository for score-based transport modeling.☆11Jul 22, 2023Updated 2 years ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆63Mar 14, 2025Updated last year
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)☆16Jul 4, 2024Updated last year
- [ICLR 2022] Denoising Likelihood Score Matching for Conditional Score-based Data Generation☆11Jan 2, 2025Updated last year
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago
- NetChain in P4☆18Jul 4, 2020Updated 5 years ago
- Go Micro Service by (Beego+Go-Micro-v2.*)☆16Apr 12, 2022Updated 3 years ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- a distributed fifo queue based on zookeeper.☆17Dec 28, 2015Updated 10 years ago
- ☆20Apr 17, 2023Updated 2 years ago
- The official repository for AdaMuon☆35Aug 27, 2025Updated 7 months ago
- A training and inference framework for LLM☆44Mar 20, 2026Updated last week
- Code used for the AAAI 2020 paper "System Identification with Time-Aware Neural Sequence Models"☆16Nov 22, 2019Updated 6 years ago
- ☆16May 12, 2023Updated 2 years ago
- IPLoM (Iterative Partitioning Log Mining) - Java☆15Mar 13, 2016Updated 10 years ago
- Proactive-adaptive arbitration between shipping compute and shipping data☆18Jul 8, 2021Updated 4 years ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆25Feb 10, 2025Updated last year