☆65Jun 10, 2026Updated last week
Alternatives and similar repositories for llm_trainer
Users that are interested in llm_trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 从零构建大模型:从预训练到RLHF的完整实践☆2,667May 20, 2026Updated 3 weeks ago
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 9 months ago
- 基于chatGLM的语音对话实现(英文),整个流程基本使用开源工具,不涉及到openAI,完全免费,仅供个人测试,禁止商用☆16May 7, 2023Updated 3 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆28Dec 11, 2025Updated 6 months ago
- kitti-devkit for generating the error maps, KITTI-color-space disparity maps, and pfm2uint16png and uint16png2pfm converting☆12Feb 20, 2021Updated 5 years ago
- MeloTTS demo on Axera☆13Nov 18, 2025Updated 7 months ago
- Experiments with reasoning models, training techniques, papers☆30Updated this week
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆13Sep 4, 2023Updated 2 years ago
- Python Implementation for KITTI Scan Unfolding☆16Jun 20, 2025Updated 11 months ago
- 实现《Multiway Attention Networks for Modeling Sentence Pairs》中的网络模型,可用于问答,句子逻辑推理☆11Apr 13, 2020Updated 6 years ago
- ☆16Apr 23, 2026Updated last month
- ☆25Mar 8, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LLM KV Cache compression - K+V dual compression, 73-99% VRAM savings, zero accuracy loss☆57Mar 30, 2026Updated 2 months ago
- ☆10Jan 12, 2024Updated 2 years ago
- The implementation of Text Classification with Negative Supervision (ACL, 2020)☆10Oct 8, 2020Updated 5 years ago
- ☆13Sep 25, 2021Updated 4 years ago
- A tensorflow2.0 implementation of the YOLOv1 paper https://arxiv.org/pdf/1506.02640☆44Sep 9, 2021Updated 4 years ago
- MetaSearch:llm深度研究(deepsearch)功能方案实现☆33Aug 21, 2025Updated 9 months ago
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago
- 👂 Typing is slow, talk to me. The project name means ' i am tired ' in Chinese (我累了). This is a AI efficiency assistant, complete your d…☆16Jun 8, 2024Updated 2 years ago
- Experimental syslog template mining module☆11Aug 29, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for score-based transport modeling.☆11Jul 22, 2023Updated 2 years ago
- Multiagent optimization system (MAOS) for solving the Traveling Salesman Problem (TSP).☆12Aug 7, 2019Updated 6 years ago
- Methods and experiments for assumed density SDE approximations☆12Jan 26, 2022Updated 4 years ago
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)☆16Jul 4, 2024Updated last year
- Overlapping Reads COmpression with Minimizers☆16May 19, 2022Updated 4 years ago
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago
- NetChain in P4☆18Jul 4, 2020Updated 5 years ago
- Go Micro Service by (Beego+Go-Micro-v2.*)☆16Apr 12, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- Simple BackTest Engine inspired by Backtrader framework available in python.☆18Mar 7, 2026Updated 3 months ago
- python3 library to handle files from neuromorphic cameras☆17Jan 26, 2025Updated last year
- This is the code for the work accepted by ICRA2022.☆19Jun 20, 2022Updated 4 years ago
- ☆20Apr 17, 2023Updated 3 years ago
- The official repository for AdaMuon☆39Aug 27, 2025Updated 9 months ago
- Code used for the AAAI 2020 paper "System Identification with Time-Aware Neural Sequence Models"☆16Nov 22, 2019Updated 6 years ago