☆53 · Mar 4, 2026 · Updated this week
Alternatives and similar repositories for llm_trainer
Users interested in llm_trainer are comparing it to the libraries listed below.
- Implements an LLM in PyTorch, with support for MoE and RoPE ☆41 · Jan 29, 2026 · Updated last month
- Multiagent optimization system (MAOS) for solving the Traveling Salesman Problem (TSP). ☆12 · Aug 7, 2019 · Updated 6 years ago
- Repository for score-based transport modeling. ☆11 · Jul 22, 2023 · Updated 2 years ago
- The implementation of Text Classification with Negative Supervision (ACL, 2020) ☆10 · Oct 8, 2020 · Updated 5 years ago
- KITTI devkit for generating error maps and KITTI-color-space disparity maps, and for converting between PFM and uint16 PNG (pfm2uint16png, uint16png2pfm) ☆12 · Feb 20, 2021 · Updated 5 years ago
- ☆12 · Sep 25, 2021 · Updated 4 years ago
- ☆25 · Jun 26, 2025 · Updated 8 months ago
- Methods and experiments for assumed density SDE approximations ☆12 · Jan 26, 2022 · Updated 4 years ago
- [ICLR 2022] Denoising Likelihood Score Matching for Conditional Score-based Data Generation ☆11 · Jan 2, 2025 · Updated last year
- STRODE: Stochastic Boundary Ordinary Differential Equation ☆13 · Jul 20, 2021 · Updated 4 years ago
- ☆15 · Jun 22, 2025 · Updated 8 months ago
- 👂 Typing is slow, talk to me. The project name means 'I am tired' in Chinese (我累了). This is an AI efficiency assistant, complete your d… ☆16 · Jun 8, 2024 · Updated last year
- Overlapping Reads COmpression with Minimizers ☆16 · May 19, 2022 · Updated 3 years ago
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024) ☆16 · Jul 4, 2024 · Updated last year
- ☆13 · Oct 24, 2021 · Updated 4 years ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe… ☆13 · Jan 29, 2025 · Updated last year
- ☆17 · Jan 31, 2025 · Updated last year
- ☆16 · May 12, 2023 · Updated 2 years ago
- IPLoM (Iterative Partitioning Log Mining) - Java ☆15 · Mar 13, 2016 · Updated 9 years ago
- [T-PAMI 2025] Scale Propagation Network for Generalizable Depth Completion ☆25 · Apr 1, 2025 · Updated 11 months ago
- ☆14 · Mar 3, 2025 · Updated last year
- Source code for UQnet ☆16 · May 23, 2024 · Updated last year
- The official repository for AdaMuon ☆35 · Aug 27, 2025 · Updated 6 months ago
- Code used for the AAAI 2020 paper "System Identification with Time-Aware Neural Sequence Models" ☆16 · Nov 22, 2019 · Updated 6 years ago
- A lightweight, production-friendly orchestration layer on top of PyTorch. JackFramework standardizes model/data wiring, distributed execu… ☆17 · Oct 20, 2025 · Updated 4 months ago
- ViLReF: An Expert Knowledge Enabled Vision-Language Retinal Foundation Model ☆22 · Oct 16, 2024 · Updated last year
- python3 library to handle files from neuromorphic cameras ☆17 · Jan 26, 2025 · Updated last year
- ☆19 · Aug 9, 2024 · Updated last year
- ☆20 · Jan 19, 2022 · Updated 4 years ago
- This is the code for the work accepted by ICRA2022. ☆19 · Jun 20, 2022 · Updated 3 years ago
- Code for Beyond Finite Layer Neural Networks: Bridging Deep Architectures and Numerical Differential Equations ☆15 · Jun 4, 2019 · Updated 6 years ago
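- AI Agent tutorial repository | Systematic full-stack hands-on code for LangChain, RAG, LangGraph, and MCP | Detailed long-form blog walkthroughs | Open-source, runnable examples | Build agents from scratch ☆94 · Feb 7, 2026 · Updated last month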
- ☆22 · Jul 24, 2023 · Updated 2 years ago
- PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularl… ☆25 · Jul 9, 2023 · Updated 2 years ago
- Human Mesh Recovery / Human Pose Estimation ☆24 · Aug 29, 2022 · Updated 3 years ago
- ☆30 · Feb 12, 2026 · Updated 3 weeks ago
- Collection of resources that combine dynamic systems, control with deep learning. ☆28 · May 18, 2021 · Updated 4 years ago
- Layer-wise Pruning of Transformer Heads for Efficient Language Modeling ☆22 · Feb 22, 2022 · Updated 4 years ago
- Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project ☆25 · Feb 22, 2025 · Updated last year