Pipeline-Parallel Lecture: Simplest Dualpipe Implementation.
☆31Sep 17, 2025Updated 6 months ago
Alternatives and similar repositories for easy-dualpipe
Users that are interested in easy-dualpipe are comparing it to the libraries listed below
Sorting:
- ☆19Sep 10, 2025Updated 6 months ago
- ☆13Oct 23, 2023Updated 2 years ago
- ☆41Dec 31, 2021Updated 4 years ago
- Official PyTorch implementation of the TMI paper "Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for…☆16Mar 13, 2024Updated 2 years ago
- ☆47Dec 13, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆36Feb 6, 2026Updated last month
- Lumina is a user-friendly tool to test the correctness and performance of hardware network stacks.☆29Jan 8, 2024Updated 2 years ago
- Accelerating GOT-OCRv2 with VLLM☆11Nov 15, 2024Updated last year
- rdma编程学习☆25Dec 6, 2021Updated 4 years ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 7 months ago
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"☆28May 14, 2025Updated 10 months ago
- 条件随机场(CRF)的pytorch实现☆10Mar 7, 2021Updated 5 years ago
- ☆17Jul 17, 2025Updated 8 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- This is a deep-learning based model for Electronic Design Automation(EDA), predicting the Design Rule Check (DRC) violation location.☆13Jun 24, 2023Updated 2 years ago
- Wrapper for Normalized Gradient Descent in Keras☆17Jun 9, 2018Updated 7 years ago
- The codebase and some introductions of FineMed.☆31Sep 11, 2025Updated 6 months ago
- fastapi异步IO+threadpool线程池的工作原理☆18Feb 12, 2024Updated 2 years ago
- ☆16Sep 12, 2023Updated 2 years ago
- 山东省第二届数据应用创新创业大赛-主赛场-检验报告单识 别-Baseline☆13Jan 15, 2021Updated 5 years ago
- Having Fun in Deep Learning☆15Mar 5, 2022Updated 4 years ago
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026)☆31Mar 1, 2026Updated 3 weeks ago
- Library for lagged conversion rate estimation. Based on the paper "Modeling Delayed Feedback in Display Advertising", Chapelle, 2014.☆14Mar 21, 2019Updated 7 years ago
- Quartet II Official Code☆61Updated this week
- ☆34Jul 16, 2025Updated 8 months ago
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering☆22Oct 26, 2025Updated 4 months ago
- ☆16Feb 7, 2024Updated 2 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- Source code for our Paper "Self-Attentive Neural Collaborative Filtering"☆17Jul 6, 2018Updated 7 years ago
- ☆12Jan 21, 2026Updated 2 months ago
- Low-Rank Llama Custom Training☆23Mar 27, 2024Updated last year
- ☆20Jan 7, 2024Updated 2 years ago
- ☆17Jul 25, 2023Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆21May 9, 2025Updated 10 months ago
- This project enhances the LLaMA-2 model using Quantized Low-Rank Adaptation (QLoRA) and other parameter-efficient fine-tuning techniques …☆13Apr 18, 2024Updated last year
- ☆12Feb 14, 2022Updated 4 years ago
- Apple's Cut Cross Entropy☆30Jan 19, 2025Updated last year