Large language models to diffusion finetuning code
☆24Jun 2, 2025Updated 9 months ago
Alternatives and similar repositories for L2D
Users that are interested in L2D are comparing it to the libraries listed below
Sorting:
- ☆44Updated this week
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 9 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆35Feb 7, 2026Updated 3 weeks ago
- ☆28Dec 3, 2025Updated 3 months ago
- 详细双语注释版word2vec源码,well-annotated word2vec☆10Oct 3, 2021Updated 4 years ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- ☆12Jan 15, 2015Updated 11 years ago
- Anchored Diffusion Language Model (NeurIPS 2025)☆27Oct 13, 2025Updated 4 months ago
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- Open source multimodal OpenLKA dataset☆17Feb 13, 2026Updated 2 weeks ago
- Quality scores to evaluate network partitions☆12Apr 28, 2024Updated last year
- Collection of simple General Matrix Multiplication - GEMM implementations☆13Feb 26, 2024Updated 2 years ago
- Efficient Hyper-parameter Tuning at Scale (VLDB'22)☆10Dec 1, 2021Updated 4 years ago
- anydice roller☆12May 26, 2018Updated 7 years ago
- Code Implementation for AutoAttend: Automated Attention Representation Search☆11Jul 26, 2021Updated 4 years ago
- Speeding Up Your Python Codes 1000x☆12Apr 2, 2025Updated 11 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- ☆23Jul 11, 2025Updated 7 months ago
- Get the width of fingers according the photo with a hand and a coin besides the hand.☆10Dec 16, 2023Updated 2 years ago
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 11 months ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 11 years ago
- My list of awesome pixi.js related parties☆11Feb 14, 2019Updated 7 years ago
- A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆98Dec 17, 2025Updated 2 months ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- Score Entropy Discrete Diffusion language model - https://arxiv.org/abs/2310.16834☆17Jul 7, 2025Updated 7 months ago
- ☆10May 16, 2024Updated last year
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- BigBang-Proton is a LLM pretrained on cross-scale, cross-structure, cross-discipline real-world scientific tasks to construct a scienti…☆22Nov 8, 2025Updated 3 months ago
- ☆13Jan 7, 2025Updated last year
- ☆14Feb 11, 2026Updated 2 weeks ago
- ☆12Jul 9, 2021Updated 4 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- ☆18Jun 6, 2025Updated 8 months ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Automated bottleneck detection and solution orchestration☆19Feb 24, 2026Updated last week
- The Python solutions of leetcode☆13Apr 26, 2020Updated 5 years ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 9 months ago
- Pytorch routines for (Ker)nel (Mac)hines☆10Oct 10, 2025Updated 4 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago