☆413Dec 26, 2024Updated last year
Alternatives and similar repositories for spring2024-lectures
Users that are interested in spring2024-lectures are comparing it to the libraries listed below
Sorting:
- ☆73Jul 13, 2024Updated last year
- ☆22Apr 22, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- ☆2,700Jan 9, 2026Updated 2 months ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- Open-source framework for the research and development of foundation models.☆781Updated this week
- Personal blog + reading notes on system-ish papers☆15Oct 29, 2023Updated 2 years ago
- ☆292Jul 15, 2024Updated last year
- Minimalistic large language model 3D-parallelism training☆2,588Feb 19, 2026Updated 2 weeks ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆373Feb 26, 2026Updated last week
- ☆15Feb 25, 2026Updated last week
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- Typed python equivalent for R pipes.☆13Oct 16, 2022Updated 3 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- ☆13Sep 2, 2023Updated 2 years ago
- ☆12Aug 26, 2025Updated 6 months ago
- Forked robosuite for LASER project☆12Jan 8, 2021Updated 5 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆13May 9, 2024Updated last year
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- ☆12Jul 6, 2023Updated 2 years ago
- This is a companion repository for the On Prem RAG AIM Event☆11Nov 30, 2024Updated last year
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality☆327Jan 5, 2026Updated 2 months ago
- Group Meeting Record for Baobao Chang Group in Peking University☆26May 17, 2021Updated 4 years ago
- Fast and memory-efficient exact attention☆22,460Updated this week
- 🚀 Efficient implementations of state-of-the-art linear attention models☆4,474Mar 3, 2026Updated last week
- My learning notes for ML SYS.☆5,580Mar 2, 2026Updated last week
- Code for TKDE paper: Patient Health Representation Learning via Correlational Sparse Prior of Medical Features.☆11Jan 5, 2023Updated 3 years ago
- MISO: Learning Multiple Initial Solutions to Optimization Problems☆16Nov 8, 2024Updated last year
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆17Mar 26, 2020Updated 5 years ago
- ☆17Apr 9, 2025Updated 11 months ago
- ☆12Nov 15, 2024Updated last year
- GPU programming related news and material links☆2,010Sep 17, 2025Updated 5 months ago
- ☆35Jun 21, 2023Updated 2 years ago
- What would you do with 1000 H100s...☆1,155Jan 10, 2024Updated 2 years ago
- ☆118Jul 21, 2025Updated 7 months ago
- ☆18Jul 10, 2024Updated last year
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,739Updated this week