☆424Dec 26, 2024Updated last year
Alternatives and similar repositories for spring2024-lectures
Users that are interested in spring2024-lectures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆71Jul 13, 2024Updated last year
- ☆101Sep 24, 2024Updated last year
- ☆22Apr 22, 2024Updated 2 years ago
- ☆2,922Apr 29, 2026Updated last week
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Personal blog + reading notes on system-ish papers☆16Oct 29, 2023Updated 2 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 7 months ago
- Open-source framework for the research and development of foundation models.☆923Updated this week
- AI安全开放社区官方文档☆26Apr 11, 2026Updated 3 weeks ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆404Apr 28, 2026Updated last week
- Forked robosuite for LASER project☆12Jan 8, 2021Updated 5 years ago
- ☆308Jul 15, 2024Updated last year
- Artificial Intelligence Professional Program by Stanford School of Engineering☆19May 9, 2023Updated 2 years ago
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My personal website☆15Dec 31, 2025Updated 4 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality☆342Jan 5, 2026Updated 4 months ago
- Applies ROME and MEMIT on Mamba-S4 models☆15Apr 5, 2024Updated 2 years ago
- ☆141Mar 30, 2026Updated last month
- ☆28Sep 22, 2025Updated 7 months ago
- My learning notes for ML SYS.☆6,166Apr 23, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Fast and memory-efficient exact attention☆23,628Updated this week
- Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018☆18Apr 13, 2021Updated 5 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- General fair regression subject to demographic parity constraint. Paper appeared in ICML 2019.☆16Jul 5, 2020Updated 5 years ago
- [AAAI 24] GradTree: Gradient-Based Axis-Aligned Decision Trees☆15Aug 28, 2024Updated last year
- PyTorch native post-training library☆5,750Updated this week
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated last year
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Group Meeting Record for Baobao Chang Group in Peking University☆26May 17, 2021Updated 4 years ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,046Updated this week
- ☆18Jul 10, 2024Updated last year
- 🚀 Efficient implementations for emerging model architectures☆5,032Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆26,832Updated this week
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 4 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"