☆418Dec 26, 2024Updated last year
Alternatives and similar repositories for spring2024-lectures
Users that are interested in spring2024-lectures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆72Jul 13, 2024Updated last year
- ☆101Sep 24, 2024Updated last year
- ☆22Apr 22, 2024Updated last year
- ☆2,857Apr 8, 2026Updated last week
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 6 months ago
- Open-source framework for the research and development of foundation models.☆846Updated this week
- Predicting Out-of-Distribution Error with the Projection Norm☆19Jul 27, 2022Updated 3 years ago
- Numerical Linear Algebra Notes CME 302 Stanford☆28Nov 20, 2025Updated 4 months ago
- Minimalistic large language model 3D-parallelism training☆2,644Apr 7, 2026Updated last week
- [ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…☆20Jul 21, 2022Updated 3 years ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆394Apr 9, 2026Updated last week
- Forked robosuite for LASER project☆12Jan 8, 2021Updated 5 years ago
- ☆308Jul 15, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Artificial Intelligence Professional Program by Stanford School of Engineering☆19May 9, 2023Updated 2 years ago
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- ☆13Mar 22, 2026Updated 3 weeks ago
- ☆12Jul 6, 2023Updated 2 years ago
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality☆336Jan 5, 2026Updated 3 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated 2 years ago
- Generic MCP Client to use any MCP tool in a chat☆44May 11, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆135Mar 30, 2026Updated 2 weeks ago
- ☆28Sep 22, 2025Updated 6 months ago
- My learning notes for ML SYS.☆5,970Apr 8, 2026Updated last week
- Fast and memory-efficient exact attention☆23,344Updated this week
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- GPU programming related news and material links☆2,093Mar 8, 2026Updated last month
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- [AAAI 24] GradTree: Gradient-Based Axis-Aligned Decision Trees☆15Aug 28, 2024Updated last year
- PyTorch native post-training library☆5,728Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated 11 months ago
- Group Meeting Record for Baobao Chang Group in Peking University☆26May 17, 2021Updated 4 years ago
- ☆16Nov 1, 2023Updated 2 years ago
- 🚀 Efficient implementations for emerging model architectures☆4,878Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,603Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆25,643Updated this week
- ☆21Mar 1, 2023Updated 3 years ago