SmallDoges / flash-dmattnLinks
Flash Dynamic Mask Attention
☆275Updated this week
Alternatives and similar repositories for flash-dmattn
Users that are interested in flash-dmattn are comparing it to the libraries listed below
Sorting:
- ☆180Updated 4 months ago
- 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.☆81Updated 2 months ago
- [ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" i…☆151Updated 3 months ago
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆130Updated 2 months ago
- Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"☆212Updated last month
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆74Updated 5 months ago
- Official code for "Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models"☆294Updated 3 months ago
- 大语言模型(LLM)理论+代码教程,由浅入深,带你系统学习LLM的理论知识并从代码层面理解其如何实现。☆63Updated 4 months ago
- Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆200Updated 2 months ago
- AI Agent configuration platform based on LangChain and LangGraph technology that enables limited programmability☆123Updated this week
- [ACMMM 2025] "Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts" (Official Implementation)☆92Updated last month
- ☆115Updated 3 months ago
- CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025)☆130Updated last week
- Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"☆81Updated last month
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆240Updated 10 months ago
- A curated list of papers on reinforcement learning for video generation☆120Updated last month
- Official code for paper "Learning to Use Tools via Cooperative and Interactive Agents"☆224Updated last year
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).☆162Updated last month
- Financial News AI Analysis Notification Service☆171Updated 2 months ago
- 🌳 An educational modern C++ deep learning framework supporting CUDA☆58Updated 2 months ago
- 智云星课是一个基于Spring Boot和AI技术的智能学习平台后端系统,提供了丰富的教学管理功能和AI互动特性,为用户打造全方位、智能化的学习体验。项目集成了Dify AI接口,支持AI聊天、课 程学习、每日学习资料等多种智能学习功能。☆117Updated last month
- 🔧 A comprehensive stereo matching toolbox for efficient development and research.☆171Updated this week
- 🏆 ICML 2025 Spotlight☆302Updated last month
- ☆129Updated last month
- ☆88Updated 3 weeks ago
- Smart form with AI☆147Updated 2 months ago
- Pokemon-specific intelligent chat assistant☆192Updated last week
- In-depth analysis of the engineering application examples and performance optimization of data filters in SQL and Redis☆216Updated 2 weeks ago
- UISCI(Urban Intersection Safety-Critical Interaction)Dataset☆137Updated 2 months ago
- 智能助手系统(电商和医疗等多个领域) - 基于 FastAPI + Vue 3 的 AI 对话和搜索平台☆86Updated last week