flash-algo / flash-sparse-attentionLinks
Trainable fast and memory-efficient sparse attention
☆526Updated last week
Alternatives and similar repositories for flash-sparse-attention
Users that are interested in flash-sparse-attention are comparing it to the libraries listed below
Sorting:
- Updating curated list of research advancements on item identification in generative recommender systems.☆50Updated 2 weeks ago
- ☆176Updated 9 months ago
- ☆76Updated 3 weeks ago
- 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.☆77Updated 8 months ago
- Official repo for 'Large Multimodal Models Evaluation: A Survey'☆100Updated 2 months ago
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆129Updated 8 months ago
- CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Lear…☆127Updated 2 months ago
- Official implementation of our NeurIPS 2025 paper: "FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mix…☆177Updated 2 months ago
- ☆76Updated 3 months ago
- [ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" i…☆142Updated 8 months ago
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆71Updated 10 months ago
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆15Updated 5 months ago
- A High-Performance LLM Inference Engine with vLLM-Style Continuous Batching☆89Updated last month
- Official code for TOIS2026 "Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models"☆275Updated 3 weeks ago
- A Flask-based framework for small and medium-sized book sales enterprises or online bookstore book management system.☆293Updated 4 months ago
- A minimalist multi-agent framework for rubost automation of scientific analysis workflows, such as gene expression analysis.☆131Updated 4 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆265Updated last year
- Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-e…☆92Updated 2 months ago
- ☆43Updated 3 months ago
- [ICML'25] CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features☆120Updated 5 months ago
- Official code for NeurIPS2025 "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"☆227Updated 3 weeks ago
- BizSpring Java开发定制,线上商城,购物商城,商城网站,在线购物,免费建站,mall☆94Updated 2 months ago
- [ICLR'26] Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling☆37Updated 2 weeks ago
- Resource collection of medical agent for clinical dialogue and health☆248Updated last week
- Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆210Updated last month
- This is the code related to "🔥Effective Training Data Synthesis for Improving MLLM Chart Understanding" (ICCV 2025).☆80Updated 5 months ago
- TORM是一个基于Go语言开发的高性能ORM(对象关系映射)框架,灵感来源于PHP ThinkORM。它提供了简洁易用的API、强大的查询构造器、完整的模型系统以及丰富的功能。☆69Updated last month
- BizSpring Java商城平台首选,分销商城,b2b2c商城,saas商城,java电商系统, 可商用;Vue3,Element UI Plus,Uniapp,微服务,SpringCloud,跨境电商,跨境商城,电商国际化,外贸,独立站,多国语言,移动商城,小程序商城…☆88Updated 2 months ago
- 以帮助你快速找到 LLM 相关工作,尽快抓住 AI 红利为目标的【LLM 教程】☆114Updated last week
- BizSpring Java 跨境电商独立站,外贸独立站,跨境独立站,独立站搭建,多国语言☆110Updated 2 months ago