[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
☆49Jun 17, 2025Updated 8 months ago
Alternatives and similar repositories for DeFT
Users who are interested in DeFT are comparing it to the libraries listed below
- ☆27Mar 24, 2025Updated 11 months ago
- ☆13Sep 2, 2023Updated 2 years ago
- A platform to develop CTM-motivated AI architecture.☆15Updated this week
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆26Apr 15, 2025Updated 10 months ago
- Aligntune: A Modular Toolkit for Post-Training Alignment of LLMs☆33Updated this week
- Course website for Systems Verification Fall 2024☆14Jul 10, 2025Updated 7 months ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆14Nov 14, 2024Updated last year
- World Modeling by Forecasting Vision Foundation Model Features☆35Jan 7, 2026Updated last month
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆51Jul 15, 2025Updated 7 months ago
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆17Jan 11, 2025Updated last year
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated last year
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)…☆64Jun 11, 2025Updated 8 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆56Apr 1, 2025Updated 11 months ago
- This project is my attempt at automating work in Notion.☆17Aug 28, 2025Updated 6 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- ☆20Dec 24, 2024Updated last year
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Aug 5, 2025Updated 6 months ago
- ☆20Dec 2, 2024Updated last year
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- Generate a class-schedule ICS file in one click, ready to import directly into the iOS Calendar☆21Mar 18, 2023Updated 2 years ago
- ☆22Sep 26, 2024Updated last year
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Jul 27, 2025Updated 7 months ago
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆59Nov 24, 2025Updated 3 months ago
- ☆224Nov 19, 2025Updated 3 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 5 months ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆39May 26, 2025Updated 9 months ago
- An implementation of 'simple diffusion: End-to-end diffusion for high resolution images' as published by Hoogeboom et al.☆37Feb 9, 2025Updated last year
- Stability-AI's SV3D (ECCV 2024 oral, Voleti et al.) in the diffusers convention.☆31Feb 5, 2025Updated last year
- ☆32Dec 20, 2023Updated 2 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆26Jan 17, 2022Updated 4 years ago
- ☆51Aug 22, 2025Updated 6 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild☆39Feb 21, 2026Updated last week
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated last month
- ☆29Mar 31, 2023Updated 2 years ago
- configurations of my computer and my brain☆63Jun 7, 2024Updated last year
- PSDR-Room: Single Photo to Scene using Differentiable Rendering (Siggraph Asia 2023)☆32Dec 2, 2023Updated 2 years ago
- [ICCV 2025] PyTorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆49Jul 28, 2025Updated 7 months ago