☆14Dec 21, 2025Updated 3 months ago
Alternatives and similar repositories for candle-flash-attn-v3
Users that are interested in candle-flash-attn-v3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Graph model execution API for Candle☆17Jul 27, 2025Updated 8 months ago
- GPU based FFT written in Rust and CubeCL☆32Feb 23, 2026Updated last month
- Fast, Lightweight, Unified Engine for Text2Image Diffusion Models☆20Apr 13, 2025Updated last year
- Candle Pipelines provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered …☆23Jan 5, 2026Updated 3 months ago
- Golang SDK for Truss☆40Apr 8, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆41Nov 18, 2024Updated last year
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆42Mar 15, 2024Updated 2 years ago
- ☆12Feb 22, 2024Updated 2 years ago
- An iTransformer implementation in Rust☆19Jan 10, 2026Updated 3 months ago
- Sampling techniques for Candle.☆21Apr 3, 2024Updated 2 years ago
- ☆12Sep 27, 2017Updated 8 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- A rust wrapper for HIP☆12Jun 10, 2025Updated 10 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 8-bit floating point types for Rust☆63Feb 4, 2026Updated 2 months ago
- The DynaGen model allows the prediction of dynamical field data based on microstructure input. The model is exemplified to dynamic fractu…☆12Mar 17, 2023Updated 3 years ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- Rust Workspace Bootstrapper☆18Oct 5, 2025Updated 6 months ago
- implement llava using candle☆15Jun 9, 2024Updated last year
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB☆16Feb 21, 2026Updated last month
- Cray-LM unified training and inference stack.☆22Jan 30, 2025Updated last year
- ☆17Jun 9, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Falcon is a powerful, interpreted programming language.☆17Jan 22, 2023Updated 3 years ago
- ☆19Dec 31, 2025Updated 3 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- An autonomous robot, powered by AI.☆18May 3, 2022Updated 3 years ago
- ☆37Mar 5, 2026Updated last month
- WebCodecs API in Node.js☆86Updated this week
- Automatically derive Python dunder methods for your Rust code☆25Apr 7, 2026Updated last week
- Minimalist ML framework for Rust☆19Dec 4, 2025Updated 4 months ago
- ☆22Apr 9, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆23Jul 27, 2024Updated last year
- An automatic differentiation system for dense and sparse problems☆13Jan 16, 2025Updated last year
- Minimalist ML framework for Rust☆22Feb 28, 2026Updated last month
- A Fish Speech implementation in Rust, with Candle.rs☆110Jun 5, 2025Updated 10 months ago
- A collection of optimisers for use with candle☆46Apr 6, 2026Updated 2 weeks ago
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- A faster Arc.☆82Feb 8, 2024Updated 2 years ago