JAX bindings for the flash-attention3 kernels
☆23Jan 2, 2026Updated 5 months ago
Alternatives and similar repositories for jax-flash-attn3
Users that are interested in jax-flash-attn3 are comparing it to the libraries listed below.
- This repository provides a tutorial that covers running a sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Mar 17, 2026Updated 3 months ago
- The Open-Source Implementation of Cognition AI's Automated Software Engineer, Devin.☆16Mar 13, 2024Updated 2 years ago
- OCaml ONNX runtime powered by onnxruntime☆25Aug 21, 2023Updated 2 years ago
- Proof of concept for running moshi/hibiki using webrtc☆21Feb 28, 2025Updated last year
- BEVFusion implementation in ROS2☆32Apr 15, 2025Updated last year
- Empowering LLM Agents for Real-World Computer System Optimization☆18Sep 10, 2025Updated 9 months ago
- Personal knowledge library☆10Nov 9, 2017Updated 8 years ago
- Awesome-BEV-Perception☆32Jun 27, 2023Updated 2 years ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31May 19, 2026Updated last month
- implement llava using candle☆15Jun 9, 2024Updated 2 years ago
- ☆31May 1, 2022Updated 4 years ago
- C++ inference code for the SMOKE 3D object detection model☆39Dec 1, 2022Updated 3 years ago
- B-Roll: Video data in rosbag2 plugins and utilities☆12Nov 19, 2025Updated 7 months ago
- Watch Bilibili in Emacs☆10Jul 27, 2025Updated 10 months ago
- This project is intended to build and deploy an SNPE model on Qualcomm devices when the model has unsupported layers that are not part of…☆10Oct 4, 2021Updated 4 years ago
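- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated 2 years ago
- Word segmentation and word-vector analysis of classical Chinese poetry, with output to Excel and word clouds☆10Jul 6, 2022Updated 3 years ago
- This project mainly cleans up the open-source MOSS SFT data and converts it to the mnbvc multi-turn dialogue format. MOSS-003 covers three aspects (helpfulness, faithfulness, and harmlessness) with 3.53M samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and more dialogue turns, with 6.3M samples in total,☆13Dec 3, 2023Updated 2 years ago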
- Learn LaTeX online☆15Apr 1, 2022Updated 4 years ago
- YOLOv12 TensorRT end-to-end accelerated inference and INT8 quantization implementation☆14Mar 5, 2025Updated last year
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization track: third-place solution in the preliminary round☆50Aug 16, 2023Updated 2 years ago
- ☆18Aug 14, 2024Updated last year
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Isolates any given process using the unshare system call. Suited for ROS, though can work for any process.☆13Jan 31, 2024Updated 2 years ago
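- Python ONNX inference sample for LDC: Lightweight Dense CNN for Edge Detection☆15May 6, 2023Updated 3 years ago
- DETR tensor: removes the auxiliary heads that are unused during inference, speeds things up further with fp16 deployment, and offers a new fix for the all-zero output problem after converting to TensorRT.☆11Jan 9, 2024Updated 2 years ago
- BERT TensorRT model acceleration and deployment☆10Apr 1, 2022Updated 4 years ago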
- ROS 2 C++ executor bringing low CPU usage, low latency and deterministic ordering☆50Jul 18, 2024Updated last year
- Safe OCaml-Rust Foreign Function Interface☆47Feb 14, 2023Updated 3 years ago
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆20Mar 3, 2025Updated last year
- Andino simulation using o3de☆12Nov 8, 2024Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Go vs Rust AI bot voice conversation☆28Mar 19, 2026Updated 3 months ago
- Based on https://github.com/Sense-GVT/Fast-BEV: deletes the time sequence, updates mm-related code, and adds ONNX export for TensorRT☆12May 12, 2023Updated 3 years ago
- NanoGPT (124M) in 5 minutes☆15Feb 14, 2025Updated last year
- A short introduction to using tlmgr.☆19Feb 24, 2022Updated 4 years ago
- Fast command-line tool to extract useful data from LTTng traces of ROS applications.☆20May 13, 2026Updated last month