☆77Mar 23, 2026Updated last week
Alternatives and similar repositories for flash-attention-turing
Users that are interested in flash-attention-turing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A static deobfuscator for JavaScript Malware☆13May 6, 2020Updated 5 years ago
- AI21 Typescript SDK☆13Dec 18, 2025Updated 3 months ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm☆16Apr 19, 2023Updated 2 years ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆23Nov 26, 2025Updated 4 months ago
- A robust Node.js proxy server that automatically rotates API keys for Gemini and OpenAI APIs when rate limits (429 errors) are encountere…☆43Jan 20, 2026Updated 2 months ago
- ☆16Mar 12, 2025Updated last year
- llama.cpp gguf file parser for javascript☆50Dec 11, 2024Updated last year
- Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)☆15Jul 17, 2025Updated 8 months ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Jan 12, 2023Updated 3 years ago
- Javascripts Deobfuscator. Used to debug obfuscated JS from obfuscator.io and other obfuscate tools.