Advancing the frontier of efficient AI
☆66Jun 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for sparse-attention-hub
Users who are interested in sparse-attention-hub are comparing it to the libraries listed below.
- Automated High-Performance GPU Kernel Generation☆114Jun 1, 2026Updated 2 weeks ago
- ☆28Mar 10, 2026Updated 3 months ago
- Bespoke OLAP: Synthesizing Workload-Specific One-size-fits-one Database Engines☆45Apr 30, 2026Updated last month
- Code repo for efficient quantized MoE inference with mixture of low-rank compensators☆36Apr 14, 2025Updated last year
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- For building the world's largest dataset of GPU kernels.☆10Jun 11, 2026Updated last week
- ☆28Apr 7, 2026Updated 2 months ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- The repo of "BugLens"☆41Nov 12, 2025Updated 7 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
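- An open-source mathematical large language model project that aims to explore whether large models are capable of mathematical creativity, and what potential they have in frontier mathematical research.☆20Mar 19, 2026Updated 2 months ago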
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated 2 weeks ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- A simple SQL parser based on Apache Calcite.☆14May 8, 2026Updated last month
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- Just a UI for Chatterbox, which uses about 1-2 GB RAM. Double click and you're good to go.☆20Updated this week
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- A few models converted from Caffe to Core ML format.☆15Jun 6, 2017Updated 9 years ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆43Dec 29, 2025Updated 5 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 4 years ago
- [AAAI26]: DS SERVE: The Largest Open Vector Store over Pretraining Data; A Framework for Efficient and Scalable Neural Retrieval☆51Jan 28, 2026Updated 4 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 9 months ago
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- ☆13Mar 14, 2026Updated 3 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- Benchmarking Semantic Query Processing Engines☆57Jun 8, 2026Updated last week
- ☆66Jul 14, 2025Updated 11 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated last year
- Uses Processing and Perlin Noise to generate a procedural 2D rendering of different landscapes, which are then rendered into 3D☆16Aug 14, 2018Updated 7 years ago
- A toolkit for Light Log Anomaly Detection [ICSE'24]☆22Feb 22, 2025Updated last year
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆112May 3, 2026Updated last month
- This is the code that powers the Twitter bot https://twitter.com/RGB_Colours ☆15Apr 14, 2017Updated 9 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- A python library that allows you to quickly and easily generate HTML templates and even create full-on websites.☆18Apr 28, 2025Updated last year