Advancing the frontier of efficient AI
☆58Apr 6, 2026Updated last week
Alternatives and similar repositories for sparse-attention-hub
Users that are interested in sparse-attention-hub are comparing it to the libraries listed below.
- ☆27Mar 10, 2026Updated last month
- Reliable and Efficient Semantic Prompt Caching with vCache☆64Dec 17, 2025Updated 3 months ago
- ☆12Apr 9, 2025Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆25Apr 7, 2026Updated last week
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
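- An open-source mathematical large-model project that aims to explore whether large models have mathematical creativity, and what potential they have in frontier mathematical research.☆18Mar 19, 2026Updated 3 weeks ago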
- ☆16Apr 11, 2022Updated 4 years ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆202Apr 7, 2026Updated last week
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- A simple SQL parser based on Apache Calcite.☆13Jan 17, 2026Updated 2 months ago
- ☆21Mar 7, 2024Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- A few models converted from Caffe to Core ML format.☆15Jun 6, 2017Updated 8 years ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 3 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- [TOIS 2023] On the User Behavior Leakage from Recommender System Exposure☆18Nov 7, 2023Updated 2 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago
- ☆22Jun 11, 2024Updated last year
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- [CCIR 2023] Self-supervised learning for Sequential Recommender Systems☆24Nov 7, 2023Updated 2 years ago
- ☆32Oct 2, 2024Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆32Apr 9, 2025Updated last year
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆105Mar 10, 2026Updated last month
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆39Oct 7, 2025Updated 6 months ago
- ☆16Aug 16, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- ☆13Mar 14, 2026Updated last month
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆14Nov 6, 2020Updated 5 years ago
- Benchmarking Semantic Query Processing Engines☆54Mar 22, 2026Updated 3 weeks ago
- ☆65Jul 14, 2025Updated 9 months ago
- Uses Processing and Perlin Noise to generate a procedural 2D rendering of different landscapes, which are then rendered into 3D☆16Aug 14, 2018Updated 7 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 10 months ago
- A toolkit for Light Log Anomaly Detection [ICSE'24]☆22Feb 22, 2025Updated last year
- This is the code which powers the Twitter Bot https://twitter.com/RGB_Colours☆15Apr 14, 2017Updated 9 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year