Advancing the frontier of efficient AI
☆63May 15, 2026Updated last week
Alternatives and similar repositories for sparse-attention-hub
Users that are interested in sparse-attention-hub are comparing it to the libraries listed below.
- a few utilities to analyze Caffe prototxt files☆16Sep 27, 2017Updated 8 years ago
- ☆28Mar 10, 2026Updated 2 months ago
- Reliable and Efficient Semantic Prompt Caching with vCache☆68Dec 17, 2025Updated 5 months ago
- Code repo for efficient quantized MoE inference with mixture of low-rank compensators☆36Apr 14, 2025Updated last year
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆27Apr 7, 2026Updated last month
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- The repo of "BugLens"☆41Nov 12, 2025Updated 6 months ago
- ☆16Apr 11, 2022Updated 4 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
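- An open-source math large-model project that aims to explore whether large models have mathematical creativity, and their potential in frontier mathematical research.☆19Mar 19, 2026Updated 2 months ago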
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆213Apr 30, 2026Updated 3 weeks ago
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆47Oct 29, 2025Updated 6 months ago
- ☆21Mar 7, 2024Updated 2 years ago
- Code of Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning☆15Apr 29, 2024Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Datalog Engines OPtimization Tester.☆13Jan 18, 2024Updated 2 years ago
- A few models converted from Caffe to Core ML format.☆15Jun 6, 2017Updated 8 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- [TOIS 2023] On the User Behavior Leakage from Recommender System Exposure☆19Nov 7, 2023Updated 2 years ago
- DS SERVE: The Largest Open Vector Store over Pretrain Data; A Framework for Efficient and Scalable Neural Retrieval☆50Jan 28, 2026Updated 3 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 9 months ago
- A toolkit for hybrid log parsing☆18Aug 23, 2023Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆24Feb 19, 2025Updated last year
- ☆32Oct 2, 2024Updated last year
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆42Apr 17, 2026Updated last month
- ☆15Mar 2, 2025Updated last year
- ☆13Mar 14, 2026Updated 2 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆14Nov 6, 2020Updated 5 years ago
- ☆65Jul 14, 2025Updated 10 months ago
- A toolkit for Light Log Anomaly Detection [ICSE'24]☆22Feb 22, 2025Updated last year
- This is the code which powers the Twitter Bot https://twitter.com/RGB_Colours☆15Apr 14, 2017Updated 9 years ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 9 months ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year