Quartet II Official Code
☆72May 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for Quartet-II
Users that are interested in Quartet-II are comparing it to the libraries listed below.
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆30Jan 22, 2026Updated 4 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 7 months ago
- An experimental communicating attention kernel based on DeepEP.☆34Jul 29, 2025Updated 9 months ago
- Spectral Sphere Optimizer☆116Mar 23, 2026Updated 2 months ago
- Pushes the latest arXiv papers to users every day☆18Aug 12, 2025Updated 9 months ago
- MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models☆28Apr 2, 2026Updated last month
- Python client for 十字街☆10Sep 25, 2021Updated 4 years ago
- 🎓Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10May 18, 2026Updated last week
- KV cache compression via sparse coding☆17Oct 26, 2025Updated 6 months ago
- An operating system cobbled together in a state of delirium, based on a purely hand-written macro-defined kernel and an early Unix-like kernel☆12Jul 19, 2025Updated 10 months ago
- Instruction Following Eval☆17Jan 16, 2025Updated last year
- A ComfyUI and ComfyScript Gradio-based app for generating characters using a multi-step process.☆19Nov 5, 2025Updated 6 months ago
- [ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.☆19Jun 19, 2025Updated 11 months ago
- Multipurpose lens post process effects node for ComfyUI. Realistic or stylistic lens distortions, chromatic aberration, post-process scal…☆22Jul 10, 2025Updated 10 months ago
- ☆37Aug 7, 2025Updated 9 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ☆49May 20, 2025Updated last year
- Work in progress.☆80Nov 25, 2025Updated 6 months ago
- Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”☆187Apr 21, 2026Updated last month
- cutile kernel examples☆49Apr 3, 2026Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning☆54Mar 2, 2026Updated 2 months ago
- ☆22Dec 3, 2025Updated 5 months ago
- ☆37Apr 21, 2026Updated last month
- Official implementation (PyTorch) of "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆34Feb 22, 2026Updated 3 months ago
- ☆44Apr 28, 2026Updated 3 weeks ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 6 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- ☆43Oct 11, 2025Updated 7 months ago
- A reducer enhancer for using an xstate chart with redux☆13Mar 5, 2018Updated 8 years ago
- Official Repository for Task-Circuit Quantization☆27Jun 1, 2025Updated 11 months ago
- [TMM 2025] Official Implementation of DreamJourney: Perpetual View Generation with Video Diffusion Models☆19Jun 24, 2025Updated 11 months ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- The official repository of the first version of ACE-Brain foundation model.☆76Mar 13, 2026Updated 2 months ago
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- [MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving☆340Jul 2, 2024Updated last year
- ☆20May 24, 2025Updated last year
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- ☆84Nov 10, 2025Updated 6 months ago
- Official implementation of Decoupled MeanFlow☆42Oct 28, 2025Updated 6 months ago