[ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)
☆95May 13, 2026Updated last month
Alternatives and similar repositories for GRAPE
Users that are interested in GRAPE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML'26] Phonon fine-tuning (PFT) and [NeurIPS'25 AI4Mat] Nequix: Training a foundation model for materials on a budget☆75Apr 5, 2026Updated 2 months ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆97Mar 6, 2026Updated 3 months ago
- ☆248Nov 19, 2025Updated 6 months ago
- ☆48Jun 16, 2025Updated 11 months ago
- A Python package for data-mining the QM9 dataset☆20Mar 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ACL 2026 & NAACL 2025: Bridging Retrieval and Inference through Evidence Fusion☆13Apr 9, 2026Updated 2 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆35Apr 6, 2026Updated 2 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] E2Former: An Efficient and Equivariant Transformer with Linear-Scaling Tensor Products☆29Feb 16, 2026Updated 3 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 9 months ago
- ☆29Jul 9, 2024Updated last year
- [ICLR 2025] Official Implementation of "Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy …☆25Apr 17, 2025Updated last year
- Train and run transformers directly on Apple's Neural Engine in Swift bypass coreml entirely☆115Apr 18, 2026Updated last month
- The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆165May 15, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆88Mar 22, 2026Updated 2 months ago
- Universal Reasoning Model☆131Jan 15, 2026Updated 4 months ago
- Understand Human Behavior to Align True Needs☆25Jul 11, 2024Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 8 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆72Apr 4, 2026Updated 2 months ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆30Sep 25, 2021Updated 4 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated last month
- React-OT is a generative transition state search model developed by DeepPrinciple, which uses Optimal Transport (OT) methods to generate …☆46Aug 25, 2025Updated 9 months ago
- Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass☆21Aug 22, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆81Feb 27, 2026Updated 3 months ago
- Awesome Triton Resources☆43Apr 27, 2025Updated last year
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>☆75Jan 8, 2026Updated 5 months ago
- ☆33Nov 4, 2024Updated last year
- 🔥 A minimal training framework for scaling FLA models☆391Apr 22, 2026Updated last month
- An easy way for debug python for Slurm HPC users.☆27Mar 23, 2025Updated last year
- ☆13May 21, 2024Updated 2 years ago
- A simple python package for Neural Network based on numpy☆13Sep 6, 2021Updated 4 years ago
- BigKnow2022: Bringing Language Models Up to Speed☆16Mar 27, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆39Feb 20, 2026Updated 3 months ago
- Mindless molecule generator in a Python package.☆42Jan 22, 2026Updated 4 months ago
- ☆51May 26, 2026Updated 2 weeks ago
- ☆13Jan 14, 2026Updated 5 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 11 months ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- [CVPR 2026] Official repository for "Reviving ConvNeXt for Efficient Convolutional Diffusion Models"☆69Mar 26, 2026Updated 2 months ago