kyegomez / AlphaDev
Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning", which discovered a new, faster sorting algorithm.
☆11 · Updated 2 years ago
Alternatives and similar repositories for AlphaDev
Users who are interested in AlphaDev compare it to the libraries listed below.
- ☆16 · Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support ☆27 · Updated last year
- Implementation of Hyena Hierarchy in JAX ☆10 · Updated 2 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 · Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Updated 3 years ago
- A Learnable LSH Framework for Efficient NN Training ☆34 · Updated 4 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation ☆30 · Updated 11 months ago
- ☆20 · Updated 2 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent ☆17 · Updated 3 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 · Updated last year
- Fun project to run your own LLM chat bot using llama.cpp ☆11 · Updated 2 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024" ☆13 · Updated 10 months ago
- ☆29 · Updated 3 years ago
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction. ☆11 · Updated 11 months ago
- ☆11 · Updated 2 years ago
- ☆23 · Updated last year
- Repository for CPU Kernel Generation for LLM Inference ☆27 · Updated 2 years ago
- ☆28 · Updated 9 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Updated 2 years ago
- ACL 2023 ☆39 · Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers ☆48 · Updated 3 years ago
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference ☆25 · Updated 7 months ago
- Experiments to assess SPADE on different LLM pipelines. ☆17 · Updated last year
- ☆22 · Updated 2 years ago
- Factorized Neural Layers ☆31 · Updated 2 years ago
- Make triton easier ☆50 · Updated last year
- ☆19 · Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆14 · Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆30 · Updated 9 months ago
- ☆10 · Updated last year