corl-team / rebased
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
☆159Updated 2 months ago
Alternatives and similar repositories for rebased:
Users that are interested in rebased are comparing it to the libraries listed below
- Effective LLM Alignment Toolkit☆122Updated last week
- ☆71Updated 6 months ago
- ☆31Updated 5 months ago
- ☆39Updated this week
- ☆20Updated 8 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆123Updated 3 months ago
- ☆51Updated 4 months ago
- ☆41Updated last week
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆98Updated 2 months ago
- RWKV-7: Surpassing GPT☆81Updated 4 months ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated last month
- ☆49Updated last year
- QuIP quantization☆52Updated last year
- ☆21Updated last year
- ☆53Updated last year
- Evalica, your favourite evaluation toolkit☆31Updated 2 weeks ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆223Updated last month
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind☆121Updated 6 months ago
- RWKV, in easy to read code☆71Updated 3 months ago
- Framework for processing and filtering datasets☆27Updated 7 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆53Updated 11 months ago
- Normalized Transformer (nGPT)☆159Updated 4 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆86Updated 8 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆196Updated 8 months ago
- A benchmark for role-playing language models☆88Updated this week
- supporting pytorch FSDP for optimizers☆79Updated 3 months ago
- PyTorch implementation of models from the Zamba2 series.☆177Updated last month