hanabi95 / openpilot
Custom fork for Bolt EV
☆16Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for openpilot
- Evaluating majors LLMs on the Abstraction and Reasoning Corpus☆14Updated last year
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆42Updated 11 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆68Updated 3 months ago
- ☆15Updated last year
- ☆13Updated 4 months ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆100Updated 11 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆137Updated last week
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Updated last year
- Pytorch (Lightning) implementation of the Mamba model☆14Updated 6 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆108Updated 5 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated last month
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆88Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆50Updated 7 months ago
- Implementation of the Llama architecture with RLHF + Q-learning☆157Updated 11 months ago
- ☆34Updated 3 weeks ago
- Implementation of DreamerV3 in Pytorch☆33Updated this week
- Implementation of Infini-Transformer in Pytorch☆104Updated last month
- ☆62Updated 3 months ago
- DBI im 4. Semester AIF/KIF bzw. im III. Jahrgang HIF☆7Updated 11 months ago
- Exploration into the Firefly algorithm in Pytorch☆35Updated 2 months ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆59Updated 6 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆200Updated 5 months ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆109Updated last week
- JAX implementation of the Mistral 7b v0.2 model☆33Updated 4 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆103Updated 3 months ago
- WIP☆89Updated 3 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆62Updated last week
- A set of Python scripts that makes your experience on TPU better☆40Updated 4 months ago
- Implementation of Liquid Nets in Pytorch☆52Updated last week
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise☆31Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆172Updated 2 months ago