Deep learning library implemented from scratch in NumPy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
☆54 Apr 12, 2024 Updated last year
Alternatives and similar repositories for candle
Users that are interested in candle are comparing it to the libraries listed below
- Implementation of a simple linear regression algorithm in MAMBA ☆10 Feb 12, 2020 Updated 6 years ago
- ☆13 Jan 20, 2023 Updated 3 years ago
- This is the code that went into our practical dive using mamba as information extraction ☆57 Dec 22, 2023 Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration. ☆55 Aug 31, 2024 Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 Dec 19, 2024 Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications ☆40 Nov 11, 2024 Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch. ☆2,920 Mar 8, 2024 Updated last year
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752) ☆22 Jan 22, 2024 Updated 2 years ago
- The code of "Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering", ECML-PKDD 2020. ☆21 Dec 8, 2022 Updated 3 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs ☆20 Oct 18, 2024 Updated last year
- Annotated version of the Mamba paper ☆497 Feb 27, 2024 Updated 2 years ago
- ☆27 Oct 6, 2024 Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆61 Apr 8, 2024 Updated last year
- A collection of some awesome video object detection series projects. ☆26 Feb 22, 2024 Updated 2 years ago
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series… ☆28 Nov 11, 2024 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆66 Aug 15, 2025 Updated 6 months ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts ☆38 Sep 26, 2024 Updated last year
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script ☆14 Jan 12, 2026 Updated last month
- Clean RL implementation using MLX ☆35 Mar 8, 2024 Updated last year
- A Deepfake detector based on hybrid EfficientNet CNN and Vision Transformer architecture. The model is explainable by rendering a heatma… ☆15 Mar 16, 2022 Updated 3 years ago
- BLEURT implementation in PyTorch ☆37 Jan 19, 2023 Updated 3 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆38 May 26, 2025 Updated 9 months ago
- ☆35 Nov 22, 2024 Updated last year
- ☆20 May 24, 2025 Updated 9 months ago
- ChatGPT CSS style ☆14 Apr 28, 2024 Updated last year
- Generator for Notation Backing Track Videos from Lilypond Files ☆10 Oct 23, 2024 Updated last year
- Stripped Python images based on alpine variant of library's Python ☆10 Jan 20, 2022 Updated 4 years ago
- Optimize with SigOpt with this standalone SigOpt client driver. ☆12 Updated this week
- ☆37 Mar 24, 2024 Updated last year
- My personal solutions to some textbook problems ☆10 Feb 12, 2020 Updated 6 years ago
- PropForthV5.5 is a Forth programming environment for the Parallax Propeller P8X32A microcontroller, created by Sal Sanci ☆11 Aug 14, 2016 Updated 9 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks ☆12 Jul 29, 2023 Updated 2 years ago
- Everything about the Athena ☆10 Oct 3, 2020 Updated 5 years ago
- ☆10 Sep 17, 2023 Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification ☆16 Feb 15, 2023 Updated 3 years ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 May 23, 2024 Updated last year
- The desktop client of a multimedia system ☆11 Feb 9, 2024 Updated 2 years ago
- TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey ☆10 Nov 7, 2024 Updated last year
- G-code generator for 3D printers (RepRap, Makerbot, Ultimaker etc.) ☆12 Oct 21, 2021 Updated 4 years ago