Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
☆56Apr 12, 2024Updated 2 years ago
Alternatives and similar repositories for candle
Users that are interested in candle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Sep 29, 2021Updated 4 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,948Mar 8, 2024Updated 2 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 11 months ago
- ☆13Jan 20, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using …☆17Jun 29, 2020Updated 5 years ago
- Implementation for Proximal Neural Networks.☆11Sep 17, 2021Updated 4 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆40Sep 26, 2024Updated last year
- Prompt Generator model for Stable Diffusion Models☆12Jun 20, 2023Updated 2 years ago
- ☆20May 14, 2026Updated last week
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆22Jun 13, 2024Updated last year
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- My personal solutions to some textbook problems☆11Feb 12, 2020Updated 6 years ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆62Apr 8, 2024Updated 2 years ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆17Dec 19, 2024Updated last year
- An informal list of libraries, programs, examples, and benchmarks using the Numba CUDA target☆16Sep 28, 2023Updated 2 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆28Jul 31, 2025Updated 9 months ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Multiform Ensemble Self-Supervised Learning for Few-Shot Remote Sensing Scene Classification☆13Mar 10, 2023Updated 3 years ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- TensorFlow implementation of the Dissimilarity Mixture Autoencoder: https://arxiv.org/abs/2006.08177☆13Dec 8, 2022Updated 3 years ago
- Download and analyze your computer usage data from RescueTime☆13Mar 7, 2021Updated 5 years ago
- This paper has been accepted by IEEE Transactions on Image Processing.☆10Feb 24, 2023Updated 3 years ago
- ☆20May 24, 2025Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆133Oct 18, 2024Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- MO-LightGBM is a gradient boosting framework based on decision tree algorithms, used for Multi-objective learning to rank tasks.☆20Apr 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Nonlinear SVGD for Learning Diversified Mixture Models☆13Jan 23, 2019Updated 7 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- Changes in this fork has been merged to upstream.☆16Jun 10, 2025Updated 11 months ago
- remote sensing scene classification☆12Mar 1, 2023Updated 3 years ago
- ☆18Oct 18, 2024Updated last year
- pyhessian is a TensorFlow module which can be used to estimate Hessian matrices☆25Mar 26, 2021Updated 5 years ago
- ☆23Feb 24, 2022Updated 4 years ago