Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
☆54Apr 12, 2024Updated last year
Alternatives and similar repositories for candle
Users that are interested in candle are comparing it to the libraries listed below.
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Sep 29, 2021Updated 4 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,931Mar 8, 2024Updated 2 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- The code of "Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering", ECML-PKDD 2020.☆21Dec 8, 2022Updated 3 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using …☆17Jun 29, 2020Updated 5 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- ☆15Sep 13, 2022Updated 3 years ago
- ☆51Jan 28, 2024Updated 2 years ago
- Implementation for Proximal Neural Networks.☆11Sep 17, 2021Updated 4 years ago
- Hessian trace estimation using PyTorch and Hutch++☆20Oct 29, 2020Updated 5 years ago
- Experimenting with cellular automata☆16Mar 18, 2021Updated 5 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Self-study of CS182 (Spring 2021) at UC Berkeley - Designing, Visualizing and Understanding Deep Neural Networks☆11Sep 13, 2023Updated 2 years ago
- Annotated version of the Mamba paper☆499Feb 27, 2024Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆38Sep 26, 2024Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆66Aug 15, 2025Updated 7 months ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- Breaking the 'doomscrolling' cycle with Contre Sozial.☆16Jan 20, 2021Updated 5 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- ☆16Jun 25, 2022Updated 3 years ago
- A minimal WebRTC SFU Implementation☆19Jun 15, 2025Updated 9 months ago
- BQN↔NumPy bridge☆22Sep 26, 2025Updated 6 months ago
- Code of paper 《Remote Sensing Image Scene Classification Based on an Enhanced Attention Module》☆11Apr 2, 2020Updated 5 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- Collect papers about Mamba (a selective state space model).☆14Aug 6, 2024Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality☆20Oct 22, 2025Updated 5 months ago
- Code for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆24Jul 31, 2025Updated 7 months ago
- ☆10Apr 25, 2024Updated last year
- This is the pytorch demo code for Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain, (PTMDA) (IEEE Transactions on Ima…☆11Apr 15, 2022Updated 3 years ago
- ☆20May 31, 2024Updated last year
- A web client for Linux from scratch in C for a variety of alternative web protocols☆17Nov 4, 2023Updated 2 years ago