The official code for Dropping Backward Propagation (DropBP)
☆32 · Oct 29, 2024 · Updated last year
Alternatives and similar repositories for dropbp
Users that are interested in dropbp are comparing it to the libraries listed below.
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning ☆38 · Apr 4, 2024 · Updated 2 years ago
- Official implementation of ECCV24 paper: POA ☆24 · Aug 8, 2024 · Updated last year
- ☆11 · May 24, 2024 · Updated 2 years ago
- ☆16 · Oct 4, 2024 · Updated last year
- An awesome list that curates the best Flet tools, tutorials, blogs and more. ☆10 · Jan 8, 2023 · Updated 3 years ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- 🧮 Algebraic Positional Encodings. ☆21 · Jun 5, 2026 · Updated last week
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity ☆76 · Mar 10, 2026 · Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆84 · Jan 14, 2025 · Updated last year
- Finetuning LLaMA with DeepSpeed ☆10 · Apr 14, 2023 · Updated 3 years ago
- itertree python package - full featured tree data structure ☆15 · Sep 8, 2025 · Updated 9 months ago
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning ☆13 · Sep 2, 2024 · Updated last year
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation ☆10 · May 19, 2025 · Updated last year
- Code related to the ELM neuron. ☆15 · Feb 27, 2024 · Updated 2 years ago
- ☆18 · Apr 8, 2025 · Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion" ☆14 · Mar 28, 2024 · Updated 2 years ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project ☆44 · Dec 11, 2023 · Updated 2 years ago
- sigma-MoE layer ☆21 · Jan 5, 2024 · Updated 2 years ago
- ☆14 · Jan 17, 2024 · Updated 2 years ago
- nanoGPT using Equinox ☆15 · Mar 3, 2023 · Updated 3 years ago
- ☆20 · Jan 26, 2026 · Updated 4 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024) ☆209 · May 20, 2024 · Updated 2 years ago
- ☆16 · Apr 26, 2023 · Updated 3 years ago
- An Efficient Supply Chain Management System using Blockchain & Machine Learning. ☆10 · Nov 27, 2019 · Updated 6 years ago
- Rookie's guide ☆13 · Aug 10, 2024 · Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated last year
- ☆20 · Oct 25, 2022 · Updated 3 years ago
- Many things I've done with different programming languages ☆14 · Aug 26, 2020 · Updated 5 years ago
- ☆19 · Jan 3, 2025 · Updated last year
- Intersection Over Union ☆15 · Nov 26, 2017 · Updated 8 years ago
- ☆30 · Oct 7, 2024 · Updated last year
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023 ☆22 · Mar 12, 2026 · Updated 3 months ago
- ☆18 · Dec 7, 2025 · Updated 6 months ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading ☆12 · Jun 28, 2025 · Updated 11 months ago
- ☆67 · Dec 3, 2024 · Updated last year
- Auto Build Deepspeed ☆19 · Oct 10, 2025 · Updated 8 months ago
- Lightweight arXiv literature digest skill for OpenClaw — Zotero-driven interest profiling, 3-dimensional candidate ranking, abstract-firs… ☆47 · Mar 24, 2026 · Updated 2 months ago
- An implementation of the Hopfield Network using PyTorch, leveraging CUDA for linear algebra speedup ☆15 · Nov 19, 2025 · Updated 6 months ago
- ☆19 · Feb 15, 2023 · Updated 3 years ago