GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆22Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Galore-pytorch
Users that are interested in Galore-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Mar 12, 2026Updated 2 weeks ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆34Feb 19, 2025Updated last year
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 10 months ago
- ✌ CLoG: Benchmarking Continual Learning of Image Generation Models☆20Jun 10, 2024Updated last year
- ☆13Jun 25, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated last year
- Code for paper: A Neural Span-Based Continual Named Entity Recognition Model☆18Dec 11, 2023Updated 2 years ago
- Homeworks, Midterm, & Capstone from ML BookCamp☆16Jan 28, 2022Updated 4 years ago
- This the implementation of LeCo☆32Jan 20, 2025Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,681Oct 28, 2024Updated last year
- ☆17May 1, 2022Updated 3 years ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Nov 30, 2022Updated 3 years ago
- ☆19Nov 30, 2024Updated last year
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆19May 31, 2025Updated 9 months ago
- A package for Hangul (korean alphabet)☆13Dec 19, 2022Updated 3 years ago
- WebNLG+ Challenge 2020: Scripts to evaluate the RDF-to-text task with automatic metrics (BLEU, METEOR, chrF++, TER and BERT-Score)☆18Aug 20, 2024Updated last year
- The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"☆20Nov 23, 2024Updated last year
- ☆11Nov 18, 2023Updated 2 years ago
- PyTorch implementation of Joint Privacy Enhancement and Quantization in Federated Learning (IEEE TSP 2023, IEEE ICASSP 2023, IEEE ISIT 20…☆18Oct 28, 2025Updated 5 months ago
- The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning"☆32Jan 28, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆12Mar 8, 2020Updated 6 years ago
- 河海大学每日健康打卡☆12Dec 4, 2021Updated 4 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- ☆22Apr 27, 2024Updated last year
- ☆24Jun 7, 2021Updated 4 years ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆14Feb 18, 2020Updated 6 years ago
- This is a simple lab I have implemented to test Knowledge Augmented or Retrieval Augmented Generation (RAG) with Large Language Models. I…☆10Dec 10, 2023Updated 2 years ago
- ☆27Jul 11, 2024Updated last year
- ☆16Dec 30, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Apr 19, 2024Updated last year
- This is code for How Do Social Bots Participate in Misinformation Spread? A Comprehensive Dataset and Analysis☆16Nov 5, 2025Updated 4 months ago
- ☆14Jul 17, 2025Updated 8 months ago
- ☆35Aug 23, 2023Updated 2 years ago
- Playground project acting as an example for a complex LangChain workflow☆11Jun 20, 2023Updated 2 years ago
- ☆27Oct 9, 2024Updated last year
- Pytorch implementation of Tree Preference Optimization (TPO) (Accepted by ICLR'25)☆26Apr 24, 2025Updated 11 months ago