A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆34Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for Lightning-ReLoRA
Users that are interested in Lightning-ReLoRA are comparing it to the libraries listed below
Sorting:
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆35Apr 18, 2025Updated 10 months ago
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Feb 9, 2026Updated 3 weeks ago
- Test your local LLMs on the AIME problems☆32Jun 7, 2025Updated 8 months ago
- ☆13Jun 26, 2024Updated last year
- ☆22Jan 13, 2025Updated last year
- Categorize credit card transactions using a local large language model similar to GPT3☆15Dec 29, 2023Updated 2 years ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 4 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- This is an LLM interface that you can use to analyze and get insight into diary entries or other documents completely offline.☆16Dec 31, 2023Updated 2 years ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- AI Lead Generation Agent that automatically discovers and qualifies potential leads from Quora. Using Firecrawl for intelligent web scrap…☆33Jan 24, 2025Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Official Implementation of the ACL2024 Findings paper "Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attr…☆19May 18, 2024Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 9 months ago
- ☆24Jun 1, 2024Updated last year
- Generative AI web UI and server☆22May 23, 2023Updated 2 years ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024)☆51Mar 17, 2024Updated last year
- ☆23Jun 4, 2024Updated last year
- A simple Fast API Backend for Ironclad/rivet☆26Jan 9, 2024Updated 2 years ago
- A multimodal, function calling powered LLM webui.☆215Sep 23, 2024Updated last year
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Oct 19, 2025Updated 4 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- ☆235Jun 11, 2024Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Dec 10, 2024Updated last year
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆25Oct 15, 2023Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆262Apr 23, 2024Updated last year
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated 2 years ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd☆26Jul 5, 2023Updated 2 years ago
- Mixture-of-Ollamas☆30Aug 12, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Nov 5, 2024Updated last year
- Inference of Mamba and Mamba2 models in pure C☆197Jan 22, 2026Updated last month
- ☆111Jun 15, 2025Updated 8 months ago