Stop messing around with finicky sampling parameters and just use DRµGS!
☆360 · Jun 1, 2024 · Updated last year
Alternatives and similar repositories for DRUGS
Users that are interested in DRUGS are comparing it to the libraries listed below
- Fast approximate inference on a single GPU with sparsity aware offloading ☆39 · Jan 4, 2024 · Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆34 · Mar 2, 2024 · Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 · May 26, 2024 · Updated last year
- Experimental LLM Inference UX to aid in creative writing ☆128 · Dec 14, 2024 · Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 · May 5, 2024 · Updated last year
- LLM backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- Visualize the intermediate output of Mistral 7B ☆386 · Jan 22, 2025 · Updated last year
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field. ☆439 · Jan 11, 2025 · Updated last year
- Tools for merging pretrained large language models. ☆6,826 · Updated this week
- Token Omission Via Attention ☆127 · Oct 13, 2024 · Updated last year
- ☆579 · Oct 29, 2024 · Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆261 · Apr 23, 2024 · Updated last year
- Large-scale LLM inference engine ☆1,658 · Feb 17, 2026 · Updated 2 weeks ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction ☆390 · Jul 9, 2024 · Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆10 · Sep 14, 2025 · Updated 5 months ago
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆665 · Jun 1, 2024 · Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆48 · Sep 26, 2024 · Updated last year
- Entropy Based Sampling and Parallel CoT Decoding ☆3,434 · Nov 13, 2024 · Updated last year
- A library for making RepE control vectors ☆691 · Sep 24, 2025 · Updated 5 months ago
- Attend - to what matters. ☆17 · Feb 22, 2025 · Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆67 · Nov 5, 2024 · Updated last year
- Web UI for ExLlamaV2 ☆512 · Feb 5, 2025 · Updated last year
- A repository for research on medium sized language models. ☆78 · May 23, 2024 · Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆82 · Feb 7, 2026 · Updated 3 weeks ago
- A fast batching API to serve LLM models ☆189 · Apr 26, 2024 · Updated last year
- ☆23 · Jun 4, 2024 · Updated last year
- Minimal example of MCP for parsing llms.txt ☆40 · Apr 8, 2025 · Updated 10 months ago
- Go ahead and axolotl questions ☆11,335 · Updated this week
- Create Custom LLMs ☆1,810 · Nov 8, 2025 · Updated 3 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,444 · Dec 9, 2025 · Updated 2 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆270 · Jan 10, 2026 · Updated last month
- The repository for the code of the UltraFastBERT paper ☆519 · Mar 24, 2024 · Updated last year
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches. ☆285 · Nov 3, 2024 · Updated last year
- Customizable implementation of the self-instruct paper. ☆1,049 · Mar 7, 2024 · Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆159 · Feb 9, 2024 · Updated 2 years ago
- RS-IMLE ☆44 · Dec 7, 2024 · Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year