Domain Adapted Language Modeling Toolkit - E2E RAG
☆340Nov 8, 2024Updated last year
Alternatives and similar repositories for DALM
Users that are interested in DALM are comparing it to the libraries listed below.
- The Arcee client for executing domain-adapted language model routines https://pypi.org/project/arcee-py/☆28Oct 8, 2024Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆257Oct 30, 2024Updated last year
- ☆14Dec 7, 2023Updated 2 years ago
- Tools for merging pretrained large language models.☆7,083May 6, 2026Updated 2 weeks ago
- ☆21Oct 6, 2023Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,782May 15, 2026Updated last week
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,924May 17, 2025Updated last year
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,938Updated this week
- An Open Source Toolkit For LLM Distillation☆942May 12, 2026Updated last week
- A collection of fine-tuning notebooks!☆31Oct 5, 2023Updated 2 years ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Sep 18, 2025Updated 8 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- [COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity☆13May 8, 2024Updated 2 years ago
- Structured Outputs☆13,846May 13, 2026Updated last week
- Harness LLMs with Multi-Agent Programming☆4,015May 6, 2026Updated 2 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,364May 1, 2026Updated 3 weeks ago
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,058May 6, 2026Updated 2 weeks ago
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated 2 years ago
- Create Custom LLMs☆1,844Apr 24, 2026Updated last month
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆77Oct 19, 2024Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated 2 years ago
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. Published in Nature.☆3,554Jul 25, 2025Updated 9 months ago
- Optimizing inference proxy for LLMs☆3,856May 7, 2026Updated 2 weeks ago
- DSPy: The framework for programming—not prompting—language models☆34,496May 17, 2026Updated last week
- Robust recipes to align language models with human and AI preferences☆5,602Apr 8, 2026Updated last month
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆267Apr 23, 2024Updated 2 years ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆33Dec 2, 2025Updated 5 months ago
- llama.cpp with BakLLaVA model describes what does it see☆379Nov 8, 2023Updated 2 years ago
- ☆25Jun 26, 2024Updated last year
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,870Apr 13, 2026Updated last month
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆526Apr 22, 2024Updated 2 years ago
- annoy long term memory experiment for oobabooga/text-generation-webui☆30Jul 17, 2023Updated 2 years ago