Domain Adapted Language Modeling Toolkit - E2E RAG
☆333Nov 8, 2024Updated last year
Alternatives and similar repositories for DALM
Users who are interested in DALM compare it with the libraries listed below
- The Arcee client for executing domain-adapted language model routines https://pypi.org/project/arcee-py/ ☆28Oct 8, 2024Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆252Oct 30, 2024Updated last year
- Tools for merging pretrained large language models.☆6,826Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,859May 17, 2025Updated 9 months ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,049Mar 7, 2024Updated last year
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,728May 21, 2025Updated 9 months ago
- ☆21Oct 6, 2023Updated 2 years ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated last year
- An Open Source Toolkit For LLM Distillation☆875Dec 21, 2025Updated 2 months ago
- Go ahead and axolotl questions☆11,335Updated this week
- ☆21Jun 26, 2024Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- Create Custom LLMs☆1,810Nov 8, 2025Updated 3 months ago
- ☆15Dec 7, 2023Updated 2 years ago
- Structured Outputs☆13,488Updated this week
- Harness LLMs with Multi-Agent Programming☆3,921Updated this week
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Sep 18, 2025Updated 5 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆261Apr 23, 2024Updated last year
- Optimizing inference proxy for LLMs☆3,352Jan 28, 2026Updated last month
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Updated this week
- Robust recipes to align language models with human and AI preferences☆5,510Sep 8, 2025Updated 5 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- llama.cpp with the BakLLaVA model describes what it sees☆379Nov 8, 2023Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,915Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.☆3,376Jul 25, 2025Updated 7 months ago
- Open-source tool to visualise your RAG 🔮☆1,216Jan 3, 2025Updated last year
- A collection of fine-tuning notebooks!☆30Oct 5, 2023Updated 2 years ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆518Apr 22, 2024Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,734Feb 9, 2026Updated 3 weeks ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,416Nov 5, 2025Updated 3 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,108Feb 23, 2026Updated last week
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated last year
- ☆16Feb 5, 2025Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16May 13, 2024Updated last year
- Repository for our "RAG in Practice (2025)" event!☆17Mar 26, 2025Updated 11 months ago
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Feb 23, 2026Updated last week
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆30May 29, 2024Updated last year