Large Language Model Engineering (LLM Engineering) refers to the emerging best-practices and tools for pretraining, post-training, and optimizing LLMs prior to production deployment. Pre- and post-training techniques include unsupervised pretraining, supervised fine-tuning, alignment, model merging, distillation, quantization. and others.
☆69Feb 18, 2026Updated 2 months ago
Alternatives and similar repositories for LLM-Engineering-Foundations-to-SLMs-Open-Source
Users that are interested in LLM-Engineering-Foundations-to-SLMs-Open-Source are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a companion repository for the On Prem RAG AIM Event☆11Nov 30, 2024Updated last year
- A collection of fine-tuning notebooks!☆31Oct 5, 2023Updated 2 years ago
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆100Feb 7, 2026Updated 2 months ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Modern Techniques for Data Science with Big Datasets☆12Apr 6, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jan 28, 2026Updated 2 months ago
- ☆78May 27, 2024Updated last year
- Airflow Demo on Heroku☆13Jul 9, 2022Updated 3 years ago
- A framework for Ranked List Truncation, including the implementation of multiple existing deep models, such as BiCut、Choopy and AttnCut. …☆14May 7, 2022Updated 3 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, you’ll learn all about the key technologies …☆291Apr 11, 2024Updated 2 years ago
- MSPaint for marimo and other Python notebooks☆24Oct 24, 2025Updated 5 months ago
- This is the repo for all cell sorting code and data☆41Oct 30, 2024Updated last year
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆11Jan 27, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is meant to optimize hybrid search settings for OpenSearch. It covers a grid search approach to identify a good parameter…☆13Sep 1, 2025Updated 7 months ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- ☆27Apr 3, 2024Updated 2 years ago
- Language Cafe Bot☆14Apr 3, 2026Updated 2 weeks ago
- Building your first LLM application with OpenAI, and AI-assisted Development, step-by-step!☆128Nov 22, 2025Updated 4 months ago
- ☆76Apr 2, 2026Updated 2 weeks ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- Work with your business data using natural language☆19Nov 20, 2024Updated last year
- An easily configurable Debezium Docker image ready for the cloud☆13Jun 5, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SQL scripts, instructions for MySQL HeatWave benchmarking☆12Mar 17, 2024Updated 2 years ago
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 11 months ago
- The Agentic Developer Environment. Built on Tauri, Svelte 5, and the Agent Client Protocol.☆57Updated this week
- We run Node.js with Ollama Hosting LLM locally and we use D-ID for Live Avatar☆24Jun 3, 2024Updated last year
- Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Surv…☆30Apr 29, 2025Updated 11 months ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆41Feb 20, 2024Updated 2 years ago
- FastAPI ASGI with Django ORM and admin☆15May 15, 2022Updated 3 years ago
- Code to go along with my AI agents youtube video☆17Apr 5, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆14Mar 5, 2023Updated 3 years ago
- Information Retrieval system built by BERT and elasticsearch☆14Feb 9, 2020Updated 6 years ago
- Data visualization workshop☆11May 12, 2020Updated 5 years ago
- A useful tool for visualizing and analyzing where our models weaknesses are☆13Aug 15, 2019Updated 6 years ago
- K-RET: Knowledgeable Biomedical Relation Extraction System☆10Feb 22, 2025Updated last year
- A DSL for Domain-Driven Design☆17Feb 1, 2026Updated 2 months ago
- Sample Project for an MLOps 101 course I taught☆47Apr 10, 2025Updated last year