Large Language Model Engineering (LLM Engineering) refers to the emerging best practices and tools for pretraining, post-training, and optimizing LLMs prior to production deployment. Pre- and post-training techniques include unsupervised pretraining, supervised fine-tuning, alignment, model merging, distillation, quantization, and others.
☆68 · Feb 18, 2026 · Updated last month
Alternatives and similar repositories for LLM-Engineering-Foundations-to-SLMs-Open-Source
Users who are interested in LLM-Engineering-Foundations-to-SLMs-Open-Source are comparing it to the libraries listed below.
- This is a companion repository for the On Prem RAG AIM Event ☆11 · Nov 30, 2024 · Updated last year
- A collection of fine-tuning notebooks! ☆31 · Oct 5, 2023 · Updated 2 years ago
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!! ☆99 · Feb 7, 2026 · Updated last month
- Data Observability for Data Engineering, published by Packt Publishing ☆11 · Jan 24, 2025 · Updated last year
- Modern Techniques for Data Science with Big Datasets ☆12 · Apr 6, 2023 · Updated 2 years ago
- ☆11 · Jan 28, 2026 · Updated 2 months ago
- Run LLMs on Replicate with vLLM ☆26 · Jul 19, 2025 · Updated 8 months ago
- mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local ☆28 · Mar 9, 2026 · Updated 2 weeks ago
- FastAPI SSE Example ☆36 · Jun 20, 2025 · Updated 9 months ago
- ☆12 · Feb 5, 2026 · Updated last month
- Generate HTML forms from Pydantic models for your FastHTML application ☆43 · Mar 9, 2026 · Updated 2 weeks ago
- Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, you’ll learn all about the key technologies … ☆293 · Apr 11, 2024 · Updated last year
- ☆11 · May 8, 2023 · Updated 2 years ago
- Base project for bootstrapping frontend projects ☆15 · Jan 28, 2026 · Updated 2 months ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION ☆24 · Jan 26, 2026 · Updated 2 months ago
- ☆12 · Jan 10, 2023 · Updated 3 years ago
- This is the repo for all cell sorting code and data ☆41 · Oct 30, 2024 · Updated last year
- Practice Notebook for AI Course ☆13 · Mar 1, 2025 · Updated last year
- Python client library for https://mcp.run - call portable & secure tools for your AI Agents and Apps ☆25 · May 8, 2025 · Updated 10 months ago
- ☆27 · Apr 3, 2024 · Updated last year
- Context-Aware Semantic Cache for Conversational AI ☆25 · Jan 3, 2025 · Updated last year
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type. ☆25 · Updated this week
- Language Cafe Bot ☆14 · Feb 12, 2026 · Updated last month
- Building your first LLM application with OpenAI, and AI-assisted Development, step-by-step! ☆127 · Nov 22, 2025 · Updated 4 months ago
- We run Node.js with Ollama Hosting LLM locally and we use D-ID for Live Avatar ☆24 · Jun 3, 2024 · Updated last year
- ☆50 · Mar 13, 2026 · Updated 2 weeks ago
- ☆35 · Mar 21, 2026 · Updated last week
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆15 · Apr 15, 2024 · Updated last year
- A tool for model sparse based on torch.fx ☆13 · Jun 3, 2024 · Updated last year
- Direct Preference Optimization Implementation ☆17 · Feb 1, 2024 · Updated 2 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 · Feb 20, 2024 · Updated 2 years ago
- All-in-One Safety Evaluation Framework ☆46 · Mar 4, 2026 · Updated 3 weeks ago
- Calculate allowed interactions in QED ☆10 · Nov 2, 2022 · Updated 3 years ago
- Code to go along with my AI agents youtube video ☆17 · Apr 5, 2024 · Updated last year
- Virtual machine with All-in-one OpenShift installation ☆12 · Sep 14, 2019 · Updated 6 years ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models ☆25 · Jan 29, 2026 · Updated 2 months ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 · Mar 13, 2023 · Updated 3 years ago
- ☆17 · Aug 30, 2025 · Updated 6 months ago
- General Utilities ☆49 · Mar 6, 2026 · Updated 3 weeks ago