project-ai101 / llm-infra

In-depth tutorials and examples on LLM training and inference infrastructure, such as, Pytorch, Fairscale, Nvidia AI Modules (cuDNN, tensorRT, Megatron-LM), HuggingFace.

☆12

Alternatives and similar repositories for llm-infra:

Users that are interested in llm-infra are comparing it to the libraries listed below

AutonomicPerfectionist / PipeInfer
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
☆28Updated 4 months ago
teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆30Updated 2 months ago
relari-ai / agent-contracts
A structured framework for defining, verifying and certifying AI systems.
☆10Updated 3 weeks ago
thisishugow / create-llama-ollama
a simple create-llama template using llama-index v0.10 and integrated with Ollama
☆10Updated 10 months ago
kyegomez / forest-of-thoughts
A forest of autonomous agents.
☆19Updated 2 months ago
dkubeai / langrunner
☆16Updated 7 months ago
weaviate / how-to-ingest-pdfs-with-unstructured
☆19Updated last year
isEmmanuelOlowe / llm-cost-estimator
Estimating hardware and cloud costs of LLMs and transformer projects
☆14Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆20Updated 2 months ago
nomic-ai / cookbook
examples and guides to using Nomic Atlas
☆27Updated this week
eruiz1996 / Selected-Statistics-Topics
Lecture notes, scripts, and material for the lecture of Selected Statistics Topics in the Autonomous University of Querétaro
☆12Updated 4 months ago
ml-dev-bench / ml-dev-bench
ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks.
☆31Updated 2 weeks ago
nateraw / modal-examples
Apps that run on modal.com
☆12Updated 10 months ago
uogbuji / mlx-notes
Shared personal notes created while working with the Apple MLX machine learning framework
☆22Updated 9 months ago
ashvardanian / MongooseMiner
Documentation retrieval system to help LLMs navigate less-popular (yet often more powerful) Python libraries
☆12Updated 10 months ago
allenai / olmo-cookbook
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆16Updated this week
Just-Curieous / Curie
❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents
☆54Updated this week
kyegomez / WhiteRock
The world's first fully automated VC fund.
☆20Updated 2 weeks ago
kyegomez / SwarmOS
An all-new OS that orchestrates autonomous agents as workers to execute tasks.
☆17Updated 4 months ago
mohamedfawzy96 / ragxo
☆13Updated last month
ictorv / Large-Language-Pretraining
Building large language foundational model
☆9Updated 3 weeks ago
guestrin-lab / ACORN
state-of-the-art search over vector embeddings and structured data (SIGMOD '24)
☆70Updated 3 weeks ago
omkaark / spotty
Simple orchestration for EC2 spot containers
☆20Updated 6 months ago
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆26Updated 4 months ago
AIAnytime / Phi-2-Fine-Tuning
Phi-2 Fine Tuning to build a mental health GPT.
☆10Updated last year
benzocv / git-toolbox
This repository serves as a comprehensive reference for both beginners and advanced users of Git. It provides an organized and easy-to-fo…
☆11Updated 4 months ago
kyegomez / MultiModal-ToT
Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement
☆16Updated 4 months ago
microsoft / glinthawk
An LLM inference engine, written in C++
☆12Updated 2 months ago
sofi444 / realtime-openai-dotpy
Speech to Speech conversation using the OpenAI RealTime API in Python 🐍
☆23Updated 4 months ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆93Updated 11 months ago