project-ai101 / llm-infra
In-depth tutorials and examples on LLM training and inference infrastructure, such as, Pytorch, Fairscale, Nvidia AI Modules (cuDNN, tensorRT, Megatron-LM), HuggingFace.
☆12Updated last week
Alternatives and similar repositories for llm-infra:
Users that are interested in llm-infra are comparing it to the libraries listed below
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆28Updated 4 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 2 months ago
- A structured framework for defining, verifying and certifying AI systems.☆10Updated 3 weeks ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 10 months ago
- A forest of autonomous agents.☆19Updated 2 months ago
- ☆16Updated 7 months ago
- ☆19Updated last year
- Estimating hardware and cloud costs of LLMs and transformer projects☆14Updated last year
- Training hybrid models for dummies.☆20Updated 2 months ago
- examples and guides to using Nomic Atlas☆27Updated this week
- Lecture notes, scripts, and material for the lecture of Selected Statistics Topics in the Autonomous University of Querétaro☆12Updated 4 months ago
- ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks.☆31Updated 2 weeks ago
- Apps that run on modal.com☆12Updated 10 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆22Updated 9 months ago
- Documentation retrieval system to help LLMs navigate less-popular (yet often more powerful) Python libraries☆12Updated 10 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆16Updated this week
- ❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents☆54Updated this week
- The world's first fully automated VC fund.☆20Updated 2 weeks ago
- An all-new OS that orchestrates autonomous agents as workers to execute tasks.☆17Updated 4 months ago
- ☆13Updated last month
- Building large language foundational model☆9Updated 3 weeks ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆70Updated 3 weeks ago
- Simple orchestration for EC2 spot containers☆20Updated 6 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 4 months ago
- Phi-2 Fine Tuning to build a mental health GPT.☆10Updated last year
- This repository serves as a comprehensive reference for both beginners and advanced users of Git. It provides an organized and easy-to-fo…☆11Updated 4 months ago
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆16Updated 4 months ago
- An LLM inference engine, written in C++☆12Updated 2 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆23Updated 4 months ago
- A resilient distributed training framework☆93Updated 11 months ago