In-depth tutorials and examples on LLM training and inference infrastructure, such as, Pytorch, Fairscale, Nvidia AI Modules (cuDNN, tensorRT, Megatron-LM), HuggingFace.
☆22May 19, 2025Updated 11 months ago
Alternatives and similar repositories for llm-infra
Users that are interested in llm-infra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Jul 6, 2025Updated 10 months ago
- TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.☆42Nov 10, 2025Updated 6 months ago
- ☆13Mar 18, 2026Updated last month
- Ultra high-performance secp256k1 ECC engine | Python, Node.js, Rust, Go, C#, Swift, Java bindings | CUDA, Metal, OpenCL GPU | ECDSA, Schn…☆37Updated this week
- Declare all your project's metadata and what you can do with it in one single place.☆48Aug 23, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Oct 18, 2023Updated 2 years ago
- Auto-Browse: AI Enabled Browser Automation☆18Jul 7, 2025Updated 10 months ago
- Core Data Generator (CDG for short) is a framework for generation (using Sourcery) of Core Data entities from plain structs/classes/enums…☆20Aug 3, 2021Updated 4 years ago
- MCPB Bundle for connecting Claude Desktop to Macuse. Macuse is a macOS app that bridges AI assistants with native macOS functionality.☆26Mar 3, 2026Updated 2 months ago
- A new experience for agentic-coding Android and Apple apps☆22Mar 1, 2026Updated 2 months ago
- Rails API with authentication via OAuth using Doorkeeper, Omniauth, and a React Native iOS client.☆11Jul 3, 2018Updated 7 years ago
- A Node.js library that enables communication with iOS devices using remote XPC services. It supports device lockdown, property-list (plis…☆26May 1, 2026Updated last week
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- ☆11Aug 12, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- For syncing my . files across different machines and setting up a new computer☆13Mar 23, 2026Updated last month
- A boilerplate / utility wrapper for JXA (AppleScript JavaScript) scripts — providing common automation functions.☆20Aug 31, 2022Updated 3 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- ☆13Apr 7, 2024Updated 2 years ago
- 📦 A collection of pastable code gathered from past projects☆12Sep 9, 2024Updated last year
- Default syntax highlighter for GitBook☆12Aug 30, 2018Updated 7 years ago
- CardiacProfileR: An R package for extraction and visualisation of heart rate profiles from wearable fitness trackers☆13Jun 10, 2018Updated 7 years ago
- Sleep tracking app with HealthKit support.☆10Dec 24, 2017Updated 8 years ago
- Spezi Module to Handle and Display User Interfaces for Chat-based Interactions☆11Apr 27, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆59Mar 14, 2024Updated 2 years ago
- This project is a modification of Openra1n to fully boot jailbreak on iOS 15 - 16.5.1 . with the help of Palera1n's code, this project no…☆13Jul 27, 2023Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Apr 8, 2026Updated last month
- ☆20May 1, 2026Updated last week
- A project to show the possibility to save and load a session in ARkit using CoreML that uses GPS to improve the localization precision☆10Jul 21, 2022Updated 3 years ago
- Easy MCP (Model Context Protocol) servers and AI agents, defined as YAML.☆19Dec 9, 2025Updated 5 months ago
- A few stylization coreML models that I've trained with CreateML☆14Dec 23, 2021Updated 4 years ago
- A chatbot built with SwiftUI, powered by OpenAI☆12Apr 9, 2023Updated 3 years ago
- 🗂️ A build tool plugin for reporting the contents of Xcode's IndexStore in a customizable format.☆19Sep 2, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Interactivity at a new level - MCP apps☆31Mar 12, 2026Updated last month
- Phi-2 Fine Tuning to build a mental health GPT.☆11Jan 6, 2024Updated 2 years ago
- Build extensions for Copilot for Xcode.☆15Oct 21, 2024Updated last year
- ☆15Apr 13, 2024Updated 2 years ago
- A reducer enhancer for using an xstate chart with redux☆13Mar 5, 2018Updated 8 years ago
- ☆12Apr 3, 2024Updated 2 years ago
- Command-line tool to convert Apple HealthKit data to a SQLite database.☆12Jan 18, 2023Updated 3 years ago