Lepton Examples
☆146 · Oct 30, 2025 · Updated 6 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below.
- A Pythonic framework to simplify AI service building ☆2,805 · May 7, 2026 · Updated 2 weeks ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆34 · Nov 29, 2024 · Updated last year
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- Remove all your star from GitHub ☆15 · Jun 3, 2024 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆17 · Jun 3, 2024 · Updated last year
- Benchmarking Attention Mechanism in Vision Transformers. ☆20 · Oct 10, 2022 · Updated 3 years ago
- ☆30 · Mar 24, 2025 · Updated last year
- Building a quick conversation-based search demo with Lepton AI. ☆8,095 · Dec 2, 2025 · Updated 5 months ago
- Performance benchmarking with ColossalAI ☆39 · Jul 6, 2022 · Updated 3 years ago
- High Performance Grouped GEMM in PyTorch ☆30 · May 10, 2022 · Updated 4 years ago
- An Optimizing Compiler for Recommendation Model Inference ☆26 · Jun 5, 2025 · Updated 11 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆926 · Dec 30, 2024 · Updated last year
- Lab job-hunting discussion ☆10 · Oct 16, 2015 · Updated 10 years ago
- ☆28 · Jul 11, 2021 · Updated 4 years ago
- PaLM-Kosmos-Vision is a foundational project showcasing basic ChatGPT with vision capabilities, inviting further development for advanced… ☆16 · Nov 15, 2023 · Updated 2 years ago
- The code in this repository deploys Weaviate into Snowpark Container Services (SPCS), demonstrating how to run Weaviate in Snowflake. ☆15 · Oct 15, 2024 · Updated last year
- ☆11 · Mar 23, 2022 · Updated 4 years ago
- Kexplain is an interactive kubectl explain ☆12 · Oct 23, 2023 · Updated 2 years ago
- An object detection codebase based on MegEngine. ☆28 · Dec 14, 2022 · Updated 3 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr… ☆22 · Apr 13, 2022 · Updated 4 years ago
- ☆10 · Oct 26, 2016 · Updated 9 years ago
- TargetProp for RNNs ☆27 · Mar 22, 2019 · Updated 7 years ago
- Crowdsourced cypher statement evaluation ☆32 · Feb 6, 2024 · Updated 2 years ago
- Torch Distributed Experimental ☆117 · Aug 5, 2024 · Updated last year
- Quantized Attention on GPU ☆44 · Nov 22, 2024 · Updated last year
- Send HTTP scrapers to Wonderland ☆24 · Jan 7, 2019 · Updated 7 years ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆1,119 · Updated this week
- GraphQL ABC - a GraphQL middleware for the abc web framework ☆11 · Jun 11, 2020 · Updated 5 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 · Jan 16, 2026 · Updated 4 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models ☆727 · Dec 2, 2024 · Updated last year
- ☆106 · Sep 9, 2024 · Updated last year
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios. ☆45 · Feb 27, 2025 · Updated last year
- Concurrent inverse Bloom filter. ☆15 · Feb 3, 2015 · Updated 11 years ago
- Hanzi to Pinyin engine in Swift (Pinyin input method engine) ☆14 · Mar 29, 2024 · Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆29 · Updated this week
- Library for fast text representation and classification. ☆31 · Jan 9, 2024 · Updated 2 years ago
- a simple programming language under development ☆11 · Dec 3, 2023 · Updated 2 years ago
- SMCA replication ☆21 · Jul 24, 2021 · Updated 4 years ago
- A Translation Task using TurboTransformers ☆10 · Dec 17, 2020 · Updated 5 years ago