Lepton Examples
☆146Oct 30, 2025Updated 5 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below.
- A Pythonic framework to simplify AI service building☆2,803Updated this week
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆34Nov 29, 2024Updated last year
- ☆21Jul 24, 2025Updated 8 months ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Reacting to content with GPT-4V, OpenAI tts, Cloudflare Workers and Mac shortcuts☆21Nov 29, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 8 months ago
- ☆29Mar 24, 2025Updated last year
- Building a quick conversation-based search demo with Lepton AI.☆8,100Dec 2, 2025Updated 4 months ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- Repository to quickly label lots of images using CLIP embeddings☆16Apr 29, 2025Updated 11 months ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- ☆23Jan 7, 2022Updated 4 years ago
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 3 years ago
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated last year
- Fast page navigation, serverless version☆24Feb 2, 2026Updated 2 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆924Dec 30, 2024Updated last year
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆34May 3, 2023Updated 2 years ago
- Lab job-hunting discussion☆10Oct 16, 2015Updated 10 years ago
- ☆28Jul 11, 2021Updated 4 years ago
- The code in this repository deploys Weaviate into Snowpark Container Services (SPCS), demonstrating how to run Weaviate in Snowflake.☆15Oct 15, 2024Updated last year
- PaLM-Kosmos-Vision is a foundational project showcasing basic ChatGPT with vision capabilities, inviting further development for advanced…☆16Nov 15, 2023Updated 2 years ago
- A huge dataset for Document Visual Question Answering☆22Jul 29, 2024Updated last year
- ☆19Mar 14, 2024Updated 2 years ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆22Apr 13, 2022Updated 4 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- ☆10Oct 26, 2016Updated 9 years ago
- TargetProp for RNNs☆27Mar 22, 2019Updated 7 years ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- Optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆479Mar 15, 2024Updated 2 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Send HTTP scrapers to Wonderland☆24Jan 7, 2019Updated 7 years ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆1,088Updated this week
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 3 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆727Dec 2, 2024Updated last year
- ☆105Sep 9, 2024Updated last year
- A Vue App for quickly generating KML Search Grids☆13Nov 12, 2024Updated last year