Lepton Examples
☆146Oct 30, 2025Updated 4 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- A Pythonic framework to simplify AI service building☆2,809Jan 31, 2026Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- ☆11Mar 23, 2022Updated 3 years ago
- ☆21Jul 24, 2025Updated 7 months ago
- ☆12Dec 21, 2021Updated 4 years ago
- Ask GPT to generate frontend based on Shadcn components☆14Mar 3, 2024Updated 2 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 10 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- Building a quick conversation-based search demo with Lepton AI.☆8,115Dec 2, 2025Updated 3 months ago
- Pytorch GUI(demo) implementation of CVPR2021 paper and ECCV2020 paper, "Guided Interactive Video Object Segmentation Using Reliability-B…☆18May 3, 2022Updated 3 years ago
- Performance benchmarking with ColossalAI☆39Jul 6, 2022Updated 3 years ago
- ☆18Jun 30, 2022Updated 3 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- SMCA replication☆21Jul 24, 2021Updated 4 years ago
- A huge dataset for Document Visual Question Answering☆20Jul 29, 2024Updated last year
- 极速页导航-无服务版本☆24Feb 2, 2026Updated last month
- ☆56Apr 23, 2024Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large model☆57Dec 17, 2023Updated 2 years ago
- ☆24Feb 19, 2025Updated last year
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- ☆23Jan 7, 2022Updated 4 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆23Apr 25, 2023Updated 2 years ago
- Unofficial pytorch implementation of ReZero in ResNet☆24Mar 29, 2020Updated 5 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆22Apr 13, 2022Updated 3 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- An Optimizing Compiler for Recommendation Model Inference☆26Jun 5, 2025Updated 9 months ago
- ☆28Jul 11, 2021Updated 4 years ago
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Jul 16, 2021Updated 4 years ago
- An IR for efficiently simulating distributed ML computation.☆32Jan 13, 2024Updated 2 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆95Dec 5, 2019Updated 6 years ago
- TargetProp for RNNs☆27Mar 22, 2019Updated 6 years ago
- 📚 A curated list of Awesome Efficient dLLMs Papers with Codes☆110Updated this week