Lepton Examples
☆146Oct 30, 2025Updated 4 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Pythonic framework to simplify AI service building☆2,806Jan 31, 2026Updated last month
- ☆21Jul 24, 2025Updated 8 months ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Reacting to content with GPT-4V, OpenAI tts, Cloudflare Workers and Mac shortcuts☆21Nov 29, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 7 months ago
- Neural Style Transfer with Caffe2 on your Android phone☆82Mar 28, 2019Updated 6 years ago
- Building a quick conversation-based search demo with Lepton AI.☆8,108Dec 2, 2025Updated 3 months ago
- Repository to quickly label lots of images using CLIP embeddings☆16Apr 29, 2025Updated 10 months ago
- ☆24Jan 7, 2022Updated 4 years ago
- This Discord bot allows users to trigger an update process for the LibreChat server directly from a Discord server. The bot provides a co…☆16Jun 10, 2025Updated 9 months ago
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- 极速页导航-无服务版本☆24Feb 2, 2026Updated last month
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆921Dec 30, 2024Updated last year
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆33May 3, 2023Updated 2 years ago
- Open single and half precision gemm implementations☆398Apr 2, 2023Updated 2 years ago
- PaLM-Kosmos-Vision is a foundational project showcasing basic ChatGPT with vision capabilities, inviting further development for advanced…☆16Nov 15, 2023Updated 2 years ago
- ☆19Mar 14, 2024Updated 2 years ago
- [CVPR 2021] Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator☆39May 19, 2022Updated 3 years ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- CircleIt: Web3 Reddit on DeSo Blockchain☆10Jun 14, 2023Updated 2 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆22Apr 13, 2022Updated 3 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 11 months ago
- A Survey of AI startups☆402Aug 27, 2023Updated 2 years ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- Crowdsourced cypher statement evaluation☆32Feb 6, 2024Updated 2 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆478Mar 15, 2024Updated 2 years ago
- An ai-powered product photography studio☆16Sep 22, 2023Updated 2 years ago
- This very simple python script takes inputs from your business and outputs articles written bhy claude.☆13Apr 3, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Torch Distributed Experimental☆117Aug 5, 2024Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13May 7, 2015Updated 10 years ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆1,074Updated this week
- Documentation for Pixura NFT APIs☆11Oct 26, 2018Updated 7 years ago
- The face of Trickle.gg decentralized application.☆12Nov 11, 2022Updated 3 years ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆726Dec 2, 2024Updated last year