leptonai / examplesLinks
Lepton Examples
☆145Updated last month
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- Efficient AI Inference & Serving☆477Updated last year
- A high-performance inference system for large language models, designed for production environments.☆479Updated 3 weeks ago
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆163Updated 5 months ago
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.☆353Updated 8 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆275Updated last year
- ☆66Updated last year
- AI for all: Build the large graph of the language models☆276Updated last year
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆249Updated last year
- The next generation of Multi-Modal Multi-Agent platform.☆106Updated 5 months ago
- ☆74Updated last year
- Benchmarking suite for popular AI APIs☆87Updated 8 months ago
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, and more advanced components to help you build appl…☆125Updated 2 weeks ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆132Updated last month
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆86Updated last year
- A curated list of autonomous agents and developer tools powered by LLM.☆41Updated last year
- The source of LMSYS website and blogs☆66Updated last week
- Mixture-of-Experts (MoE) Language Model☆189Updated last year
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant☆57Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆368Updated last year
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆275Updated 2 years ago
- Website with current metrics on the fastest AI models.☆42Updated 11 months ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆316Updated 2 years ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆67Updated last year
- ☆120Updated last year
- Langchain implementation of HuggingGPT☆133Updated 2 years ago
- Self-host LLMs with LMDeploy and BentoML☆21Updated 3 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆137Updated last year
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆56Updated last year
- ☆29Updated last year
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆260Updated 5 months ago