leptonai / examplesLinks
Lepton Examples
☆146Updated 3 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- Efficient AI Inference & Serving☆479Updated 2 years ago
- A high-performance inference system for large language models, designed for production environments.☆491Updated last month
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆279Updated 2 years ago
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.☆363Updated last year
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆251Updated last year
- Benchmarking suite for popular AI APIs☆88Updated last year
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆166Updated 8 months ago
- ☆67Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 4 months ago
- A curated list of autonomous agents and developer tools powered by LLM.☆46Updated 2 years ago
- ☆74Updated last year
- The next generation of Multi-Modal Multi-Agent platform.☆111Updated 8 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated 2 years ago
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆275Updated 2 years ago
- an MLOps/LLMOps platform☆236Updated last year
- ☆476Updated 2 years ago
- Social and customizable AI writing assistant! ✍️☆260Updated 7 months ago
- An open-source LLM tool for extracting repeatable tasks from your conversations, and saving them into a customized skill library for retr…☆128Updated 2 years ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆318Updated 2 years ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…☆259Updated 2 years ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆346Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆91Updated last year
- Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.☆204Updated 2 years ago
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆246Updated 2 years ago
- A Survey of AI startups☆402Updated 2 years ago
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆269Updated 8 months ago
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- Self-host LLMs with LMDeploy and BentoML☆22Updated last month
- ☆120Updated last year