leptonai / examples
Lepton Examples
☆139Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for examples
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆131Updated last week
- Efficient AI Inference & Serving☆458Updated 10 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆236Updated 8 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆103Updated last week
- Benchmarking suite for popular AI APIs☆77Updated 2 weeks ago
- 中文版 llm-numbers☆109Updated 10 months ago
- LLM Inference benchmark☆350Updated 3 months ago
- A third-party component library based on Gradio.☆45Updated this week
- A high-performance inference system for large language models, designed for production environments.☆393Updated this week
- ☆51Updated 3 months ago
- an MLOps/LLMOps platform☆209Updated 3 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆137Updated 2 months ago
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant☆50Updated 2 months ago
- ☆76Updated 7 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆54Updated 7 months ago
- A Survey of AI startups☆393Updated last year
- ☆192Updated this week
- Benchmark suite for LLMs from Fireworks.ai☆58Updated 2 weeks ago
- AGI模块库架构图☆75Updated last year
- Mixture-of-Experts (MoE) Language Model☆180Updated 2 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆209Updated this week
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆310Updated this week
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆75Updated 8 months ago
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆263Updated 2 weeks ago
- ☆50Updated last month
- Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆43Updated 8 months ago
- AI-native application framework and runtime, simply write a YAML file.☆50Updated last year
- agentcraft 可以帮助您快速构建各类应用场景的ai agent应用☆49Updated 2 weeks ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆165Updated 2 weeks ago
- AI for all: Build the large graph of the language models☆243Updated 5 months ago