leptonai / leptonaiLinks

A Pythonic framework to simplify AI service building

☆2,763

Alternatives and similar repositories for leptonai

Users that are interested in leptonai are comparing it to the libraries listed below

Sorting:

leptonai / search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
☆8,124Updated 3 weeks ago
intel / intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…
☆2,166Updated 8 months ago
dvmazur / mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
☆2,310Updated last year
aiwaves-cn / agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
☆5,643Updated 9 months ago
LargeWorldModel / LWM
Large World Model -- Modeling Text and Video with Millions Context
☆7,300Updated 8 months ago
huggingface / optimum-nvidia
☆979Updated 5 months ago
dvlab-research / MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
☆3,287Updated last year
xlang-ai / OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
☆4,364Updated 7 months ago
aixcoder-plugin / aiXcoder-7B
official repository of aiXcoder-7B Code Large Language Model
☆2,262Updated 5 months ago
developersdigest / llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
☆4,899Updated last week
01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,831Updated 7 months ago
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆5,739Updated this week
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,493Updated last month
deepspeedai / DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
☆2,026Updated last week
ModelTC / lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…
☆3,366Updated this week
google / gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
☆6,491Updated this week
PKU-YuanGroup / MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
☆2,183Updated 7 months ago
leptonai / examples
Lepton Examples
☆141Updated last month
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,060Updated 2 weeks ago
InternLM / xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
☆4,628Updated this week
myshell-ai / JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
☆984Updated 11 months ago
neulab / prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
☆2,003Updated 6 months ago
openai / weak-to-strong
☆2,529Updated last year
Eladlev / AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
☆2,657Updated 2 months ago
liltom-eth / llama2-webui
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…
☆1,957Updated last year
microsoft / TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
☆5,806Updated last month
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆767Updated last year
Vahe1994 / AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…
☆1,267Updated 2 months ago
CStanKonrad / long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,458Updated last year
GoogleCloudPlatform / localllm
☆1,556Updated last year