scaleapi / llm-engine
Scale LLM Engine public repository
☆791 Updated this week
Alternatives and similar repositories for llm-engine:
Users who are interested in llm-engine are comparing it to the libraries listed below:
- RayLLM - LLMs on Ray ☆1,254 Updated 8 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆687 Updated 10 months ago
- Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors ☆641 Updated last year
- A tiny library for coding with large language models. ☆1,223 Updated 7 months ago
- Prompt programming with FMs. ☆440 Updated 6 months ago
- ☆756 Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆705 Updated last year
- Customizable implementation of the self-instruct paper. ☆1,038 Updated 11 months ago
- LLM fine-tuning and eval ☆344 Updated 10 months ago
- Interactive Composition Explorer: a debugger for compositional language model programs ☆543 Updated last month
- Exact structure out of any language model completion. ☆506 Updated last year
- 🧠 Motorhead is a memory and information retrieval server for LLMs. ☆865 Updated 3 months ago
- A tool for evaluating LLMs ☆400 Updated 9 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,358 Updated last week
- Custom AI assistant platform to speed up your work. ☆1,031 Updated this week
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆999 Updated 4 months ago
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da… ☆486 Updated 10 months ago
- Automatically evaluate your LLMs in Google Colab ☆590 Updated 9 months ago
- Build robust LLM applications with true composability 🔗 ☆415 Updated last year
- ☆446 Updated last year
- ☆412 Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆584 Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆912 Updated 3 months ago
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions. ☆557 Updated this week
- An LLM-powered advanced RAG pipeline built from scratch ☆827 Updated last year
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more ☆567 Updated 5 months ago
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types ☆393 Updated last month
- Fine-Tuning Embedding for RAG with Synthetic Data ☆483 Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,842 Updated last year
- Inference code for Persimmon-8B ☆416 Updated last year