abacaj / mpt-30B-inference
Run inference on MPT-30B using CPU
☆575Updated last year
Alternatives and similar repositories for mpt-30B-inference:
Users that are interested in mpt-30B-inference are comparing it to the libraries listed below
- C++ implementation for BLOOM☆809Updated last year
- A school for camelids☆1,208Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated last year
- ☆588Updated last year
- OpenAI-compatible Python client that can call any LLM☆371Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆716Updated last month
- ☆458Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch !☆443Updated last year
- ☆276Updated last year
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types☆395Updated 3 weeks ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆787Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆584Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU.☆311Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆709Updated last year
- Evaluation tool for LLM QA chains☆1,072Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆324Updated 2 years ago
- ☆535Updated last year
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆720Updated 10 months ago
- C++ implementation for 💫StarCoder☆453Updated last year
- ☆405Updated 2 years ago
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆443Updated last year
- A command-line interface to generate textual and conversational datasets with LLMs.☆293Updated last year
- Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks☆603Updated last year
- Official supported Python bindings for llama.cpp + gpt4all☆1,020Updated last year
- ☆1,462Updated last year
- A voice chat app☆1,105Updated 4 months ago
- Agent techniques to augment your LLM and push it beyong its limits☆1,571Updated 10 months ago
- Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A☆961Updated last year
- An Autonomous LLM Agent that runs on Wizcoder-15B☆336Updated 5 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆691Updated 11 months ago