abacaj / mpt-30B-inferenceLinks

Run inference on MPT-30B using CPU

☆576

Alternatives and similar repositories for mpt-30B-inference

Users that are interested in mpt-30B-inference are comparing it to the libraries listed below

Sorting:

NouamaneTazi / bloomz.cpp
C++ implementation for BLOOM
☆808Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆457Updated 2 years ago
danielgross / LlamaAcademy
A school for camelids
☆1,209Updated 2 years ago
freddyaboulton / gradio-tools
☆599Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆823Updated 2 years ago
rlancemartin / auto-evaluator
Evaluation tool for LLM QA chains
☆1,087Updated 2 years ago
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆722Updated 10 months ago
zhudotexe / kani
kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)
☆594Updated 3 weeks ago
scaleapi / llm-engine
Scale LLM Engine public repository
☆816Updated this week
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆324Updated 2 years ago
paolorechia / learn-langchain
☆276Updated 2 years ago
BlackHC / llm-strategy
Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types
☆401Updated 9 months ago
r2d4 / openlm
OpenAI-compatible Python client that can call any LLM
☆373Updated 2 years ago
jina-ai / agentchain
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
☆607Updated 2 years ago
e-johnstonn / BriefGPT
Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.
☆799Updated 2 years ago
modal-labs / quillman
A voice chat app
☆1,178Updated 6 months ago
pHaeusler / micro-agent
A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)
☆439Updated 2 years ago
Nuggt-dev / Nuggt
An Autonomous LLM Agent that runs on Wizcoder-15B
☆334Updated last year
shm007g / LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch !
☆450Updated 2 years ago
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,631Updated 2 years ago
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆598Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆465Updated 2 years ago
booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆775Updated last year
johnsmith0031 / alpaca_lora_4bit
☆535Updated 2 years ago
bborn / howdoi.ai
howdoi.ai
☆257Updated 2 years ago
andrewgcodes / lightspeedGPT
Use GPT4 and GPT3.5 on inputs of unlimited size. Uses multithreading to process multiple chunks in parallel. Useful for tasks like Named …
☆269Updated 2 years ago
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆733Updated last year
plchld / InsightFlow
LLM-based tool for parsing information and chatting with it
☆214Updated 2 years ago
rogeriochaves / langstream
Build robust LLM applications with true composability 🔗
☆422Updated last year