Run inference on MPT-30B using CPU
☆576 · Jun 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for mpt-30B-inference
Users that are interested in mpt-30B-inference are comparing it to the libraries listed below.
- Run inference on replit-3B code instruct model using CPU ☆160 · Jul 5, 2023 · Updated 2 years ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,885 · Jan 28, 2024 · Updated 2 years ago
- Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A ☆976 · Nov 6, 2023 · Updated 2 years ago
- This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond! ☆5,691 · Dec 19, 2025 · Updated 3 months ago
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ☆9,471 · Jun 7, 2025 · Updated 10 months ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,480 · May 1, 2025 · Updated 11 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆2,915 · Sep 30, 2023 · Updated 2 years ago
- Salesforce open-source LLMs with 8k sequence length. ☆726 · Jan 31, 2025 · Updated last year
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend… ☆1,940 · Mar 22, 2024 · Updated 2 years ago
- ☆135 · Nov 24, 2023 · Updated 2 years ago
- LLM as a Chatbot Service ☆3,325 · Nov 20, 2023 · Updated 2 years ago
- ☆2,558 · Jan 7, 2025 · Updated last year
- Large Language Model Text Generation Inference ☆10,830 · Mar 21, 2026 · Updated 3 weeks ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI. ☆796 · Aug 1, 2023 · Updated 2 years ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset ☆7,536 · Jul 16, 2023 · Updated 2 years ago
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ ☆4,905 · Apr 8, 2026 · Updated last week
- Run evaluation on LLMs using human-eval benchmark ☆430 · Sep 12, 2023 · Updated 2 years ago
- Cross-Platform, GPU Accelerated Whisper 🏎️ ☆1,801 · Feb 27, 2024 · Updated 2 years ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,465 · Nov 7, 2023 · Updated 2 years ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆2,012 · Dec 29, 2024 · Updated last year
- LLMs custom-chatbots console ⚡ ☆5,252 · Feb 27, 2024 · Updated 2 years ago
- LOMO: LOw-Memory Optimization ☆990 · Jul 2, 2024 · Updated last year
- An Open-source Toolkit for LLM Development ☆2,802 · Jan 13, 2025 · Updated last year
- FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models. ☆3,877 · Nov 11, 2025 · Updated 5 months ago
- Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning … ☆4,567 · Jul 29, 2025 · Updated 8 months ago
- Scale LLM Engine public repository ☆823 · Updated this week
- LLaMA v2 Chatbot ☆1,415 · Aug 27, 2023 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs