abacaj / replit-3B-inferenceLinks
Run inference on replit-3B code instruct model using CPU
☆159Updated 2 years ago
Alternatives and similar repositories for replit-3B-inference
Users that are interested in replit-3B-inference are comparing it to the libraries listed below
Sorting:
- ☆134Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆381Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆170Updated last year
- ☆116Updated 10 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆99Updated 2 years ago
- BabyAGI to run with GPT4All☆248Updated 2 years ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆333Updated last year
- ☆215Updated 2 years ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆197Updated last year
- Harnessing the Memory Power of the Camelids☆147Updated 2 years ago
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆41Updated 2 years ago
- ☆38Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- ☆163Updated 3 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last month
- Demo of AI chatbot that predicts user message to generate response quickly.☆103Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- The code we currently use to fine-tune models.☆116Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆93Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- TheBloke's Dockerfiles☆307Updated last year
- ☆132Updated 2 years ago
- C++ implementation for 💫StarCoder☆455Updated 2 years ago
- Small finetuned LLMs for a diverse set of useful tasks☆126Updated 2 years ago