Run inference on replit-3B code instruct model using CPU
☆160Jul 5, 2023Updated 2 years ago
Alternatives and similar repositories for replit-3B-inference
Users that are interested in replit-3B-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run inference on MPT-30B using CPU☆576Jun 30, 2023Updated 2 years ago
- ☆415Nov 2, 2023Updated 2 years ago
- Let's make sand talk☆588Oct 17, 2023Updated 2 years ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,885Jan 28, 2024Updated 2 years ago
- C++ implementation for 💫StarCoder☆458Sep 9, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A canvas based platformer powered by stable diffusion☆35Jun 26, 2023Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,636Sep 15, 2023Updated 2 years ago
- A shell for your agent☆17Dec 7, 2025Updated 4 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Jul 6, 2023Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆34Jan 4, 2025Updated last year
- Run evaluation on LLMs using human-eval benchmark☆430Sep 12, 2023Updated 2 years ago
- QA on your data with visualizations☆17Aug 10, 2023Updated 2 years ago
- ☆26Mar 14, 2024Updated 2 years ago
- Code Interpreter Replica☆26Jul 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".☆85Jan 19, 2024Updated 2 years ago
- ☆21May 6, 2023Updated 2 years ago
- Audio Cleaner using DeepFilterNet, hosted through Streamlit☆27May 4, 2025Updated 11 months ago
- Inference code and configs for the ReplitLM model family☆1,053Oct 9, 2023Updated 2 years ago
- ☆212Apr 13, 2023Updated 3 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Jul 26, 2023Updated 2 years ago
- ☆70Aug 24, 2024Updated last year
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- ☆28Feb 25, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆198Feb 9, 2024Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++☆10Jul 12, 2023Updated 2 years ago
- SOTA OpenSource code generation model on par with GPT-4 & Beating Google Gemini Ultra, Claude -2 etc.☆22Jan 10, 2024Updated 2 years ago
- CLAIRe: Conversational Learning AI with Recall☆67Aug 8, 2023Updated 2 years ago
- ☆553Feb 8, 2026Updated 2 months ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆336Oct 21, 2024Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,471Jun 7, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Generative neural networks for 3D terrain.☆32Dec 18, 2024Updated last year
- ☆135Nov 24, 2023Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆726Oct 11, 2023Updated 2 years ago
- GPT in your browser☆90Oct 7, 2023Updated 2 years ago
- Open-source bridge for Ray-Ban Meta glasses to RTMP streaming.☆32Jan 5, 2026Updated 3 months ago
- Streamlit web app utilizing OpenAI (GPT-4) and LangChain LLM tools. Application includes an SQLite DB for login/authentication and messag…☆33Jul 14, 2023Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,915Sep 30, 2023Updated 2 years ago