☆101Aug 30, 2024Updated last year
Alternatives and similar repositories for How_Much_VRAM
Users that are interested in How_Much_VRAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆17Sep 1, 2024Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Sep 11, 2024Updated last year
- coze api to openai☆15Sep 1, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- RAG example using DSPy, Gradio, FastAPI☆92Apr 11, 2024Updated 2 years ago
- A simple LLaMA implementation using MLX.☆15Apr 22, 2024Updated 2 years ago
- An RAG (retrieval augmented generation) app which iterates through a PDF document and can answer user's questions based on the document u…☆16Mar 23, 2025Updated last year
- A prompting library☆192Jul 1, 2025Updated 10 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆20Aug 26, 2024Updated last year
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- An in-cell AI assistant for JupyterLab notebooks☆38Sep 17, 2025Updated 8 months ago
- automatically quant GGUF models☆226Dec 23, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆170Aug 16, 2024Updated last year
- Rust Implementation of micrograd☆52Jul 3, 2024Updated last year
- [ISBI 2025] Design Data Before Models: Using large vision-language models to automatically enhance medical dataset annotations.☆35Jan 28, 2026Updated 3 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,919Jan 9, 2026Updated 4 months ago
- d3.js-based tool to visualize network communication for arbitrary protocols☆17Feb 20, 2015Updated 11 years ago
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 5 months ago
- Rag Chatbot React And Tyepscript base boilerplate☆32Apr 14, 2024Updated 2 years ago
- LangGraph-GUI backend with fastapi☆62Oct 16, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆42Jul 24, 2024Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Jan 4, 2024Updated 2 years ago
- ☆69May 26, 2024Updated 2 years ago
- ☆11Oct 19, 2024Updated last year
- It change UNetModel and VAE Conv2d Layer into circular padding mode that make any text2image process generate seamless patten☆19Mar 19, 2025Updated last year
- List of papers on Self-Correction of LLMs.☆81May 19, 2026Updated last week
- ☆22Apr 17, 2025Updated last year
- M5Stack GrayのIMUのデータをROS/ROS 2のノードからトピックとして送信するためのArduino Sketch(要PC)☆11Sep 6, 2020Updated 5 years ago
- ⛓️ build cognitive systems, pythonic☆341Nov 19, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆24Jun 4, 2024Updated last year
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm☆35Aug 18, 2022Updated 3 years ago
- FastAPI wrapper around DSPy☆294Mar 11, 2024Updated 2 years ago
- Python library for talking to Apollo API☆10Jan 31, 2024Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆88May 29, 2024Updated last year
- Minimalistic large language model 3D-parallelism training☆2,698Apr 7, 2026Updated last month