A calculator to estimate the memory footprint, capacity, and latency on VMware Private AI with NVIDIA.
☆40 · Aug 5, 2025 · Updated 9 months ago
Alternatives and similar repositories for LLM_Sizing_Guide
Users that are interested in LLM_Sizing_Guide are comparing it to the libraries listed below.
- TensorFlow fork with Salus integration ☆12 · Jan 7, 2022 · Updated 4 years ago
- Java client for RedisAI ☆13 · Oct 3, 2024 · Updated last year
- This demo showcases OpenTelemetry distributed tracing of a sample Golang HTTP App that uses a Redis backend. This setup relies on Jaeger… ☆10 · Jul 27, 2021 · Updated 4 years ago
- BNO055 USB Stick Linux Python 3 Driver ☆14 · Apr 6, 2021 · Updated 5 years ago
- A cookiecutter template for Python agent projects that use uv for dependency management ☆31 · Mar 13, 2026 · Updated 2 months ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- Redis Observability using eBPF ☆19 · Sep 17, 2024 · Updated last year
- A web interface for SleekDB written in PHP ☆11 · Jan 22, 2022 · Updated 4 years ago
- State-of-The-Art Rating-based RECOmmendation system: PyTorch Lightning implementation ☆13 · Oct 10, 2023 · Updated 2 years ago
- TigerGraph Graph Database Benchmark Report - TigerGraph, JanusGraph, Amazon Neptune, Neo4j, ArangoDB ☆16 · May 22, 2024 · Updated 2 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration. ☆13 · Sep 15, 2023 · Updated 2 years ago
- A Netdata JSON formatter for storing metrics in Timescale. ☆17 · Jun 29, 2025 · Updated 10 months ago
- Find Pitch of an Audio file ☆10 · Jun 10, 2019 · Updated 6 years ago
- ☆19 · Oct 18, 2025 · Updated 7 months ago
- ☆13 · Aug 28, 2025 · Updated 8 months ago
- ☆19 · May 13, 2023 · Updated 3 years ago
- ☆14 · Sep 18, 2024 · Updated last year
- ☆32 · Apr 20, 2026 · Updated last month
- JavaScript RedisTimeSeries client ☆21 · Jan 6, 2023 · Updated 3 years ago
- Finetuning a codegen model with a Python instruction set using the QLoRA technique for better efficacy ☆11 · Aug 31, 2023 · Updated 2 years ago
- A vLLM proxy server to add security and multi-model management for vLLM servers ☆12 · May 30, 2024 · Updated last year
- AgentQL's integrations with workflow automation tools and AI agent frameworks let you extract structured data from web pages using querie… ☆26 · May 19, 2026 · Updated last week
- A port of Stream VByte to Go ☆35 · Feb 22, 2022 · Updated 4 years ago
- ☆13 · Oct 8, 2021 · Updated 4 years ago
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor… ☆15 · Feb 27, 2024 · Updated 2 years ago
- PyTorch Implementation of A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce (RecSys'19) ☆14 · Aug 23, 2021 · Updated 4 years ago
- A dual-chatbot system for learning languages based on LangChain ☆13 · Jun 25, 2023 · Updated 2 years ago
- ☆10 · Sep 9, 2021 · Updated 4 years ago
- Review econometrics concepts with code examples ☆16 · Oct 23, 2022 · Updated 3 years ago
- Learning coupled matrix factorizations in Python ☆18 · Feb 8, 2026 · Updated 3 months ago
- An API for CRUD operations on binary files stored in S3 ☆26 · Dec 16, 2021 · Updated 4 years ago
- The A2C Reinforcement Learning Algorithm in PyTorch ☆17 · May 13, 2024 · Updated 2 years ago
- ☆15 · Apr 7, 2020 · Updated 6 years ago
- My personal guide to the great Python library Open3D. ☆22 · Feb 3, 2025 · Updated last year
- This is a joint project between Helmholtz Imaging (located at DKFZ) and Lin Yang and Otmar Schmid (Helmholtz Munich). ☆13 · Nov 6, 2024 · Updated last year
- Vim Xcode 10 Dark Theme ☆12 · Jun 5, 2019 · Updated 6 years ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal… ☆23 · Dec 29, 2024 · Updated last year
- A modified version of searx (the privacy-respecting metasearch engine) to only search an allowlist of sites, to build functionality simil… ☆19 · Sep 17, 2021 · Updated 4 years ago
- Edina - A simple stack-oriented compiled programming language. ☆15 · Jun 8, 2023 · Updated 2 years ago