A calculator to estimate the memory footprint, capacity, and latency on VMware Private AI with NVIDIA.
☆40Aug 5, 2025Updated 8 months ago
Alternatives and similar repositories for LLM_Sizing_Guide
Users that are interested in LLM_Sizing_Guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An SCTP implementation for WebRTC Data Channels☆27Apr 9, 2026Updated last week
- A cookiecutter template for Python agent projects that use uv for dependency management☆30Mar 13, 2026Updated last month
- URI Component encoder/decoder☆24Feb 28, 2015Updated 11 years ago
- ☆22Dec 31, 2025Updated 3 months ago
- Unit benchmarks of CUDA event APIs.☆17Apr 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 提醒您按时吃药的 Telegram Bot。☆15Feb 8, 2023Updated 3 years ago
- https://bbuf.github.io/gpu-glossary-zh/☆26Nov 7, 2025Updated 5 months ago
- Set of general purpose utilities and interfaces.☆17Apr 5, 2026Updated last week
- ☆24Apr 9, 2026Updated last week
- zlib for Telegram Desktop☆15Jul 25, 2019Updated 6 years ago
- OpenAL Soft is a software implementation of the OpenAL 3D audio API.☆18May 15, 2025Updated 11 months ago
- hyperscan using dpdk☆12Jul 15, 2018Updated 7 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Sep 15, 2023Updated 2 years ago
- The new generation of meta-universe web3.0 infrastructure, mannheim is a fast, distributed, and creator-friendly Blockchain for UGC devel…☆10May 16, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Split a string into a char array by a given delimiter☆14Apr 1, 2016Updated 10 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Concurrency Runtime Library for Telegram Desktop☆20Jul 18, 2019Updated 6 years ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- ☆19May 13, 2023Updated 2 years ago
- Contains the Dockerfiles to easily build Telegram Desktop.☆24Oct 29, 2020Updated 5 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- The Microsoft community Windows Package Manager manifest repository☆14Updated this week
- network time protocol client☆18Dec 15, 2010Updated 15 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Obsidian Plugin: Display all bible references in Reading View as JW Library links, and add a command to convert bible references & jw.org…☆12Sep 12, 2025Updated 7 months ago
- An interactive tutorial project that demonstrates the capabilities of NVIDIA AI Workbench☆26Mar 26, 2026Updated 3 weeks ago
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- Learning coupled matrix factorizations in Python☆17Feb 8, 2026Updated 2 months ago
- A dual-chatbot system for learning languages based on LangChain☆13Jun 25, 2023Updated 2 years ago
- Spatial Decomposition and Transformation Network - TensorFlow☆14Dec 2, 2019Updated 6 years ago
- Large Language Models for the Terminal☆17Dec 11, 2023Updated 2 years ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆21Dec 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A feature-incomplete peekahole (pahole) clone that doesn't rely on libdwarves (and doesn't choke on Clang output)☆24Oct 23, 2017Updated 8 years ago
- XXTEA for encryption algorithm library Nim.☆19May 24, 2021Updated 4 years ago
- Using Machine Learning to Measure Job Skill Similarities - See more at: http://blog.nycdatascience.com/?p=11683&preview=true#sthash.NnPZZ…☆18Jun 20, 2016Updated 9 years ago
- Fallback configuration for branches that lack a .buildkite/ directory☆23Apr 8, 2026Updated last week
- A directory of practical and usable AI agents resources from applications and platforms to frameworks and utilities and other parts of th…☆33Mar 28, 2026Updated 2 weeks ago
- [CCS 2024] "BadMerging: Backdoor Attacks Against Model Merging": official code implementation.☆36Aug 22, 2024Updated last year
- WhatsApp chatbot with Dialogflow and Twilio api☆10May 6, 2024Updated last year