Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
β12Dec 3, 2024Updated last year
Alternatives and similar repositories for gpu_poor
Users that are interested in gpu_poor are comparing it to the libraries listed below
Sorting:
- Welcome! π This is the official code release of EviNote-RAG, and weβre happy to share it with the community.β44Nov 23, 2025Updated 3 months ago
- Connect your pc with modbus devicesβ11Apr 5, 2021Updated 4 years ago
- Platform API Project seedβ12Nov 8, 2023Updated 2 years ago
- A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.β46Sep 12, 2025Updated 5 months ago
- This repo contains documentation related to the operation of the OpenBytes project.β13Oct 29, 2021Updated 4 years ago
- Evaluation of Oasis Platform - simple install, UI and APIβ14Feb 9, 2026Updated 3 weeks ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Searchβ14Jun 18, 2025Updated 8 months ago
- A relatively simple, unified method for reporting on Kubernetes resource issues.β12Mar 5, 2020Updated 5 years ago
- Application for Agent re-engineering for better and reliable Gen AI workflows.β10Jul 20, 2025Updated 7 months ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer visβ¦β14Oct 21, 2024Updated last year
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and intβ¦β21Jul 4, 2025Updated 7 months ago
- β14Sep 8, 2023Updated 2 years ago
- Automate Checkmarx Scanning and Onboarding Plus AWS Accessβ12Jan 5, 2023Updated 3 years ago
- μ¬μ©μμΈμ¦ APIμλΉμ€β10Apr 21, 2021Updated 4 years ago
- Open-source intelligence (OSINT)β15Mar 1, 2024Updated 2 years ago
- This repository contains the source code for the cloud.gov.au website.β12Dec 7, 2022Updated 3 years ago
- Amazon Bedrock μ Nova, Claude 3.7 λͺ¨λΈμ νμ©νμ¬ pdf λλ©΄μ νμ± ν©λλ€.β12May 19, 2025Updated 9 months ago
- SEI120G - Tozed based Routerβ11Sep 18, 2024Updated last year
- AI_Powered_Dev_Search_Engineβ12Mar 10, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMsβ23Sep 21, 2025Updated 5 months ago
- β27Sep 5, 2025Updated 5 months ago
- A tool to explore ideas generated from artificial intelligence chats.β10Apr 3, 2023Updated 2 years ago
- β15Apr 7, 2024Updated last year
- β11Aug 15, 2024Updated last year
- This is a simple example of how to serve a DeepSeek model with Azure ML.β10Feb 10, 2025Updated last year
- β10Jul 13, 2024Updated last year
- Effortlessly process invoices with AI! This project uses the Llama3.2 Vision Model for OCR, converting invoice images into structured, maβ¦β10Feb 5, 2025Updated last year
- A Bunyan stream to send events to Seqβ11May 7, 2025Updated 9 months ago
- Chain-of-thought λ°©μμ νμ©νμ¬ llama2λ₯Ό fine-tuningβ10Nov 18, 2023Updated 2 years ago
- All things about MCP experiments.βοΈ Star to support our work!β16Aug 15, 2025Updated 6 months ago
- β10Dec 14, 2019Updated 6 years ago
- Fetch and cache files from local filesystem, cloud storage or public webservers in Laravel or Lumenβ11May 13, 2025Updated 9 months ago
- Source code and notebooks for my OReilly Live Course about automating tasks with AI tools and Python.β29Jan 29, 2026Updated last month
- β13Aug 26, 2024Updated last year
- This project is a versatile and powerful search tool that leverages state-of-the-art natural language processing models to provide relevaβ¦β12Apr 3, 2023Updated 2 years ago
- Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products (ICML 2024)β11Jul 13, 2024Updated last year
- β11Oct 11, 2023Updated 2 years ago
- Unofficial C# SDK for ZR04RN CCTVβ13Jun 8, 2023Updated 2 years ago
- Automatically switch between podcast camera views depending on who's talkingβ10Aug 22, 2019Updated 6 years ago