Open-source calculator for LLM system requirements.
☆182Dec 18, 2024Updated last year
Alternatives and similar repositories for LLM-Tools
Users that are interested in LLM-Tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- ☆99Apr 2, 2025Updated last year
- ACM SoCC 2019, "Coupling Decentralized Key-Value Stores with Erasure Coding"☆15May 22, 2021Updated 5 years ago
- torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compile☆19Dec 22, 2023Updated 2 years ago
- best llms in russian☆62May 23, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A small Neural Network Processor for Edge devices.☆19Nov 22, 2022Updated 3 years ago
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- Polynomial arithmetic over GF2☆12Oct 30, 2018Updated 7 years ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- LLM Inference analyzer for different hardware platforms☆114Jun 12, 2026Updated last week
- ☆107Sep 9, 2024Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- QuickCached is a memcached server implementation in Java based on QuickServer☆13Jan 16, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆23May 29, 2023Updated 3 years ago
- Accurate, large-scale, and extensible simulator for LLM inference Systems☆620Jul 25, 2025Updated 10 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆12May 9, 2021Updated 5 years ago
- ☆23Oct 30, 2024Updated last year
- Effective LLM Alignment Toolkit☆153Jun 25, 2025Updated 11 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆104Updated this week
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆1,402Dec 3, 2024Updated last year
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Estimate Your LLM's Token Toll Across Various Platforms and Configurations☆39Nov 9, 2025Updated 7 months ago
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆81Jun 14, 2024Updated 2 years ago
- SC 2021, "LogECMem: Coupling Erasure-Coded In-Memory Key-Value Stores with Parity Logging"☆12Jul 12, 2021Updated 4 years ago
- Cocytus is an efficient and available in-memory K/V-store through hybrid erasure coding and replication☆31Mar 7, 2016Updated 10 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 3 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆56May 27, 2023Updated 3 years ago
- Optimize GEMM with tensorcore step by step☆37Dec 17, 2023Updated 2 years ago
- Efficient and easy multi-instance LLM serving☆553Mar 12, 2026Updated 3 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆72Apr 21, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A toy implementation about Program Dependence Graph using LLVM☆13Sep 27, 2023Updated 2 years ago
- ☆78Updated this week
- A C++ discrete event simulation framework☆14Mar 24, 2023Updated 3 years ago
- 📚 LaTeX templates and tools for creating beautiful, structured documents 📝☆14Oct 24, 2025Updated 7 months ago
- Cluster management tools for the Hydro stack☆19Feb 5, 2021Updated 5 years ago
- WIP: Get Stable DIffusion Controlnet running with DirectML via ONNX☆16Mar 13, 2023Updated 3 years ago
- ☆13Jun 4, 2025Updated last year