A CLI to estimate inference memory requirements for Hugging Face models, written in Python.
☆849 · Mar 6, 2026 · Updated this week
Alternatives and similar repositories for hf-mem
Users who are interested in hf-mem are comparing it to the libraries listed below.
- Training tiny models to prove hard theorems ☆41 · Feb 15, 2026 · Updated 3 weeks ago
- Template for building FastAPI applications with Neo4j (neontology). ☆21 · Nov 14, 2025 · Updated 3 months ago
- Agentic Swarm AI Agent with persistent long-term memory, multi-provider LLM support, token management, self-learning, and Telegram bot in… ☆22 · Feb 27, 2026 · Updated last week
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba ☆36 · Oct 16, 2025 · Updated 4 months ago
- Bringing some SQL to Qdrant ☆15 · Jun 17, 2025 · Updated 8 months ago
- ☆13 · Apr 25, 2024 · Updated last year
- ☆13 · Jan 22, 2025 · Updated last year
- This repository demonstrates various examples using YOLO ☆13 · Feb 9, 2024 · Updated 2 years ago
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible. ☆17 · Aug 22, 2024 · Updated last year
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas… ☆18 · Mar 14, 2025 · Updated 11 months ago
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + … ☆26 · Feb 12, 2026 · Updated 3 weeks ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆17 · May 1, 2023 · Updated 2 years ago
- ☆17 · Jun 3, 2024 · Updated last year
- Code for the C2KD paper (ICASSP 2023) ☆19 · May 15, 2023 · Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 5 months ago
- DImensionality REduction in JAX ☆25 · Nov 21, 2025 · Updated 3 months ago
- This project was inspired by the unclecode/crawl4ai repository. It provided valuable insights and ideas that helped shape the development… ☆16 · Dec 25, 2025 · Updated 2 months ago
- ☆23 · Dec 5, 2025 · Updated 3 months ago
- Learning to Model Editing Processes ☆26 · Aug 3, 2025 · Updated 7 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜 ☆1,894 · Jan 9, 2026 · Updated 2 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,660 · Mar 2, 2026 · Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 · Jul 18, 2025 · Updated 7 months ago
- ☆22 · May 1, 2024 · Updated last year
- Plug-and-play document AI with zero-shot models. ☆125 · Feb 16, 2026 · Updated 3 weeks ago
- Deep research agents using MiniMax M2.1 interleaved thinking ☆200 · Dec 23, 2025 · Updated 2 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL ☆289 · Oct 2, 2025 · Updated 5 months ago
- Typescript utilities for input validation, with emphasis on security ☆19 · Jan 3, 2024 · Updated 2 years ago
- ☆105 · Mar 25, 2025 · Updated 11 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,732 · May 21, 2025 · Updated 9 months ago
- ☆21 · Mar 3, 2025 · Updated last year
- Benchmarking the serving capabilities of vLLM ☆59 · Aug 20, 2024 · Updated last year
- Code to go with beginner FastHTML tutorial ☆20 · Jul 5, 2025 · Updated 8 months ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster ☆70 · Feb 20, 2026 · Updated 2 weeks ago
- High-performance, asynchronous Python HTTP client library designed for faster file transfers using concurrency, semaphores, and fault-tol… ☆59 · May 12, 2025 · Updated 9 months ago
- ☆57 · Mar 17, 2025 · Updated 11 months ago
- a1facts - the precision layer for AI agents ☆64 · Sep 29, 2025 · Updated 5 months ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin… ☆32 · Jan 11, 2025 · Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆354 · Jun 2, 2025 · Updated 9 months ago
- Convert PowerPoint files into semantically rich text using vision language models ☆113 · Nov 12, 2025 · Updated 3 months ago