alvarobartt / hf-memView external linksLinks
A CLI to estimate inference memory requirements for Hugging Face models, written in Python.
☆712Feb 4, 2026Updated last week
Alternatives and similar repositories for hf-mem
Users that are interested in hf-mem are comparing it to the libraries listed below
Sorting:
- Aplicación de realidad aumentada y navegación para museos sobre smart devices☆19Aug 2, 2016Updated 9 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆34Oct 16, 2025Updated 4 months ago
- ☆13Apr 25, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + …☆22Updated this week
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Dec 16, 2024Updated last year
- A comprehensive repository of 100+ RAG (Retrieval-Augmented Generation) libraries, frameworks, and tools organized by category. This cura…☆11Mar 23, 2025Updated 10 months ago
- This Repository demostrates various examples using YOLO☆13Feb 9, 2024Updated 2 years ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Sep 24, 2025Updated 4 months ago
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 2 months ago
- Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.☆17Aug 22, 2024Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- Code for the C2KD paper (ICASSP 2023)☆18May 15, 2023Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 4 months ago
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 6 months ago
- The official repo of our research work "Interactive Editing for Text Summarization".☆23Jun 3, 2023Updated 2 years ago
- This project was inspired by the unclecode/crawl4ai repository. It provided valuable insights and ideas that helped shape the development…☆16Dec 25, 2025Updated last month
- ☆23Dec 5, 2025Updated 2 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking☆197Dec 23, 2025Updated last month
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Updated this week
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 3 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,875Jan 9, 2026Updated last month
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆283Oct 2, 2025Updated 4 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- Comprehensive metrics, insights, and visualization for Agno and Crew AI applications☆26May 21, 2025Updated 8 months ago
- Typescript utilities for input validation, with emphasis on security☆19Jan 3, 2024Updated 2 years ago
- ☆210Jun 26, 2025Updated 7 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,719May 21, 2025Updated 8 months ago
- ☆21Mar 3, 2025Updated 11 months ago
- ☆160Dec 18, 2025Updated last month
- Unified Schema-Based Information Extraction☆792Feb 2, 2026Updated 2 weeks ago
- Benchmarking the serving capabilities of vLLM☆59Aug 20, 2024Updated last year
- High-performance, asynchronous Python HTTP client library designed for faster file transfers using concurrency, semaphores, and fault-tol…☆59May 12, 2025Updated 9 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,660Feb 9, 2026Updated last week
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆847Updated this week
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Jan 11, 2025Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆352Jun 2, 2025Updated 8 months ago
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆408Jan 17, 2024Updated 2 years ago