A simple library for working with Hugging Face models.
☆14Dec 30, 2024Updated last year
Alternatives and similar repositories for hflm
Users that are interested in hflm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆37Aug 28, 2024Updated last year
- Transformer experiments☆16May 8, 2023Updated 2 years ago
- ProfitsBot V0 are a set of LLM experiments training open source langage models with loras for financial applications☆19May 27, 2023Updated 2 years ago
- Friendly interface to chat with an Ollama instance.☆94Apr 8, 2026Updated 3 weeks ago
- Browse, search, and visualize ONNX models.☆34May 6, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆35Aug 14, 2024Updated last year
- Fast and accurate on-device speech-to-text for web pages and web applications.☆71Dec 11, 2025Updated 4 months ago
- ☆23Oct 4, 2024Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆52Oct 29, 2025Updated 6 months ago
- Homebrew formulas for installing LLM and related tools☆14Sep 6, 2023Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Safari Reader Mode Source Code☆20Mar 5, 2024Updated 2 years ago
- AscTec quadrotor drivers☆17Aug 22, 2019Updated 6 years ago
- ☆15Apr 26, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Open-Source Pick and Place Machines☆18Sep 20, 2024Updated last year
- notes on langchain☆18Mar 20, 2026Updated last month
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆32Oct 2, 2025Updated 7 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆268Apr 23, 2024Updated 2 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 11 months ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- CLI util: Poor man's rpath for Windows executables.☆12Dec 16, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An API that allows you to scrape blog posts and articles and get a list of notes or a summary back.☆10Mar 31, 2023Updated 3 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- Matrix exponential in cuda for pytorch and tensorflow☆17Nov 26, 2018Updated 7 years ago
- ☆10Mar 27, 2024Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Lily programming language ⚜☆10Apr 7, 2026Updated 3 weeks ago
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- Call any function with command-like syntax at runtime (with automatic argument management). No dependencies, no boilerplate code, no macr…☆12Dec 25, 2022Updated 3 years ago
- Forest Fuels from Brown's Transects☆11Dec 14, 2018Updated 7 years ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆25Updated this week
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆16Mar 6, 2026Updated 2 months ago
- Utility to use eleven lab's streaming to in the command line☆11Aug 8, 2023Updated 2 years ago