Run GreenBitAI's Quantized LLMs on Apple Devices with MLX
☆31Aug 27, 2025Updated 9 months ago
Alternatives and similar repositories for gbx-lm
Users who are interested in gbx-lm are comparing it to the libraries listed below.
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆185Jul 23, 2025Updated 10 months ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Jan 11, 2024Updated 2 years ago
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 7 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 8 months ago
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- ☆71May 14, 2026Updated last month
- ☆19Aug 19, 2025Updated 9 months ago
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic r…☆18Aug 4, 2025Updated 10 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆22Jan 8, 2025Updated last year
- ☆20Oct 25, 2025Updated 7 months ago
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Jun 25, 2024Updated last year
- ☆19Nov 6, 2023Updated 2 years ago
- Bi-Real-Net model in PyTorch https://arxiv.org/abs/1808.00278 + pre-trained fp model weights☆18Sep 17, 2019Updated 6 years ago
- Repository for "Accelerating Neural Architecture Search using Performance Prediction" (ICLR Workshop 2018)☆18Mar 21, 2018Updated 8 years ago
- Run controlnet with flux☆17Oct 8, 2024Updated last year
- MLX version of DINO DETR☆16Dec 26, 2024Updated last year
- Cog template for Stable Diffusion 3 (ComfyUI implementation)☆17Jul 16, 2024Updated last year
- LLM-driven browser automation library built on Playwright with 67 CLI/SDK tools, stable snapshot refs, and stealth mode.☆74May 19, 2026Updated last month
- remark plugin to replace @ mentions with links☆14Mar 29, 2026Updated 2 months ago
- GitHub Discussions Blog Loader for Astro☆13Mar 29, 2026Updated 2 months ago
- Configuration files for the ODRI uDriver firmware.☆11Nov 15, 2022Updated 3 years ago
- Modern C++17 unit testing framework for Microsoft Windows, Apple macOS, Linux, iOS, and Android.☆15Sep 24, 2021Updated 4 years ago
- A Streamlit web app that scrapes a Facebook page, shows some statistics, and visualizes the data.☆28Jan 12, 2024Updated 2 years ago
- Contextmap is an interactive developer portal powered by automated documentation.☆12Apr 24, 2024Updated 2 years ago
- ☆13Apr 29, 2024Updated 2 years ago
- ☆19Apr 18, 2025Updated last year
- This repository contains code associated with an AWS blog post that demonstrates how you can accept API keys as a query string parameter in…☆10Feb 18, 2022Updated 4 years ago
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆21Oct 9, 2024Updated last year
- Fine-tuning Mistral 7B using Hugging Face, Weights & Biases, Choline, and Vast AI☆40Oct 4, 2023Updated 2 years ago
- "Hey, Computer" from Star Trek. Talk to your agent. Run hooks after trigger commands. Runs locally, cause shit's scary.☆150Jun 11, 2026Updated last week
- ☆13Jul 15, 2024Updated last year
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆19May 20, 2026Updated 3 weeks ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆41Aug 4, 2023Updated 2 years ago
- [ACL 2026] Repository of IPBench☆23Apr 6, 2026Updated 2 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆30Apr 8, 2025Updated last year
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Oct 11, 2024Updated last year