☆45 · Oct 13, 2023 · Updated 2 years ago
Alternatives and similar repositories for transformers-gptq-quant
Users that are interested in transformers-gptq-quant are comparing it to the libraries listed below.
- ☆22 · Aug 27, 2023 · Updated 2 years ago
- Training HuggingFace models using fastai ☆11 · Jul 22, 2021 · Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 · Sep 22, 2024 · Updated last year
- Mobile Viewer for W&B, built on top of Flutter. ☆41 · Mar 2, 2024 · Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs ☆121 · Jul 28, 2024 · Updated last year
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Common Voice Generator using Speech Synthesizer ☆13 · Jul 28, 2021 · Updated 4 years ago
- ☆20 · Jul 12, 2023 · Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆83 · Sep 10, 2023 · Updated 2 years ago
- Build modern UIs in Jupyter with Python ☆12 · Dec 28, 2022 · Updated 3 years ago
- Automatically research and outbound companies with Exa API and google sheets app scripts. ☆18 · Jun 24, 2024 · Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆25 · Oct 14, 2024 · Updated last year
- A chat implementation for FastHTML ☆12 · Sep 14, 2025 · Updated 7 months ago
- ☆415 · Nov 2, 2023 · Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats. ☆31 · May 29, 2023 · Updated 2 years ago
- Adaptive Passage Encoder for Open-domain Question Answering ☆15 · Jun 1, 2021 · Updated 4 years ago
- ☆18 · Apr 3, 2023 · Updated 3 years ago
- ☆63 · Sep 23, 2024 · Updated last year
- A collection of optimizers, some arcane others well known, for Flax. ☆29 · Aug 6, 2021 · Updated 4 years ago
- ☆427 · Aug 13, 2024 · Updated last year
- batched loras ☆351 · Sep 6, 2023 · Updated 2 years ago
- Full finetuning of large language models without large memory requirements ☆94 · Sep 22, 2025 · Updated 6 months ago
- ☆120 · Dec 18, 2024 · Updated last year
- Command-line tools to support meta-analysis using a library managed in Zotero ☆11 · Feb 9, 2023 · Updated 3 years ago
- defaultMODE is a Python framework for creating Discord AI agents with persistent memory and evolving behavior through brain-inspired sele… ☆13 · Mar 31, 2026 · Updated 2 weeks ago
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han… ☆29 · Apr 7, 2024 · Updated 2 years ago
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2 ☆15 · Jun 27, 2025 · Updated 9 months ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/) ☆44 · Mar 22, 2026 · Updated 3 weeks ago
- Demo of AI chatbot that predicts user message to generate response quickly. ☆105 · Feb 28, 2024 · Updated 2 years ago
- Retrieval-Augmented Generation battle! ☆64 · Mar 31, 2026 · Updated 2 weeks ago
- ☆25 · Mar 31, 2026 · Updated 2 weeks ago
- A collection of utilities for FastHTML projects. ☆14 · Oct 23, 2024 · Updated last year
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs ☆110 · Jan 11, 2024 · Updated 2 years ago
- Smithy4s client directly using Fetch APIs, without bringing http4s/cats, to dramatically reduce bundle size ☆13 · Jul 7, 2024 · Updated last year
- Acceleration of word2vec using GPU ☆13 · Jan 25, 2017 · Updated 9 years ago
- ☆13 · Jan 20, 2023 · Updated 3 years ago
- Stream of my favorite papers and links ☆44 · Feb 15, 2026 · Updated 2 months ago
- Convert all of libgen to high quality markdown ☆255 · Dec 13, 2023 · Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 · Nov 1, 2023 · Updated 2 years ago