☆45Oct 13, 2023Updated 2 years ago
Alternatives and similar repositories for transformers-gptq-quant
Users that are interested in transformers-gptq-quant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Aug 27, 2023Updated 2 years ago
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Mobile Viewer for W&B, built on top of Flutter.☆41Mar 2, 2024Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆120Jul 28, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆20Jul 12, 2023Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- Automatically research and outbound companies with Exa API and google sheets app scripts.☆18Jun 24, 2024Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- TPU support for the fastai library☆13Apr 15, 2021Updated 4 years ago
- ☆14Oct 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 6 months ago
- ☆14May 7, 2019Updated 6 years ago
- ☆415Nov 2, 2023Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- Adaptive Passage Encoder for Open-domain Question Answering☆15Jun 1, 2021Updated 4 years ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆20Updated this week
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- ☆18Apr 3, 2023Updated 2 years ago
- ☆420Aug 13, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- batched loras☆351Sep 6, 2023Updated 2 years ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Feb 29, 2024Updated 2 years ago
- Manifold-Mixup implementation for fastai V2☆17Oct 1, 2020Updated 5 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 11 months ago
- ☆119Dec 18, 2024Updated last year
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2☆15Jun 27, 2025Updated 9 months ago
- Retrieval-Augmented Generation battle!☆64Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Jan 11, 2024Updated 2 years ago
- Smithy4s client directly using Fetch APIs, without bringing http4s/cats, to dramatically reduce bundle size☆13Jul 7, 2024Updated last year
- Acceleration of word2vec using GPU☆13Jan 25, 2017Updated 9 years ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆281Nov 3, 2023Updated 2 years ago
- Stream of my favorite papers and links☆44Feb 15, 2026Updated last month
- A really tiny autograd engine☆100May 26, 2025Updated 10 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago