☆45Oct 13, 2023Updated 2 years ago
Alternatives and similar repositories for transformers-gptq-quant
Users that are interested in transformers-gptq-quant are comparing it to the libraries listed below.
- ☆22Aug 27, 2023Updated 2 years ago
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- clean up your LLM datasets☆113May 30, 2023Updated 2 years ago
- Mobile Viewer for W&B, built on top of Flutter.☆41Mar 2, 2024Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆127Jul 28, 2024Updated last year
- Common Voice Generator using Speech Synthesizer☆14Jul 28, 2021Updated 4 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- Automatically research and outbound companies with Exa API and google sheets app scripts.☆18Jun 24, 2024Updated last year
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- ☆14Oct 21, 2024Updated last year
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 8 months ago
- ☆415Nov 2, 2023Updated 2 years ago
- Adaptive Passage Encoder for Open-domain Question Answering☆15Jun 1, 2021Updated 4 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- ☆63Sep 23, 2024Updated last year
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- ☆434Aug 13, 2024Updated last year
- batched loras☆351Sep 6, 2023Updated 2 years ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Manifold-Mixup implementation for fastai V2☆17Oct 1, 2020Updated 5 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- Bridging Large Language Models with Scala 3 Functions☆11Aug 31, 2024Updated last year
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- ☆124Dec 18, 2024Updated last year
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han…☆29Apr 7, 2024Updated 2 years ago
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2☆15Jun 27, 2025Updated 10 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆106Feb 28, 2024Updated 2 years ago
- Retrieval-Augmented Generation battle!☆67Apr 18, 2026Updated last month
- Flash-MoE sidecar slot-bank runtime for large GGUF MoE models on Apple Silicon — llama.cpp fork☆102May 16, 2026Updated last week
- Acceleration of word2vec using GPU☆13Jan 25, 2017Updated 9 years ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆280Nov 3, 2023Updated 2 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- Stream of my favorite papers and links☆44Apr 19, 2026Updated last month
- Convert all of libgen to high quality markdown☆255Dec 13, 2023Updated 2 years ago
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- Image scraper for DuckDuckGo and Google for creating DL datasets☆22Sep 18, 2020Updated 5 years ago