teknium1 / transformers-gptq-quant
☆48 Updated last year
Alternatives and similar repositories for transformers-gptq-quant:
Users interested in transformers-gptq-quant are comparing it to the libraries listed below.
- ☆22 Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated 3 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated last year
- ☆20 Updated last year
- ☆38 Updated 9 months ago
- ☆24 Updated last year
- Ongoing research training transformer models at scale ☆36 Updated last year
- ☆66 Updated 11 months ago
- look how they massacred my boy ☆63 Updated 6 months ago
- Simple GRPO scripts and configurations. ☆58 Updated 3 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆17 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆32 Updated 2 months ago
- ☆48 Updated 5 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 Updated 7 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 9 months ago
- ☆43 Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- ☆61 Updated last year
- ☆80 Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 3 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆22 Updated last month
- Verbosity control for AI agents ☆63 Updated 11 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated last month
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆105 Updated last year