IST-DASLab / gptq-gguf-toolkitLinks
DASLab support for GGUF
☆37Updated this week
Alternatives and similar repositories for gptq-gguf-toolkit
Users that are interested in gptq-gguf-toolkit are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated this week
- ☆61Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆88Updated 3 months ago
- Train, tune, and infer Bamba model☆132Updated 3 months ago
- ☆64Updated 5 months ago
- ☆51Updated last year
- RWKV-7: Surpassing GPT☆95Updated 10 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last month
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆97Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 6 months ago
- ☆63Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆291Updated 3 weeks ago
- EvaByte: Efficient Byte-level Language Models at Scale☆109Updated 4 months ago
- Pivotal Token Search☆124Updated 2 months ago
- ☆68Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆116Updated last month
- ☆134Updated last year
- ☆54Updated 10 months ago
- ☆19Updated 6 months ago
- PyTorch implementation of models from the Zamba2 series.☆185Updated 7 months ago
- look how they massacred my boy☆64Updated 11 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆49Updated 4 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 8 months ago
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- ☆56Updated 2 months ago
- Work in progress.☆72Updated 2 months ago
- Collection of autoregressive model implementation☆86Updated 4 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆30Updated 3 weeks ago