bayjarvis / llm
Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ
☆15 Updated 6 months ago
Alternatives and similar repositories for llm
Users that are interested in llm are comparing it to the libraries listed below
- Implementation of the Mamba SSM with hf_integration. ☆56 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆35 Updated last year
- ☆23 Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆70 Updated 2 years ago
- Finetune any model on HF in less than 30 seconds ☆56 Updated 2 months ago
- Demonstration that finetuning a RoPE model on longer sequences than the pre-trained model saw adapts the model's context limit ☆63 Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- ☆86 Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆32 Updated 9 months ago
- ☆39 Updated last year
- ☆55 Updated last year
- ☆63 Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs ☆18 Updated 2 years ago
- Reward Model framework for LLM RLHF ☆62 Updated 2 years ago
- Using multiple LLMs for ensemble forecasting ☆16 Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- A library for simplifying training with multi-GPU setups in the HuggingFace / PyTorch ecosystem. ☆16 Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 Updated last year
- The Next Generation Multi-Modality Superintelligence ☆70 Updated last year
- Lottery Ticket Adaptation ☆40 Updated last year
- Understanding the correlation between different LLM benchmarks ☆29 Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets. ☆49 Updated 2 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated 2 years ago
- GitHub repo for Peifeng's internship project ☆13 Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 Updated last year
- PyTorch implementation for MRL ☆20 Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- Training hybrid models for dummies. ☆29 Updated 2 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated last year