maiush / OpenCharacterTraining
Open Character Training
☆52 Updated this week
Alternatives and similar repositories for OpenCharacterTraining
Users who are interested in OpenCharacterTraining are comparing it to the libraries listed below.
- ☆40 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆70 Updated 2 years ago
- ☆25 Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆58 Updated last month
- ☆55 Updated last year
- ☆63 Updated last year
- Simple GRPO scripts and configurations. ☆59 Updated 9 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆77 Updated last month
- Official repo for Learning to Reason for Long-Form Story Generation ☆72 Updated 7 months ago
- ☆15 Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 7 months ago
- Visual RAG using less than 300 lines of code. ☆29 Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆56 Updated this week
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- ☆88 Updated 3 weeks ago
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated 9 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 Updated last year
- ☆20 Updated last week
- Evaluating LLMs with CommonGen-Lite ☆91 Updated last year
- ☆38 Updated 7 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 9 months ago
- ☆22 Updated 2 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆59 Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆31 Updated 7 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆62 Updated 7 months ago
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think ☆68 Updated this week