epoch-research / training-cost-trends
☆19 Updated last month
Alternatives and similar repositories for training-cost-trends
Users who are interested in training-cost-trends are comparing it to the repositories listed below
- A sample pattern for running CI tests on Modal ☆17 Updated 5 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 Updated 2 weeks ago
- ☆54 Updated 11 months ago
- ☆25 Updated 5 months ago
- ☆23 Updated 2 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 Updated last year
- UQ: Assessing Language Models on Unsolved Questions ☆27 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Training hybrid models for dummies. ☆26 Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆19 Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆15 Updated last year
- ☆20 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 8 months ago
- Experiments to assess SPADE on different LLM pipelines. ☆17 Updated last year
- Minimum Description Length probing for neural network representations ☆20 Updated 8 months ago
- PyTorch implementation for MRL ☆19 Updated last year
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile. ☆18 Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated 2 years ago
- ☆49 Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 8 months ago
- ☆25 Updated 4 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval ☆20 Updated 3 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development ☆18 Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 5 months ago
- Tools for merging pretrained large language models. ☆19 Updated last year
- Latent Large Language Models ☆19 Updated last year
- Embedding Recycling for Language models ☆38 Updated 2 years ago