epoch-research / training-cost-trendsLinks
☆20Updated last week
Alternatives and similar repositories for training-cost-trends
Users that are interested in training-cost-trends are comparing it to the libraries listed below
Sorting:
- ☆25Updated 7 months ago
- UQ: Assessing Language Models on Unsolved Questions☆29Updated 3 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- Aioli: A unified optimization framework for language model data mixing☆31Updated 10 months ago
- ☆55Updated last year
- ☆88Updated last month
- A sample pattern for running CI tests on Modal☆18Updated 7 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆25Updated 6 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆22Updated 5 months ago
- Creating Generative AI Apps which work☆17Updated 7 months ago
- ☆52Updated 9 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆18Updated 2 years ago
- ☆51Updated last year
- Finding semantically meaningful and accurate prompts.☆49Updated 2 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 10 months ago
- ☆22Updated 9 months ago
- code for training and using chess embeddings models☆13Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆56Updated last week
- Evaluation framework for document processing models and services.☆57Updated this week
- Training LLMs to reason and analyze data with notebooks☆54Updated 2 months ago
- Minimum Description Length probing for neural network representations☆20Updated 10 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- QLoRA for Masked Language Modeling☆22Updated 2 years ago
- ☆20Updated last year