epoch-research / training-cost-trends
☆14Updated last month
Alternatives and similar repositories for training-cost-trends:
Users that are interested in training-cost-trends are comparing it to the libraries listed below
- ☆18Updated 6 months ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- ☆21Updated 2 months ago
- Latent Large Language Models☆17Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- Tools for merging pretrained large language models.☆19Updated 10 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated 7 months ago
- ☆20Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Implementation☆24Updated last month
- Training hybrid models for dummies.☆20Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 7 months ago
- ☆13Updated 4 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- ☆18Updated last year
- A sample pattern for running CI tests on Modal☆17Updated last week
- Knowledge Graph Generator app☆30Updated last year
- ☆38Updated 9 months ago
- ☆11Updated last year
- BH hackathon☆14Updated last year
- ☆9Updated 6 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 6 months ago
- Creating Generative AI Apps which work☆17Updated last week
- ☆17Updated 2 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆16Updated this week
- LMQL implementation of tree of thoughts☆34Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated this week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago