epoch-research / training-cost-trendsLinks
☆14Updated 2 months ago
Alternatives and similar repositories for training-cost-trends
Users that are interested in training-cost-trends are comparing it to the libraries listed below
Sorting:
- ☆19Updated last week
- ☆21Updated 3 weeks ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆29Updated this week
- BH hackathon☆14Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆12Updated 6 months ago
- ☆21Updated 3 months ago
- ☆21Updated 3 months ago
- ☆13Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- Latent Large Language Models☆18Updated 9 months ago
- Official Repository for Task-Circuit Quantization☆20Updated this week
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 3 weeks ago
- ☆16Updated 3 months ago
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- Proceedings of Innovative Use of NLP for Building Educational Applications 2023: SIGHT: A Large Annotated Dataset on Student Insights Gat…☆9Updated 10 months ago
- Aioli: A unified optimization framework for language model data mixing☆25Updated 4 months ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated 9 months ago
- ☆9Updated 7 months ago
- ☆22Updated last year
- ☆9Updated last month
- MPI Code Generation through Domain-Specific Language Models☆14Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆15Updated 3 weeks ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated last week
- Creating Generative AI Apps which work☆17Updated last month
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated last month
- ☆11Updated last year