togethercomputer / finetuningLinks
Finetune Llama-3-8b on the MathInstruct dataset
☆110Updated 8 months ago
Alternatives and similar repositories for finetuning
Users that are interested in finetuning are comparing it to the libraries listed below
Sorting:
- ☆114Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆189Updated 6 months ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆149Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated 11 months ago
- Lightweight open-source perplexity☆62Updated last year
- Function Calling Benchmark & Testing☆87Updated 11 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- ☆66Updated last year
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- ☆157Updated 11 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- ☆77Updated last year
- ☆63Updated last month
- ☆86Updated 9 months ago
- ☆47Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆74Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- ☆127Updated 3 months ago
- All the world is a play, we are but actors in it.☆50Updated this week
- The next evolution of Agents☆48Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆103Updated last year