NVIDIA / workbench-example-llama2-finetune
An NVIDIA AI Workbench Example Project for Finetuning Llama 2
☆28Updated 4 months ago
Alternatives and similar repositories for workbench-example-llama2-finetune:
Users that are interested in workbench-example-llama2-finetune are comparing it to the libraries listed below
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆50Updated 7 months ago
- ☆47Updated last month
- ☆43Updated 3 months ago
- ☆39Updated last month
- ☆46Updated 2 months ago
- ☆20Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆59Updated 5 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments☆40Updated this week
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 3 months ago
- ☆46Updated 6 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆50Updated 3 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆34Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- ☆14Updated 3 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆61Updated 7 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 11 months ago
- Automation Framework using LLM-as-a-judge to Scale Eval of Gen AI solutions (RAG, Multi-turn, Query Rewrite, Text2SQL etc.); that is a go…☆21Updated last week
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- Training hybrid models for dummies.☆16Updated this week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆55Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆34Updated 8 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆22Updated last month
- ☆34Updated 5 months ago
- ☆51Updated last month
- ☆62Updated 5 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆42Updated 3 months ago