NVIDIA / workbench-example-llama2-finetune
An NVIDIA AI Workbench Example Project for Finetuning Llama 2
☆27Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for workbench-example-llama2-finetune
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆49Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆29Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- ☆40Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆91Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated last month
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆83Updated this week
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆23Updated last month
- Inference examples☆18Updated 2 months ago
- ☆29Updated 4 months ago
- DSPY on action with OpenSource LLMs.☆54Updated 7 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆20Updated this week
- ☆41Updated 2 months ago
- ☆42Updated 4 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆47Updated 10 months ago
- Measuring RAG solutions throughput and latency☆12Updated 3 months ago
- ☆22Updated 3 months ago
- ☆44Updated 5 months ago
- Self-host LLMs with vLLM and BentoML☆72Updated last week
- ☆25Updated 3 months ago
- ☆21Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- The first dense retrieval model that can be prompted like an LM☆62Updated last month
- Github repo for storing LlamaDatasets☆29Updated 10 months ago
- Tutorial for DSPy☆21Updated 6 months ago
- ☆39Updated 3 weeks ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆93Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆29Updated 7 months ago