dmahan93 / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆16Updated last year
Alternatives and similar repositories for lm-evaluation-harness:
Users that are interested in lm-evaluation-harness are comparing it to the libraries listed below
- ☆24Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 11 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- ☆48Updated last year
- ☆57Updated last year
- Automated testing and benchmarking for code generation agents.☆18Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- ☆30Updated 8 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆22Updated last year
- LLM reads a paper and produce a working prototype☆48Updated last month
- ☆20Updated last year
- ☆48Updated 4 months ago
- ☆30Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- ☆31Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated last year
- ☆16Updated last year
- entropix style sampling + GUI☆25Updated 4 months ago
- Overview and tutorials of the LlamaIndex Library☆18Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 7 months ago
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆58Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- ☆37Updated last year
- Data Questionnaire Agent Chatbot☆64Updated this week