georgesung / LLM-WikipediaQA
Document Q&A on Wikipedia articles using LLMs
☆74Updated last year
Alternatives and similar repositories for LLM-WikipediaQA:
Users that are interested in LLM-WikipediaQA are comparing it to the libraries listed below
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 9 months ago
- This repository implements the chain of verification paper by Meta AI☆160Updated last year
- ☆57Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆106Updated 3 weeks ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated last year
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- Simple examples using Argilla tools to build AI☆52Updated 2 months ago
- Large Language Model (LLM) Inference API and Chatbot☆124Updated 9 months ago
- Data extraction with LLM on CPU☆112Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆46Updated 10 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆100Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆100Updated last month
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆163Updated last year
- Sample notebooks and prompts for LLM evaluation☆119Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆101Updated 9 months ago
- ☆51Updated last year
- ☆76Updated 7 months ago
- ☆59Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆111Updated 5 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆104Updated 4 months ago
- ☆88Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆79Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 3 months ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year