stanford-oval / WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
☆1,414Updated 2 months ago
Alternatives and similar repositories for WikiChat:
Users that are interested in WikiChat are comparing it to the libraries listed below
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs☆1,631Updated 5 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,578Updated this week
- ☆842Updated 6 months ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.☆1,388Updated last month
- Knowledge Agents and Management in the Cloud☆3,827Updated this week
- High-performance retrieval engine for unstructured data☆1,284Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,014Updated 2 months ago
- LLM for Long Text Summary (Comprehensive Bulleted Notes)☆526Updated 2 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.☆658Updated last month
- Democratizing Reinforcement Learning for LLMs☆2,158Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,477Updated 3 weeks ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,141Updated 3 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆1,099Updated last month
- OpenResearcher, an advanced Scientific Research Assistant☆438Updated 5 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆548Updated last week
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆2,087Updated this week
- Prompt optimization scratch☆678Updated 3 weeks ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,140Updated 9 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,393Updated 3 months ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,170Updated 2 weeks ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆757Updated last month
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆482Updated 3 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,250Updated 2 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,756Updated last month
- Evaluate your LLM's response with Prometheus and GPT4 💯☆893Updated last week
- Synthetic data curation for post-training and structured data extraction☆1,097Updated this week
- A library for advanced large language model reasoning☆2,065Updated last month
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆613Updated last week
- DataComp for Language Models☆1,267Updated last week
- GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to…☆2,060Updated 4 months ago