philschmid / text-generation-inference-tests
☆20 Updated last year
Alternatives and similar repositories for text-generation-inference-tests:
Users who are interested in text-generation-inference-tests are comparing it to the libraries listed below.
- ☆24 Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆102 Updated 3 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆104 Updated 3 months ago
- ☆30 Updated 8 months ago
- ☆51 Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 4 months ago
- ☆18 Updated 5 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ☆29 Updated 7 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆38 Updated last year
- ☆41 Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated 11 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆88 Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- ☆76 Updated 9 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 7 months ago
- ☆13 Updated 2 months ago
- ☆48 Updated last year
- ☆45 Updated 5 months ago
- Run LLMs on Replicate with vLLM ☆16 Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 4 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 Updated 8 months ago
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆48 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- ☆31 Updated last year
- ☆73 Updated 2 months ago
- ☆31 Updated last year
- ☆65 Updated 9 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data ☆29 Updated 5 months ago
- LLM finetuning ☆42 Updated last year