yujonglee / eval
Evaluate your LLM apps, RAG pipeline, any generated text, and more!
☆0Updated 11 months ago
Alternatives and similar repositories for eval:
Users that are interested in eval are comparing it to the libraries listed below
- manage histories of LLM applied applications☆88Updated last year
- ☆37Updated last year
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆43Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Pinecone text client library☆61Updated last month
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆117Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆30Updated 7 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- ☆75Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆50Updated 6 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last week
- LLMON (pronounced limón) is a structured data format optimized for large language models☆34Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- Build complex LLM Applications with Python Dictionary☆39Updated 6 months ago
- Voyage AI Official Python Library☆57Updated 4 months ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆21Updated last year
- 1-Click is all you need.☆61Updated 11 months ago
- Search through the Weaviate Podcast!☆56Updated 3 months ago
- ☆20Updated last year
- ☆53Updated 4 months ago
- Prompt & model versioning on the cloud☆10Updated 9 months ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- Text to Python Objects via a LLM Function Call☆57Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆149Updated 6 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- The Universe of Evaluation. All about the evaluation for LLMs.☆223Updated 9 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year