yujonglee / eval
Evaluate your LLM apps, RAG pipeline, any generated text, and more!
☆0Updated 8 months ago
Alternatives and similar repositories for eval:
Users that are interested in eval are comparing it to the libraries listed below
- manage histories of LLM applied applications☆88Updated last year
- ☆30Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- ☆37Updated last year
- Prompt & model versioning on the cloud☆11Updated 7 months ago
- 1-Click is all you need.☆59Updated 9 months ago
- Using modal.com to process FineWeb-edu data☆19Updated last month
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆21Updated 10 months ago
- ☆75Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 9 months ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆221Updated 6 months ago
- ✅ Pytest-style test runner for langchain projects☆25Updated last year
- Voyage AI Official Python Library☆49Updated last month
- ☆38Updated last year
- Pinecone text client library☆59Updated last month
- ☆20Updated last year
- Build complex LLM Applications with Python Dictionary☆38Updated 3 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆72Updated this week
- ☆51Updated last month
- LLM finetuning☆42Updated last year
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆45Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 10 months ago
- Example of running LangChain on Cloud Run☆61Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quant…☆29Updated last year
- ☆35Updated 10 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 11 months ago