A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.
☆12Jul 11, 2023Updated 2 years ago
Alternatives and similar repositories for llm-eval-sweep
Users that are interested in llm-eval-sweep are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Academic Writing Analytics☆10Jan 5, 2026Updated 4 months ago
- This repository contains a study how we can examine the vegetation cover of a region with the help of satellite data. The notebook in thi…☆13Dec 25, 2018Updated 7 years ago
- Estimates fatigue loads in wind turbines from SCADA data based on supervised learning.☆10Sep 11, 2018Updated 7 years ago
- Weights & Biases Addons is a repository consisting of additional unitilities and community contributions for supercharging your Weights &…☆23Jan 2, 2024Updated 2 years ago
- This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr…☆20Nov 16, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 盯盘朋友仓位涨跌工具☆22Apr 21, 2025Updated last year
- Physics-guided data-driven solutions for the wind energy industry☆28Jan 7, 2026Updated 4 months ago
- Code and supplementary material complementing the WES-publication: "Change-point detection in wind turbine SCADA data for robust conditio…☆20Sep 2, 2021Updated 4 years ago
- 华中科技大学研究生课程论文LaTeX模板☆12Aug 5, 2022Updated 3 years ago
- Experiment to slice, dice, and clean up spreadsheets☆15May 3, 2024Updated 2 years ago
- Easily identify and label sentence intervals using various taggers.☆16Feb 1, 2017Updated 9 years ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Oct 10, 2024Updated last year
- Workshop Data Science Fundamentals (Course at the University of St.Gallen)☆21Jan 18, 2024Updated 2 years ago
- A Python package for declarative Process Mining with Machine Learning applications☆33Nov 19, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- A word embedding and graph-based keyword extraction tool☆19Oct 20, 2025Updated 6 months ago
- 百度网盘 Alfred workflow☆11Apr 23, 2021Updated 5 years ago
- Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"☆14Jun 1, 2021Updated 4 years ago
- ☆22Apr 8, 2026Updated last month
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 5 years ago
- ☆17Jul 31, 2021Updated 4 years ago
- ☆11Jun 21, 2025Updated 10 months ago
- Hierarchical reinforcement learning framework which uses a directed graph to define the hierarchy.☆16Aug 5, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆35Dec 30, 2025Updated 4 months ago
- pharmpy is an umbrella library for searching the FDA NDC directory, Established Pharmacologic Class (EPC), Anatomical Therapeutic Chemica…☆17Oct 26, 2020Updated 5 years ago
- Reinforcement Learning Robot avoiding obstacles(Python + V_rep)☆12Oct 29, 2019Updated 6 years ago
- The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning☆11Oct 31, 2021Updated 4 years ago
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Catch incompatibilities between gems☆27Nov 15, 2024Updated last year
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 7 months ago
- Alfred 4.0 workflow which shorten the url via bitly☆13Mar 16, 2022Updated 4 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆18Jun 25, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Nov 2, 2021Updated 4 years ago
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated last year
- ☆23Sep 27, 2025Updated 7 months ago
- Analysis of ensembles of metabolic network reconstructions☆22Feb 9, 2026Updated 3 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated 11 months ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago