ayulockin/llm-eval-sweep

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ayulockin/llm-eval-sweep)

ayulockin / llm-eval-sweep

A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.

☆12

Alternatives and similar repositories for llm-eval-sweep

Users that are interested in llm-eval-sweep are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uts-cic / acawriter
View on GitHub
Academic Writing Analytics
☆11Jan 5, 2026Updated 6 months ago
luisveratudela / monitoring
View on GitHub
Estimates fatigue loads in wind turbines from SCADA data based on supervised learning.
☆10Sep 11, 2018Updated 7 years ago
iam-mhaseeb / Satellite-Imagery-Analysis-of-Vegetation-in-Southern-Pakistan
View on GitHub
This repository contains a study how we can examine the vegetation cover of a region with the help of satellite data. The notebook in thi…
☆13Dec 25, 2018Updated 7 years ago
soumik12345 / wandb-addons
View on GitHub
Weights & Biases Addons is a repository consisting of additional unitilities and community contributions for supercharging your Weights &…
☆23Jan 2, 2024Updated 2 years ago
aiverify-foundation / LLM-Evals-Catalogue
View on GitHub
This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr…
☆22Nov 16, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
abbey2017 / wind-energy-analytics
View on GitHub
Physics-guided data-driven solutions for the wind energy industry
☆28Jan 7, 2026Updated 6 months ago
sltzgs / KernelCPD_WindSCADA
View on GitHub
Code and supplementary material complementing the WES-publication: "Change-point detection in wind turbine SCADA data for robust conditio…
☆21Sep 2, 2021Updated 4 years ago
ywang-wnlo / HUSTLatexTemplate
View on GitHub
华中科技大学研究生课程论文LaTeX模板
☆13Aug 5, 2022Updated 3 years ago
sphinxbio / sliceanddice
View on GitHub
Experiment to slice, dice, and clean up spreadsheets
☆15May 3, 2024Updated 2 years ago
allenai / taggers
View on GitHub
Easily identify and label sentence intervals using various taggers.
☆16Feb 1, 2017Updated 9 years ago
JLDC / Data-Science-Fundamentals
View on GitHub
Workshop Data Science Fundamentals (Course at the University of St.Gallen)
☆21Jan 18, 2024Updated 2 years ago
argilla-io / argilla-llama-index
View on GitHub
A public repo that contains integrations for Argilla and LlamaIndex.
☆17Oct 10, 2024Updated last year
ArnaudFickinger / adversarial-surprise
View on GitHub
Explore and Control with Adversarial Surprise
☆10Jul 20, 2021Updated 5 years ago
123xiao / stock-tracker
View on GitHub
盯盘朋友仓位涨跌工具
☆22Apr 21, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zacksleo / pcs-alfred-workflow
View on GitHub
百度网盘 Alfred workflow
☆11Apr 23, 2021Updated 5 years ago
jasonplato / ray_robot
View on GitHub
Reinforcement Learning Robot avoiding obstacles(Python + V_rep)
☆12Oct 29, 2019Updated 6 years ago
jeekim / fasttextrank
View on GitHub
A word embedding and graph-based keyword extraction tool
☆19Oct 20, 2025Updated 9 months ago
tongzhou21 / ISD
View on GitHub
Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"
☆14Jun 1, 2021Updated 5 years ago
szemenyeim / DynEnv
View on GitHub
Dynamic Simulation Environments for Reinforcement Learning
☆13Apr 17, 2021Updated 5 years ago
aws-samples / Mistral-7B-Instruct-fine-tune-and-deploy-on-SageMaker
View on GitHub
☆22Jul 2, 2026Updated 3 weeks ago
Jay4242 / llm-websearch
View on GitHub
My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.
☆46Jan 27, 2026Updated 6 months ago
rvboards / linux_kernel_for_d1
View on GitHub
☆17Jul 31, 2021Updated 4 years ago
andreicnica / hrl_attention
View on GitHub
The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning
☆11Oct 31, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sujoyp / subgoal-discovery
View on GitHub
Learning from Trajectories via Subgoal Discovery
☆12Dec 10, 2020Updated 5 years ago
GimmyHchs / alfred-workflow-bitly
View on GitHub
Alfred 4.0 workflow which shorten the url via bitly
☆13Mar 16, 2022Updated 4 years ago
GreamDesu / OpenDeepArxiv
View on GitHub
OpenDeepArxiv is an open-sourced project designed to streamline the process of searching for research papers on arXiv, filtering based on…
☆26Feb 18, 2025Updated last year
AI-Maker-Space / DeepResearch-HF
View on GitHub
☆16Feb 5, 2025Updated last year
yubin-park / pharmpy
View on GitHub
pharmpy is an umbrella library for searching the FDA NDC directory, Established Pharmacologic Class (EPC), Anatomical Therapeutic Chemica…
☆18Oct 26, 2020Updated 5 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
opencobra / Medusa
View on GitHub
Analysis of ensembles of metabolic network reconstructions
☆22May 20, 2026Updated 2 months ago
HansalShah007 / semroute
View on GitHub
A flexible and easy to use tool for Semantic Routing
☆21Sep 13, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Finndersen / pydanticai_mcp_demo
View on GitHub
Demonstrate using MCP with Pydantic AI framework
☆14Mar 14, 2025Updated last year
allenai / S2APLER
View on GitHub
S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)
☆22Jul 8, 2026Updated 3 weeks ago
microsoft / nlu-incremental-symbol-learning
View on GitHub
incremental symbol learning for natural language understanding
☆10Jun 12, 2023Updated 3 years ago
ivanDonadello / Declare4Py
View on GitHub
A Python package for declarative Process Mining with Machine Learning applications
☆34Jun 26, 2026Updated last month
cogilab / Face
View on GitHub
Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)
☆10Nov 2, 2021Updated 4 years ago
srsohn / shortest-path-rl
View on GitHub
A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"
☆13Jul 19, 2021Updated 5 years ago
g8hh / evolve
View on GitHub
☆24Jun 21, 2026Updated last month