LiveBench / liveswebench
☆61 · Updated 10 months ago
Alternatives and similar repositories for liveswebench
Users interested in liveswebench are comparing it to the repositories listed below.
- ☆132 · Updated 8 months ago
- ☆131 · Updated 9 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆245 · Updated last year
- [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆58 · Updated 6 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆538 · Updated this week
- A simple unified framework for evaluating LLMs ☆261 · Updated 9 months ago
- RepoQA: Evaluating Long-Context Code Understanding ☆128 · Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆225 · Updated 7 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆230 · Updated 6 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆146 · Updated last year
- A Comprehensive Benchmark for Software Development. ☆127 · Updated last year
- ☆80 · Updated 10 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆430 · Updated this week
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live! ☆161 · Updated last week
- ☆313 · Updated last year
- ☆56 · Updated last year
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings ☆65 · Updated last year
- Reproducible, flexible LLM evaluations ☆337 · Updated last week
- The code for the paper "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models" ☆56 · Updated 3 months ago
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024] ☆335 · Updated 2 months ago
- Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task. ☆246 · Updated last week
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR 2025] ☆110 · Updated 11 months ago
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: … ☆90 · Updated 2 months ago
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI ☆475 · Updated last month
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆136 · Updated last year
- Complex Function Calling Benchmark. ☆165 · Updated last year
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach. ☆238 · Updated 5 months ago
- ☆41 · Updated 10 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆255 · Updated last year
- WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. ☆62 · Updated last month