☆52Aug 31, 2025Updated 9 months ago
Alternatives and similar repositories for WebGen-Bench
Users that are interested in WebGen-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jul 5, 2024Updated last year
- A copy of the source for Grinstead and Snell's lovely probability book☆13Dec 20, 2015Updated 10 years ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 10 months ago
- ☆14Mar 11, 2024Updated 2 years ago
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…☆25Nov 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆58Nov 27, 2025Updated 6 months ago
- Computer-Use Agents as Judges for Generative UI☆45Nov 27, 2025Updated 6 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆44Apr 16, 2026Updated last month
- Neural discourse structure for text categorization☆11Aug 27, 2017Updated 8 years ago
- The Easiest Pytorch Implementation of Branching-DQN☆12Feb 10, 2021Updated 5 years ago
- ☆11Apr 20, 2021Updated 5 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Extract structured strategy specifications from quantitative finance research papers — Agent Skill for GitHub Copilot & Claude Code☆168Jun 5, 2026Updated last week
- Sklearn implement of multiple ensemble learning methods, including bagging, adaboost, iterative bagging and multiboosting☆13Jan 9, 2018Updated 8 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Sep 15, 2023Updated 2 years ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- (AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to…☆14May 13, 2025Updated last year
- A curated list of awesome multi-modal recommendation.☆10Mar 16, 2022Updated 4 years ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 7 months ago
- Multi-Granularity LLM Debugger [ICSE2026]☆98Jul 6, 2025Updated 11 months ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Clober Solidity Library☆10Jun 9, 2025Updated last year
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆10Jun 10, 2022Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022☆13Dec 10, 2022Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆18Apr 19, 2023Updated 3 years ago
- Beimingwu is the first systematic open-source implementation of the learnware dock system, providing a preliminary research platform for …☆121Jul 17, 2024Updated last year
- 📊 A simple command-line utility for querying and monitoring GPU status☆14Aug 3, 2023Updated 2 years ago
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- A Soul-grounded Minecraft social simulation runtime where Mineflayer actors pursue LifeGoals through evidence-backed action skills and tr…☆20Updated this week
- 简单易理解的代码,用于在qwen上使用grpo加强数学能力☆57May 14, 2025Updated last year
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆20Feb 17, 2025Updated last year
- LLM Prompting for Text2SQL via Gradual SQL Reffnement☆15Feb 19, 2025Updated last year
- Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.☆643May 17, 2026Updated 3 weeks ago