☆53 · Aug 31, 2025 · Updated 10 months ago
Alternatives and similar repositories for WebGen-Bench
Users who are interested in WebGen-Bench are comparing it to the libraries listed below.
- ☆23 · Jul 5, 2024 · Updated last year
- ☆14 · Mar 11, 2024 · Updated 2 years ago
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des… ☆26 · Nov 19, 2024 · Updated last year
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework ☆57 · Jun 8, 2026 · Updated 3 weeks ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents ☆58 · Nov 27, 2025 · Updated 7 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter". ☆12 · Dec 27, 2023 · Updated 2 years ago
- Neural discourse structure for text categorization ☆11 · Aug 27, 2017 · Updated 8 years ago
- The Easiest Pytorch Implementation of Branching-DQN ☆12 · Feb 10, 2021 · Updated 5 years ago
- ☆11 · Apr 20, 2021 · Updated 5 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis ☆12 · Sep 2, 2015 · Updated 10 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh… ☆11 · Nov 22, 2022 · Updated 3 years ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper. ☆12 · Nov 4, 2021 · Updated 4 years ago
- Extract structured strategy specifications from quantitative finance research papers — Agent Skill for GitHub Copilot & Claude Code ☆206 · Jun 12, 2026 · Updated 2 weeks ago
- (AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to… ☆15 · May 13, 2025 · Updated last year
- A curated list of awesome multi-modal recommendation. ☆10 · Mar 16, 2022 · Updated 4 years ago
- All-in-one benchmarking platform for evaluating LLM. ☆15 · Nov 12, 2025 · Updated 7 months ago
- ☆10 · May 24, 2021 · Updated 5 years ago
- Multi-Granularity LLM Debugger [ICSE2026] ☆98 · Jul 6, 2025 · Updated 11 months ago
- mcp wrapper for openai built-in tools ☆12 · Mar 13, 2025 · Updated last year
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021) ☆11 · Nov 21, 2021 · Updated 4 years ago
- PyTorch implementation of "A Simple Baseline for Low-Budget Active Learning". ☆14 · Dec 22, 2021 · Updated 4 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022 ☆13 · Dec 10, 2022 · Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va… ☆12 · Nov 6, 2023 · Updated 2 years ago
- ☆18 · Apr 19, 2023 · Updated 3 years ago
- This repository is created for recording the paper I read every day, so as to facilitate my review and push myself to learn. ☆13 · Oct 18, 2020 · Updated 5 years ago
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO). ☆21 · Feb 17, 2025 · Updated last year
- ☆18 · Feb 20, 2026 · Updated 4 months ago
- ☆11 · Jul 3, 2019 · Updated 6 years ago
- Simple setup for personal dotfiles ☆11 · Mar 29, 2026 · Updated 3 months ago
- ☆15 · Feb 18, 2023 · Updated 3 years ago
- ☆14 · Jan 22, 2025 · Updated last year
- Long Context Research ☆35 · Jan 26, 2026 · Updated 5 months ago
- Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans. ☆684 · May 17, 2026 · Updated last month
- A PyTorch implementation of a Deep Hidden Markov Model [Structured Inference Networks for Nonlinear State Space Models] ☆58 · Jul 25, 2024 · Updated last year
- Implementation of the paper "Contrastive Learning with Bidirectional Transformers for Sequential Recommendation". ☆28 · Nov 17, 2022 · Updated 3 years ago
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation" ☆21 · Jan 31, 2026 · Updated 5 months ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization ☆18 · Dec 22, 2024 · Updated last year
- Source Code & Datasets for "FBL: Feature-Balanced Loss for Long-Tailed Visual Recognition" ☆13 · Sep 3, 2022 · Updated 3 years ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models" ☆22 · Jan 16, 2025 · Updated last year