A framework for evolving and testing question-answering datasets with various models.
☆23Feb 28, 2024Updated 2 years ago
Alternatives and similar repositories for Self-Evolving-Benchmark
Users that are interested in Self-Evolving-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 17, 2025Updated 10 months ago
- ☆12Sep 23, 2024Updated last year
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- chinese ner based on rnn☆12Oct 14, 2016Updated 9 years ago
- ☆10Jun 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- TabLeak: Tabular Data Leakage in Federated Learning☆17Jul 4, 2024Updated last year
- ☆12Sep 8, 2020Updated 5 years ago
- ☆32Jun 28, 2025Updated 9 months ago
- ☆13Mar 8, 2024Updated 2 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- The source code of Paper "PathQG: Neural Question Generation from Facts".☆23Jan 4, 2021Updated 5 years ago
- ☆30Jan 11, 2026Updated 2 months ago
- [AAAI 2025 Oral] Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks☆30Apr 14, 2025Updated 11 months ago
- MLLM @ Game☆16May 12, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆25Mar 4, 2025Updated last year
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆22Apr 26, 2025Updated 11 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆195Mar 25, 2024Updated 2 years ago
- ☆16Mar 22, 2024Updated 2 years ago
- Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechan…☆16Aug 6, 2024Updated last year
- Robust Point Cloud Processing through Positional Embedding☆13Sep 7, 2023Updated 2 years ago
- Chinese Generation Evaluation☆13Aug 14, 2023Updated 2 years ago
- Code for ICCV 2023 work "Generalized Few-Shot Point Cloud Segmentation Via Geometric Words"☆12Sep 26, 2023Updated 2 years ago
- The official repository for paper: BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization☆48Dec 9, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆37Oct 15, 2024Updated last year
- An asymmetric 1v1 multiplayer game using Unreal Engine☆18Feb 25, 2017Updated 9 years ago
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆14Sep 4, 2025Updated 6 months ago
- ☆29Jun 5, 2025Updated 9 months ago
- Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks☆13Jun 27, 2020Updated 5 years ago
- ☆18Mar 25, 2024Updated 2 years ago
- ☆18Mar 19, 2023Updated 3 years ago
- Repository for the Exposing Outlier Exposure paper☆12Aug 20, 2024Updated last year
- ALBench Leaderboard for active learning in object detection☆15Jan 13, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- window.hjSiteSettings = {"forms":[],"record":true,"polls":[],"r":1.0,"record_targeting_rules":[],"deferred_page_contents":[{"targeting":[…☆16Updated this week
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆50Apr 19, 2024Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- Class Prior Estimation in Active Positive and Unlabeled Learning☆16Mar 24, 2021Updated 5 years ago
- R-LPIPS [ICML W 2023]☆17Nov 14, 2023Updated 2 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆24Nov 29, 2024Updated last year