A framework for evolving and testing question-answering datasets with various models.
☆25Feb 28, 2024Updated 2 years ago
Alternatives and similar repositories for Self-Evolving-Benchmark
Users that are interested in Self-Evolving-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- chinese ner based on rnn☆12Oct 14, 2016Updated 9 years ago
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆11Apr 23, 2022Updated 4 years ago
- ☆12Sep 8, 2020Updated 5 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆31Jan 11, 2026Updated 3 months ago
- [AAAI 2025 Oral] Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks☆30Apr 14, 2025Updated last year
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆39Feb 25, 2025Updated last year
- Something about 3D face reconstruction☆19Mar 24, 2023Updated 3 years ago
- 利用大语言模型进行卧底游戏,包括谁是卧底及衍生的发现AI卧底游戏等。☆11Sep 6, 2024Updated last year
- ☆32Dec 14, 2025Updated 4 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆196Mar 25, 2024Updated 2 years ago
- Chinese Generation Evaluation☆13Aug 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year
- ☆30Jun 5, 2025Updated 11 months ago
- Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks☆13Jun 27, 2020Updated 5 years ago
- ☆19Mar 25, 2024Updated 2 years ago
- ☆18Mar 19, 2023Updated 3 years ago
- Repository for the Exposing Outlier Exposure paper☆12Aug 20, 2024Updated last year
- insight data engineering fellow project☆16Nov 14, 2016Updated 9 years ago
- ALBench Leaderboard for active learning in object detection☆15Jan 13, 2023Updated 3 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆25Nov 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Nov 3, 2024Updated last year
- The multi-view version of MonoDETR on nuScenes dataset☆21Nov 4, 2022Updated 3 years ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆28May 26, 2025Updated 11 months ago
- Class Prior Estimation in Active Positive and Unlabeled Learning☆16Mar 24, 2021Updated 5 years ago
- R-LPIPS [ICML W 2023]☆17Nov 14, 2023Updated 2 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- [ICASSP '26] This is the code repo for our paper: LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thou…☆30Aug 20, 2025Updated 8 months ago
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆30Oct 18, 2022Updated 3 years ago
- code for ACL 2023 paper 'Event Extraction as Question Generation and Answering'☆24Aug 13, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Codes, data, and baselines for CIKM 2023 Long Paper "Dual Intents Graph Modeling for User-centric Group Discovery"☆17Oct 22, 2023Updated 2 years ago
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆26Jan 11, 2025Updated last year
- Official repository for the Findings of ACL 2023 paper "AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Co…☆20May 16, 2023Updated 2 years ago
- LM, ULMFit et al.☆46Dec 30, 2019Updated 6 years ago
- Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of…☆32May 22, 2024Updated last year
- LLM-Check: Investigating Detection of Hallucinations in Large Language Models (NeurIPS 2024)☆40Dec 8, 2024Updated last year
- Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"☆33Dec 6, 2024Updated last year