URS Benchmark: Evaluating LLMs on User Reported Scenarios
☆31May 30, 2025Updated last year
Alternatives and similar repositories for URS
Users who are interested in URS are comparing it to the libraries listed below.
- Code of LeCoRE☆13Feb 15, 2023Updated 3 years ago
- ☆51Nov 22, 2024Updated last year
- This is our implementation of IntEL-Intent-aware Ranking Ensemble for Personalized Recommendation (SIGIR2023)☆24Nov 17, 2023Updated 2 years ago
- Automated testing and benchmarking for code generation agents.☆18Jun 27, 2023Updated 3 years ago
- ☆16Mar 3, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆48Sep 28, 2025Updated 9 months ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…☆10May 30, 2023Updated 3 years ago
- ☆11Sep 19, 2025Updated 9 months ago
- Official Repository for "BlendX: Complex Multi-intent Detection with Blended Patterns"☆27Apr 27, 2026Updated 2 months ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated 2 years ago
- A collection of public Korean instruction datasets for training language models.☆19Jul 16, 2023Updated 2 years ago
- Implementation of self-certainty as an extension of ZeroEval Project☆37May 31, 2025Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- code for paper Sparse Structure Search for Delta Tuning☆11Oct 16, 2022Updated 3 years ago
- Code and data for Marked Personas (ACL 2023)☆30May 26, 2023Updated 3 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆18Apr 15, 2025Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆51Jan 24, 2025Updated last year
- ☆22Dec 15, 2023Updated 2 years ago
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- Repository for "Rescan: Inductive Instance Segmentation for Indoor RGBD Scans" (ICCV 2019)☆17Mar 12, 2020Updated 6 years ago
- OneFlow Serving☆20Apr 10, 2025Updated last year
- Translation of the StrategyQA dataset☆22Apr 12, 2024Updated 2 years ago
- ☆15Dec 3, 2024Updated last year
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 4 years ago
- Query Performance Prediction for Conversational Search (QPP4CS)☆112May 22, 2024Updated 2 years ago
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- ☆43Feb 2, 2024Updated 2 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- ☆36Oct 4, 2023Updated 2 years ago
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (https://huggingface.co/papers…☆92Nov 23, 2025Updated 7 months ago
- Official Implementation of "A Hybrid Architecture for Out of Domain Intent Detection and Intent Discovery"☆12May 31, 2023Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- Dataset Release for Explanations for CommonsenseQA, ACL 2021 Paper☆20Jul 30, 2021Updated 4 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Complete code for "Become an NLP Master in 30 Days: Mastering Key Tools and Techniques", an honorable mention in the "AI & Data" category of the 2023 iThome Ironman Contest; the series teaches you from scratch how to fine-tune large language models☆18Nov 21, 2024Updated last year
- ☆14Dec 9, 2021Updated 4 years ago
- ChatGPT-related papers☆15May 6, 2026Updated last month