URS Benchmark: Evaluating LLMs on User Reported Scenarios
☆31May 30, 2025Updated 11 months ago
Alternatives and similar repositories for URS
Users who are interested in URS are comparing it to the libraries listed below.
- ☆12Nov 22, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- Code of LeCoRE☆13Feb 15, 2023Updated 3 years ago
- Code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations☆11Oct 13, 2023Updated 2 years ago
- NUIX-Studio App helps developers to create devices for VR-IoT environment☆23Jan 9, 2023Updated 3 years ago
- This is our implementation of IntEL-Intent-aware Ranking Ensemble for Personalized Recommendation (SIGIR2023)☆24Nov 17, 2023Updated 2 years ago
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated last year
- An extended project of the LLM Compiler paper, focusing on developing LLM-based Autonomous Agents.☆26Oct 22, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆42Sep 28, 2025Updated 7 months ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…☆10May 30, 2023Updated 2 years ago
- ☆11Sep 19, 2025Updated 8 months ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated 2 years ago
- Neural Unification for Logic Reasoning over Language☆22Nov 15, 2021Updated 4 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- A collection of public Korean instruction datasets for training language models.☆19Jul 16, 2023Updated 2 years ago
- Implementation of self-certainty as an extension of the ZeroEval project☆36May 31, 2025Updated 11 months ago
- Code and data for Marked Personas (ACL 2023)☆30May 26, 2023Updated 2 years ago
- Automatically Generated d2l-zh TensorFlow Notebooks for Colab☆13Aug 18, 2023Updated 2 years ago
- Query Performance Prediction for Conversational Search (QPP4CS)☆32May 22, 2024Updated 2 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆17Apr 15, 2025Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Switches the Keras backend of Baidu DeepSpeech from Theano to TensorFlow and integrates Mozilla's decoding module to deploy a Chinese speech recognition model☆10Dec 2, 2019Updated 6 years ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆50Jan 24, 2025Updated last year
- Learning and building APIs using FastAPI☆16Aug 7, 2021Updated 4 years ago
- Example code for drawing figures in research papers☆35Mar 3, 2022Updated 4 years ago
- Repository for "Rescan: Inductive Instance Segmentation for Indoor RGBD Scans" (ICCV 2019)☆17Mar 12, 2020Updated 6 years ago
- OneFlow Serving☆20Apr 10, 2025Updated last year
- Translation of the StrategyQA dataset☆22Apr 12, 2024Updated 2 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated 2 years ago
- ☆15Dec 3, 2024Updated last year
- ☆12Feb 25, 2024Updated 2 years ago
- Bayesian Model Averaging☆10Aug 1, 2019Updated 6 years ago
- This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)☆12Mar 24, 2018Updated 8 years ago
- ☆14Mar 25, 2023Updated 3 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- ☆42Feb 2, 2024Updated 2 years ago
- BERT finetuned on NER downstream tasks☆15Jun 12, 2023Updated 2 years ago