Shopping MMLU: A Multi-Task Online Shopping Benchmark for LLMs.
☆46Nov 4, 2024Updated last year
Alternatives and similar repositories for ShoppingMMLU
Users that are interested in ShoppingMMLU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jul 14, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- A new type of sorting algorithm. Use large language model (llm like gpt, chat-gpt or others) to sort collections.☆12Jun 7, 2023Updated 2 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 6 months ago
- Library to extract text from HTML files☆11Dec 20, 2015Updated 10 years ago
- Design of in-memory database☆14Aug 4, 2024Updated last year
- The reproduce of paper "Continual Vision-Language Representation Learning with Off-Diagonal Information ".(Mod-X)☆11Oct 31, 2023Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 10 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- A toolkit for dialogue system evaluation via crowdsourcing☆18Apr 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)☆19Nov 5, 2024Updated last year
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆44Jun 6, 2025Updated 10 months ago
- ☆34Mar 21, 2026Updated 3 weeks ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- ☆12Dec 20, 2024Updated last year
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…☆27Jun 20, 2025Updated 9 months ago
- ☆28Jul 11, 2024Updated last year
- ☆82Mar 11, 2025Updated last year
- Large language models for document ranking.☆72Apr 12, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- CVPR2021☆12Mar 29, 2021Updated 5 years ago
- R package for Conditional Random Fields☆20Oct 22, 2025Updated 5 months ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- An index for papers on large language model agents for recommendation and search.☆91Feb 12, 2026Updated 2 months ago
- Official code repository for the ICLR 2022 paper "You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction".☆14Jul 25, 2024Updated last year
- COVID-19 Risk Estimation for L.A. County using a Bayesian Time-varying SIR-model☆12Feb 17, 2023Updated 3 years ago
- ☆14Dec 27, 2016Updated 9 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A dataset for Vietnamese Spelling Correction☆15Sep 27, 2021Updated 4 years ago
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated last year
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 6 years ago
- How to really install tensorflow-gpu from source on a clean instance of Ubuntu☆11Sep 29, 2023Updated 2 years ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 11 months ago
- Hierarchical Encoder Decoder for Dialog Modelling☆16May 20, 2015Updated 10 years ago