A iterative feedback driven benchmark on LLM's instruction following ability
☆56Jan 22, 2026Updated 3 months ago
Alternatives and similar repositories for Meeseeks
Users that are interested in Meeseeks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- Experiment results using FM, FFM and DeepFM algorithms in Criteo Display Advertising Challenge(https://www.kaggle.com/c/criteo-display-ad…☆13Apr 15, 2020Updated 6 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- ☆12Jun 12, 2024Updated last year
- ☆11Oct 13, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Aug 30, 2022Updated 3 years ago
- CITE: A Corpus of Image-Text Discourse Relations☆13Apr 7, 2019Updated 7 years ago
- ☆13May 23, 2021Updated 4 years ago
- Weird autoencoder experiments☆24Apr 24, 2026Updated last week
- ☆10Apr 5, 2025Updated last year
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 3 years ago
- code☆15Jun 21, 2020Updated 5 years ago
- 这是一个为 Claude Code / AI Agent 设计的诊断技能(Skill)。它通过自省式分析和多项特定的压力测试,帮助用户检测当前使用的 API 是否为官方原版的 Claude 4.6 模型,或者是否存在第三方中转、提示词注入与封装。☆72Mar 24, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- simplify the prediction process for a finetuned bert model☆11Jun 19, 2019Updated 6 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 5 months ago
- 实现《Multiway Attention Networks for Modeling Sentence Pairs》中的网络模型,可用于问答,句子逻辑推理☆11Apr 13, 2020Updated 6 years ago
- Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"☆17May 17, 2021Updated 4 years ago
- IRIT experiments on the STAC corpus☆17Mar 19, 2018Updated 8 years ago
- 这是一个通过多模态大模型+LangGraph实现的PPT生成系统,包含前端、后端以及Core核心三部分构成。☆45Jun 9, 2025Updated 10 months ago
- 关键词抽取技术☆18Sep 11, 2019Updated 6 years ago
- Longyin Zhang, Fang Kong, and Guodong Zhou. Adversarial Learning for Discourse Rhetorical Structure Parsing. Accepted by ACL-IJCNLP2021.☆18Jan 12, 2023Updated 3 years ago
- a simple implementation of part-of-speech tagging with hmm☆13Feb 26, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Biaffine Dependency Parser, implemented in PyTorch.☆12Feb 19, 2018Updated 8 years ago
- Code Repository for "Please Mind the Root: Decoding Arborescences for Dependency Parsing" and "On Finding the K-best Non-projective Depen…☆20Dec 12, 2022Updated 3 years ago
- The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Lang…☆158Sep 2, 2025Updated 7 months ago
- A top-down text-level discourse parser.☆17Jun 26, 2023Updated 2 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 6 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆40Mar 24, 2026Updated last month
- ☆17Jul 20, 2022Updated 3 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.☆13Jun 7, 2022Updated 3 years ago
- DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image (ICLR 2025)☆24Jan 12, 2026Updated 3 months ago
- the PyTorch implementation of paper: [Neural Response Generation via GAN with an Approximate Embedding Layer](http://www.aclweb.org/antho…☆10Feb 6, 2018Updated 8 years ago
- ☆23May 25, 2022Updated 3 years ago
- This repository is the code and data for DialMed: A Dataset for Dialogue-based Medication Recommendation, COLING 2022.☆23Oct 26, 2022Updated 3 years ago
- Paraphrase Identification with Deep Learning using Keras☆12Jun 22, 2018Updated 7 years ago
- Repository for DISRPT2023 shared task☆17Jul 26, 2024Updated last year