[EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)
☆11Nov 15, 2023Updated 2 years ago
Alternatives and similar repositories for DialogueCoT
Users that are interested in DialogueCoT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Interview-based evaluation of LLMs☆26Jan 8, 2025Updated last year
- Keep Me Updated! Memory Management in Long-term Conversations (Findings of EMNLP 2022)☆33Dec 2, 2022Updated 3 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- [COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization☆25Mar 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆23Nov 19, 2025Updated 4 months ago
- ☆16Jul 20, 2023Updated 2 years ago
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 5 years ago
- Examples of using Galileo for better ML data quality!!☆13Feb 5, 2026Updated 2 months ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆19Mar 13, 2026Updated last month
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)☆11Nov 28, 2022Updated 3 years ago
- Esolang inspired by The Demon Girl Next Door(まちカドまぞく)☆12Apr 17, 2025Updated 11 months ago
- ☆19Oct 11, 2025Updated 6 months ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- 🔵 [App][Android] AppLock for Android☆12Aug 31, 2020Updated 5 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- Naver Boostcamp AI Tech Stage 3 : MRC (Machine Reading Comprehension)☆10Jun 10, 2021Updated 4 years ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆27Feb 17, 2026Updated last month
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- ☆13Jun 5, 2024Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- beko-translateは、Apple Silicon Mac向けのCLI翻訳ツールです。PDF見開き翻訳機能も同梱してあり原文・訳文を交互に表示できます。☆34Mar 25, 2026Updated 2 weeks ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Jan 12, 2024Updated 2 years ago
- ☆31Nov 23, 2022Updated 3 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- ☆10Nov 28, 2024Updated last year
- ☆10Nov 29, 2024Updated last year
- Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"☆11Dec 14, 2022Updated 3 years ago
- Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)☆13Dec 16, 2025Updated 3 months ago
- Repo for SPOLIN corpus and paper "Grounding Conversations with Improvised Dialogues" (ACL2020)☆14Feb 20, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [부스트캠프] 귀가노니 - 출퇴근길에 듣는 인공지능 뉴스 팟캐스트☆12Feb 28, 2022Updated 4 years ago
- Critique-out-Loud Reward Models☆74Oct 18, 2024Updated last year
- ☆11Nov 30, 2024Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Jan 21, 2024Updated 2 years ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆83May 7, 2024Updated last year
- Train and make Google chrome dinosaur game AI with tensorflow.☆14Jul 7, 2019Updated 6 years ago
- EMNLP 2024 Tutorial: https://sites.google.com/view/reasoning-with-explanations☆14Apr 15, 2025Updated 11 months ago