Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19Jan 27, 2023Updated 3 years ago
Alternatives and similar repositories for MomentumDecoding
Users who are interested in MomentumDecoding are comparing it to the libraries listed below.
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated 2 years ago
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)☆10Jun 13, 2019Updated 7 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Findings of ACL 2021☆24May 8, 2021Updated 5 years ago
- ☆23Nov 6, 2022Updated 3 years ago
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago
- ☆17Feb 20, 2023Updated 3 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆478Mar 7, 2024Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- [CVPR 2023] An official Pytorch implementation of "Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers".☆45Dec 21, 2024Updated last year
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- Code for AAAI2020 paper "Graph Transformer for Graph-to-Sequence Learning"☆190Jul 25, 2024Updated last year
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆34Jul 18, 2023Updated 2 years ago
- Official code for the paper "Provable Compositional Generalization for Object-Centric Learning" (ICLR 2024, oral)☆16Aug 26, 2024Updated last year
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild☆47Sep 19, 2023Updated 2 years ago
- Code used to create the Linked WikiText-2 dataset☆16May 22, 2023Updated 3 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- ☆97Aug 6, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- Ask to Know More: Counterfactual Explanations for Fake Claims source code☆11Nov 22, 2022Updated 3 years ago
- ☆75Sep 1, 2022Updated 3 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆29Jul 14, 2023Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆193Jun 14, 2023Updated 2 years ago
- ☆13Jun 21, 2021Updated 4 years ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 4 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 4 years ago
- The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation☆204Sep 3, 2023Updated 2 years ago
- [IEEE VL/HCC'25]Frontend Diffusion is an end-to-end LLM-powered tool that generates high-quality websites from user sketches.☆19Oct 10, 2025Updated 8 months ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Oct 24, 2023Updated 2 years ago
- Paper collections of retrieval-based (augmented) language model.☆232May 24, 2024Updated 2 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆54Sep 3, 2024Updated last year
- AMR-Visualization Tools, show AMR graph structure☆12Jul 29, 2019Updated 6 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆59Feb 28, 2025Updated last year