WadeYin9712/Dynosaur

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WadeYin9712/Dynosaur)

WadeYin9712 / Dynosaur

Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)

☆63

Alternatives and similar repositories for Dynosaur

Users that are interested in Dynosaur are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WadeYin9712 / GeoMLAMA
View on GitHub
☆15Oct 24, 2022Updated 3 years ago
xxxiaol / counterfactual-recipe-generation
View on GitHub
Source code and data for Counterfactual Recipe Generation: Exploring Models’ Compositional Generalization Ability in a Realistic Scenario…
☆15Oct 25, 2022Updated 3 years ago
WadeYin9712 / GD-VCR
View on GitHub
Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).
☆29Sep 4, 2021Updated 4 years ago
luciusssss / ZhuangBench
View on GitHub
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
☆30Jan 6, 2026Updated 6 months ago
WadeYin9712 / UI-Simulator
View on GitHub
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
☆21Oct 17, 2025Updated 9 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
xxxiaol / spatial-commonsense
View on GitHub
Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).
☆20Oct 10, 2022Updated 3 years ago
xxxiaol / magic-if
View on GitHub
Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…
☆31Jun 4, 2023Updated 3 years ago
WilliamZR / ProTrix
View on GitHub
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
☆17Nov 15, 2024Updated last year
maszhongming / ParaKnowTransfer
View on GitHub
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆33May 9, 2024Updated 2 years ago
ozyyshr / ShareGPT_investigation
View on GitHub
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))
☆13Dec 21, 2023Updated 2 years ago
xxxiaol / QRData
View on GitHub
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
☆48Feb 18, 2025Updated last year
Hritikbansal / entigen_emnlp
View on GitHub
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
☆13Aug 16, 2023Updated 2 years ago
yzjiao / RolePred
View on GitHub
Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction"
☆19Nov 5, 2022Updated 3 years ago
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
facebookresearch / llm-cross-capabilities
View on GitHub
Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"
☆43Oct 1, 2024Updated last year
luciusssss / mc2_corpus
View on GitHub
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
☆37Jan 17, 2026Updated 6 months ago
dreasysnail / CoCon
View on GitHub
Consistent dialogue generation
☆16Oct 26, 2022Updated 3 years ago
codogogo / towerparse
View on GitHub
Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection
☆15Aug 20, 2021Updated 4 years ago
kobayashikanna01 / Chain-of-Discussion
View on GitHub
☆11May 28, 2024Updated 2 years ago
cchen23 / ctp
View on GitHub
Inducing Taxonomic Knowledge from Pretrained Transformers
☆14Jul 30, 2023Updated 2 years ago
ChenxinAn-fdu / CoLo
View on GitHub
[COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"
☆22Oct 21, 2022Updated 3 years ago
yumeng5 / FewGen
View on GitHub
[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
☆44May 10, 2023Updated 3 years ago
yzhan238 / SeedTopicMine
View on GitHub
The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.
☆14May 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dki-lab / ArcaneQA
View on GitHub
☆23Aug 14, 2023Updated 2 years ago
maszhongming / UniEval
View on GitHub
Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation
☆217Feb 10, 2024Updated 2 years ago
Shark-NLP / CoNT
View on GitHub
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆152May 10, 2023Updated 3 years ago
jiacheng-ye / ZeroGen
View on GitHub
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆47Feb 18, 2022Updated 4 years ago
artpli / CodeIE
View on GitHub
[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
☆42Dec 14, 2025Updated 7 months ago
nlpxucan / evol-instruct
View on GitHub
☆287Apr 25, 2023Updated 3 years ago
qiujiali / lattice_rnn
View on GitHub
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
☆15Aug 28, 2020Updated 5 years ago
allenai / data-efficient-finetuning
View on GitHub
Code for paper 'Data-Efficient FineTuning'
☆28May 24, 2023Updated 3 years ago
PlusLabNLP / PredictiveEngagement
View on GitHub
Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
☆16Jun 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PlusLabNLP / EventPlus
View on GitHub
[NAACL'21 Demo] EventPlus: a temporal event understanding pipeline that integrates various state-of-the-art event understanding component…
☆28Mar 7, 2023Updated 3 years ago
tanyuqian / ctc-gen-eval
View on GitHub
EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation
☆97Mar 20, 2023Updated 3 years ago
MikeWangWZHL / Paxion
View on GitHub
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆38May 23, 2023Updated 3 years ago
allenai / numglue
View on GitHub
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
☆20May 10, 2022Updated 4 years ago
yihedeng9 / rlhf-summary-notes
View on GitHub
A brief and partial summary of RLHF algorithms.
☆152Mar 4, 2025Updated last year
nickrosh / evol-teacher
View on GitHub
Open Source WizardCoder Dataset
☆166Jul 12, 2023Updated 3 years ago
vevake / DomainAware_DST
View on GitHub
Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"
☆10Oct 5, 2020Updated 5 years ago