felixzli/synthetic_pretraining

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/felixzli/synthetic_pretraining)

felixzli / synthetic_pretraining

☆39

Alternatives and similar repositories for synthetic_pretraining

Users that are interested in synthetic_pretraining are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

oriyor / turning_tables
View on GitHub
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22Nov 2, 2021Updated 4 years ago
sail-sg / symbolic-instruction-tuning
View on GitHub
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Apr 18, 2023Updated 3 years ago
yszh8 / TriageSQL
View on GitHub
The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…
☆32Jul 5, 2021Updated 4 years ago
facebookresearch / mbr-exec
View on GitHub
code for "Natural Language to Code Translation with Execution"
☆41Nov 2, 2022Updated 3 years ago
petezh / OpenD5
View on GitHub
Tasks for describing differences between text distributions.
☆17Aug 9, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
mirror-data / mirror
View on GitHub
Mirror: Plug-and-Play Data Query, Summarization and Visualization with Natural Language Interface
☆44Apr 13, 2023Updated 3 years ago
taoyds / grappa
View on GitHub
☆31Sep 4, 2021Updated 4 years ago
nju-websoft / TSQA
View on GitHub
TSQA: Tabular Scenario Based Question Answering (AAAI 2021)
☆18Dec 17, 2020Updated 5 years ago
erosenfeld / disagree_discrep
View on GitHub
Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.
☆10Feb 27, 2024Updated 2 years ago
facebookresearch / bounding_data_reconstruction
View on GitHub
Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".
☆11Jun 16, 2023Updated 2 years ago
allenai / PathNet
View on GitHub
PathNet model for Multi-hop Reading Comprehension (https://arxiv.org/pdf/1811.01127.pdf)
☆22May 25, 2021Updated 5 years ago
wesg52 / universal-neurons
View on GitHub
Universal Neurons in GPT2 Language Models
☆30May 28, 2024Updated 2 years ago
qinlibo-hit / CI-ToD
View on GitHub
PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog…
☆28Oct 4, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JunShern / few-shot-adaptation
View on GitHub
Exploring Few-Shot Adaptation of Language Models with Tables
☆24Aug 22, 2022Updated 3 years ago
LZhengisme / self-infilling
View on GitHub
[ICML 2024] Self-Infilling Code Generation
☆18May 5, 2024Updated 2 years ago
dki-lab / Pangu
View on GitHub
Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
☆78May 17, 2025Updated last year
csinva / mdl-complexity
View on GitHub
MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".
☆18Jun 12, 2023Updated 2 years ago
SivilTaram / FollowUp
View on GitHub
public dataset for followup-query analysis, accepted by AAAI2019
☆15Aug 22, 2019Updated 6 years ago
js-d / sim_metric
View on GitHub
☆37Oct 3, 2023Updated 2 years ago
shiqichen17 / SPA
View on GitHub
Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
☆35Nov 1, 2025Updated 6 months ago
wrh14 / online_adaptation_to_label_distribution_shift
View on GitHub
Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".
☆16Feb 15, 2023Updated 3 years ago
ekinakyurek / google-research
View on GitHub
Google Research
☆47Oct 29, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
csarron / BTR
View on GitHub
☆16Mar 3, 2024Updated 2 years ago
czyssrs / Logic2Text
View on GitHub
Data and code for EMNLP 2020 paper "Logic2Text: High-Fidelity Natural Language Generation from Logical Forms"
☆71Mar 24, 2023Updated 3 years ago
uiuctml / fair-classification
View on GitHub
Post-processing for fair classification
☆16Jun 30, 2025Updated 10 months ago
YannDubs / Invariant-Self-Supervised-Learning
View on GitHub
Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"
☆41Nov 27, 2022Updated 3 years ago
ruiqi-zhong / DescribeDistributionalDifferences
View on GitHub
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆43Feb 24, 2023Updated 3 years ago
Ciaranshu / relogic
View on GitHub
Code for Findings of ACL 2021 paper: Logic-Consistency Text Generation from Semantic Parses
☆26Aug 3, 2021Updated 4 years ago
telepathylabsai / OpenDF
View on GitHub
Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow
☆24Mar 28, 2024Updated 2 years ago
wesg52 / llm-context-neurons
View on GitHub
Find context neurons in Pythia models.
☆13Jun 13, 2023Updated 2 years ago
violet-zct / group-conditional-DRO
View on GitHub
Group-conditional DRO to alleviate spurious correlations
☆15Jul 15, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
longxudou / HIT-SCIR-CoNLL2019
View on GitHub
"HIT-SCIR at MRP 2019: A Unified Pipeline for Meaning Representation Parsing via Efficient Training and Effective Encoding"-1st system in…
☆27Jul 9, 2020Updated 5 years ago
microsoft / SCoRE
View on GitHub
ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing
☆31Aug 30, 2021Updated 4 years ago
linyongver / ZIN_official
View on GitHub
This is the implementation for the NeurIPS 2022 paper: ZIN: When and How to Learn Invariance Without Environment Partition?
☆22Dec 3, 2022Updated 3 years ago
princeton-nlp / WhatICLLearns
View on GitHub
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21Jul 9, 2023Updated 2 years ago
andylolu2 / jax-vqvae-gpt
View on GitHub
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.
☆11Nov 23, 2023Updated 2 years ago
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
xnancy / russ
View on GitHub
☆16Apr 9, 2021Updated 5 years ago