☆38Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for synthetic_pretraining
Users that are interested in synthetic_pretraining are comparing it to the libraries listed below
Sorting:
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆66Apr 18, 2023Updated 2 years ago
- init☆13Feb 3, 2021Updated 5 years ago
- Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".☆11Jun 16, 2023Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Post-processing for fair classification☆16Jun 30, 2025Updated 8 months ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆15Feb 15, 2023Updated 3 years ago
- public dataset for followup-query analysis, accepted by AAAI2019☆15Aug 22, 2019Updated 6 years ago
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- This project makes available the code and data from our NAACL paper: "Capturing Row and Column Semantics in Transformer Based Question An…☆55Sep 17, 2025Updated 5 months ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- [ICML 2024] Self-Infilling Code Generation☆18May 5, 2024Updated last year
- ☆38Oct 3, 2023Updated 2 years ago
- ☆31Sep 4, 2021Updated 4 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆36Jun 8, 2023Updated 2 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆14Jul 26, 2023Updated 2 years ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆42Nov 27, 2022Updated 3 years ago
- Mirror: Plug-and-Play Data Query, Summarization and Visualization with Natural Language Interface☆43Apr 13, 2023Updated 2 years ago
- ☆16Apr 9, 2021Updated 4 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆17Jul 29, 2021Updated 4 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- Data and code for EMNLP 2020 paper "Logic2Text: High-Fidelity Natural Language Generation from Logical Forms"☆71Mar 24, 2023Updated 2 years ago
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments☆78May 17, 2025Updated 9 months ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Jul 4, 2022Updated 3 years ago
- ☆39Aug 9, 2022Updated 3 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- ☆44Oct 30, 2025Updated 4 months ago
- ☆24Apr 28, 2022Updated 3 years ago
- ☆15Jun 5, 2023Updated 2 years ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- Google Research☆46Oct 29, 2022Updated 3 years ago
- Distilling Model Failures as Directions in Latent Space☆47Feb 8, 2023Updated 3 years ago
- Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow☆25Mar 28, 2024Updated last year
- ☆25Jun 22, 2023Updated 2 years ago
- ☆152Oct 12, 2022Updated 3 years ago