xiamengzhou / training_trajectory_analysisView external linksLinks
[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for training_trajectory_analysis
Users that are interested in training_trajectory_analysis are comparing it to the libraries listed below
Sorting:
- ☆10Jul 16, 2023Updated 2 years ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- [EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…☆18May 7, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Code accompanying our paper at AISTATS 2020☆21Jan 12, 2021Updated 5 years ago
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆24Mar 28, 2019Updated 6 years ago
- Functional Optimal Transport: Map Estimation and Domain Adaptation for Functional data☆27Jun 7, 2021Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆31Jun 4, 2023Updated 2 years ago
- Code for Deep learning models for electrocardiograms are susceptible to adversarial attack☆24Feb 4, 2021Updated 5 years ago
- ☆30Sep 28, 2023Updated 2 years ago
- ☆68Aug 16, 2024Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated last year
- Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I…☆33May 12, 2023Updated 2 years ago
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Language Model Baselines for PyTorch☆41Aug 18, 2020Updated 5 years ago
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆38Feb 27, 2024Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆38Jan 25, 2024Updated 2 years ago
- ☆10May 22, 2022Updated 3 years ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆10Oct 6, 2022Updated 3 years ago
- A transformer model to predict pathogenic mutations☆12Jun 25, 2025Updated 7 months ago
- Web上に公開されている小説をスクレイピングして青空文庫形式のテキストにする☆19Feb 9, 2017Updated 9 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- Alignment between clustered datasets via hierarchical Wasserstein distance☆38Sep 26, 2023Updated 2 years ago
- private 1inch☆14Apr 23, 2023Updated 2 years ago
- Base definition of rk-boot plugins. rk-boot is a library to start goLang microservice from YAML☆11Jul 15, 2024Updated last year
- An interpreter for concatenative combinators (i.e. Combinators as a functional language)☆10Nov 27, 2021Updated 4 years ago
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- ☆11Sep 10, 2023Updated 2 years ago
- This is the official code for CoRL 2022 "Robustness Certification of Visual Perception Models via Camera Motion Smoothing"☆11Apr 5, 2023Updated 2 years ago
- a jax benchmark for ad hoc teamwork☆17Updated this week
- Rejection-minimal, non-reversible, and continuous-time samplers☆12Nov 14, 2025Updated 3 months ago
- Research Artifact For Our Submission To VLDB☆10Oct 27, 2021Updated 4 years ago