[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for training_trajectory_analysis
Users that are interested in training_trajectory_analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- ☆14Oct 28, 2023Updated 2 years ago
- [ECCV 2022] "Improve Few-Shot Transfer Learning with Low-Rank Decompose and Align" by Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luo…☆13Jul 19, 2022Updated 3 years ago
- ☆10Jul 16, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Functional Optimal Transport: Map Estimation and Domain Adaptation for Functional data☆27Jun 7, 2021Updated 4 years ago
- [EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…☆17May 7, 2024Updated 2 years ago
- Code accompanying our paper at AISTATS 2020☆21Jan 12, 2021Updated 5 years ago
- ☆68Aug 16, 2024Updated last year
- ☆26Mar 4, 2022Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Web上に公開されている小説をスクレイピングして青空文庫形式のテキストにする☆19Feb 9, 2017Updated 9 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- ECLARE: Extreme Classification with Label Graph Correlations☆41Mar 24, 2022Updated 4 years ago
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- ☆10Aug 18, 2016Updated 9 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer (ICML 2022 Long Oral)☆27Sep 10, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆27Mar 2, 2026Updated 2 months ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆22Jul 19, 2023Updated 2 years ago
- Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I…☆34May 12, 2023Updated 2 years ago
- ☆16Apr 30, 2026Updated last week
- A python implementation for computing the PoR metric for video summarization from "Performance over Random: A Robust Evaluation Protocol …☆10May 4, 2022Updated 4 years ago
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- Alignment between clustered datasets via hierarchical Wasserstein distance☆38Sep 26, 2023Updated 2 years ago
- A Python script to convert vobsub subtitles into srt format using tesseract for ocr☆10Sep 28, 2014Updated 11 years ago
- Codes for Merging Large Language Models☆36Aug 7, 2024Updated last year
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago