[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for training_trajectory_analysis
Users that are interested in training_trajectory_analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- [ECCV 2022] "Improve Few-Shot Transfer Learning with Low-Rank Decompose and Align" by Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luo…☆13Jul 19, 2022Updated 3 years ago
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Functional Optimal Transport: Map Estimation and Domain Adaptation for Functional data☆27Jun 7, 2021Updated 4 years ago
- [EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…☆17May 7, 2024Updated last year
- Code accompanying our paper at AISTATS 2020☆21Jan 12, 2021Updated 5 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆31Jun 4, 2023Updated 2 years ago
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆24Mar 28, 2019Updated 7 years ago
- Code for Deep learning models for electrocardiograms are susceptible to adversarial attack☆24Feb 4, 2021Updated 5 years ago
- ☆68Aug 16, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- [EMNLP 2023] An Empirical Exploration of Cross-domain Alignment between Language and Electroencephalogram☆29Nov 9, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Web上に公開されている小説をスクレイピングして青空文庫形式のテキストにする☆19Feb 9, 2017Updated 9 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated 2 months ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- ☆10Aug 18, 2016Updated 9 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer (ICML 2022 Long Oral)☆26Sep 10, 2022Updated 3 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated last month
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆22Jul 19, 2023Updated 2 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I…☆33May 12, 2023Updated 2 years ago
- ☆16Apr 1, 2026Updated 2 weeks ago
- Code for the NIPS 2016 paper "Single-Image Depth Perception in the Wild"☆11Nov 17, 2017Updated 8 years ago
- A PyTorch implementation for the paper FedCon: A Contrastive Framework for Federated Semi-Supervised Learning.☆24May 18, 2022Updated 3 years ago
- A python implementation for computing the PoR metric for video summarization from "Performance over Random: A Robust Evaluation Protocol …☆10May 4, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- A Python script to convert vobsub subtitles into srt format using tesseract for ocr☆10Sep 28, 2014Updated 11 years ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆31Mar 28, 2024Updated 2 years ago
- Research Artifact For Our Submission To VLDB☆11Oct 27, 2021Updated 4 years ago
- Implementation of Neural Style Transfer on Video☆11Nov 6, 2018Updated 7 years ago
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- SemBleu: A Robust Metric for AMR Parsing Evaluation☆12Feb 22, 2021Updated 5 years ago