[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for training_trajectory_analysis
Users that are interested in training_trajectory_analysis are comparing it to the libraries listed below.
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- ☆14Oct 28, 2023Updated 2 years ago
- [ECCV 2022] "Improve Few-Shot Transfer Learning with Low-Rank Decompose and Align" by Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luo…☆13Jul 19, 2022Updated 3 years ago
- ☆10Jul 16, 2023Updated 2 years ago
- Generalized Data Augmentation for Low-Resource Translation☆12Jul 30, 2019Updated 6 years ago
- Performance Prediction for NLP Tasks☆17May 5, 2020Updated 5 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆31Jun 4, 2023Updated 2 years ago
- [EMNLP 2023] An Empirical Exploration of Cross-domain Alignment between Language and Electroencephalogram☆29Nov 9, 2023Updated 2 years ago
- Scrape novels published on the web and convert them into Aozora Bunko-format text☆19Feb 9, 2017Updated 9 years ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Jul 13, 2017Updated 8 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆198May 9, 2023Updated 2 years ago
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- ☆10Aug 18, 2016Updated 9 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆22Jul 19, 2023Updated 2 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- ☆16Mar 13, 2026Updated 2 weeks ago
- ☆19Nov 11, 2023Updated 2 years ago
- Code for the NIPS 2016 paper "Single-Image Depth Perception in the Wild"☆11Nov 17, 2017Updated 8 years ago
- A PyTorch implementation for the paper FedCon: A Contrastive Framework for Federated Semi-Supervised Learning.☆24May 18, 2022Updated 3 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Jan 15, 2022Updated 4 years ago
- ☆30Sep 28, 2023Updated 2 years ago
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- Alignment between clustered datasets via hierarchical Wasserstein distance☆38Sep 26, 2023Updated 2 years ago
- Research Artifact For Our Submission To VLDB☆10Oct 27, 2021Updated 4 years ago
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- SemBleu: A Robust Metric for AMR Parsing Evaluation☆12Feb 22, 2021Updated 5 years ago
- ☆10Dec 8, 2017Updated 8 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago