apoorvkh / academic-pretrainingView external linksLinks
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆150Oct 2, 2025Updated 4 months ago
Alternatives and similar repositories for academic-pretraining
Users that are interested in academic-pretraining are comparing it to the libraries listed below
Sorting:
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- ☆13Jun 29, 2024Updated last year
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ☆18May 19, 2023Updated 2 years ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated 10 months ago
- ☆291Jul 15, 2024Updated last year
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- ☆22Jan 5, 2025Updated last year
- ☆20Nov 4, 2025Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆187Jan 19, 2026Updated 3 weeks ago
- Chat Markup Language conversation library☆55Jan 3, 2024Updated 2 years ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- ☆39Apr 27, 2024Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Aug 30, 2025Updated 5 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆50Jan 24, 2025Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Feb 19, 2019Updated 6 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 2 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆89Oct 30, 2024Updated last year
- A bibliography and survey of the papers surrounding o1☆1,212Nov 16, 2024Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- Code for "Can We Characterize Tasks Without Labels or Features?" (CVPR 2021)☆11Aug 31, 2021Updated 4 years ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- A State-Space Model with Rational Transfer Function Representation.☆83May 17, 2024Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆316Dec 20, 2023Updated 2 years ago
- ☆250Dec 2, 2024Updated last year
- A Flexible Toolkit for Dense Retrieval☆43Nov 12, 2025Updated 3 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆693Jan 26, 2026Updated 2 weeks ago
- ☆50Oct 29, 2023Updated 2 years ago
- My fork os allen AI's OLMo for educational purposes.☆28Dec 5, 2024Updated last year
- moodist☆24Jan 6, 2026Updated last month
- Official implementation of Data Contamination Can Cross Language Barriers☆12Sep 11, 2024Updated last year
- Official Implementation of K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction.☆18Jul 8, 2025Updated 7 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- Learning to compose soft prompts for compositional zero-shot learning.☆93Sep 13, 2025Updated 5 months ago