LAION-AI/Conditional-Pretraining-of-Large-Language-Models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LAION-AI/Conditional-Pretraining-of-Large-Language-Models)

LAION-AI / Conditional-Pretraining-of-Large-Language-Models

☆37

Alternatives and similar repositories for Conditional-Pretraining-of-Large-Language-Models

Users that are interested in Conditional-Pretraining-of-Large-Language-Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

donglixp / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆19Jan 3, 2023Updated 3 years ago
LAION-AI / General-GPT
View on GitHub
☆65Oct 4, 2023Updated 2 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
davidbrandfonbrener / color-filter-olmo
View on GitHub
☆13Dec 12, 2025Updated 7 months ago
HomoScriptor-Project / HomoScriptor
View on GitHub
Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.
☆18Oct 14, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
gyhdog99 / RACRO2
View on GitHub
Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)
☆19Jul 1, 2025Updated last year
huggingface / peft-pytorch-conference
View on GitHub
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆15Oct 16, 2023Updated 2 years ago
luyang-NWPU / HGA-STR
View on GitHub
It's the code for <A holistic representation guided attention network for scene text recognition>Neurocomputing 2020
☆17Dec 1, 2020Updated 5 years ago
fra31 / rlhf-trojan-competition-submission
View on GitHub
☆19Feb 25, 2024Updated 2 years ago
allenai / signal-and-noise
View on GitHub
Measuring the Signal to Noise Ratio in Language Model Evaluation
☆31Aug 19, 2025Updated 11 months ago
OpenGVLab / Official-ConvMAE-Det
View on GitHub
☆18Aug 23, 2022Updated 3 years ago
ccx1997 / crnn_ctc_pytorch1.0
View on GitHub
CRNN_CTC_PyTorch
☆10Oct 17, 2019Updated 6 years ago
crypdick / timm-lr-scheduler-explorer
View on GitHub
A dashboard for exploring timm learning rate schedulers
☆20Nov 22, 2024Updated last year
gyhandy / Channel-wise-Lightweight-Reprogramming
View on GitHub
[ICCV 2023] CLR: Channel-wise Lightweight Reprogramming for Continual Learning
☆32Jun 7, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rwightman / imagenet-12k
View on GitHub
ImageNet-12k subset of ImageNet-21k (fall11)
☆23Jun 13, 2023Updated 3 years ago
TrentBrick / RewardConditionedUDRL
View on GitHub
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆19Mar 10, 2021Updated 5 years ago
LAION-AI / laion-dreams
View on GitHub
Aim for the moon. If you miss, you may hit a star.
☆168Feb 14, 2023Updated 3 years ago
titu1994 / simple_diffusion
View on GitHub
Simple notebooks to learn diffusion models on toy datasets
☆17Feb 9, 2023Updated 3 years ago
tomekkorbak / pretraining-with-human-feedback
View on GitHub
Code accompanying the paper Pretraining Language Models with Human Preferences
☆182Feb 13, 2024Updated 2 years ago
LAION-AI / Open-GIA
View on GitHub
O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …
☆87Feb 19, 2023Updated 3 years ago
microsoft / ExtreMA
View on GitHub
A self-supervised learning approach based on extremely large masking
☆31Dec 19, 2022Updated 3 years ago
mlfoundations / open_lm
View on GitHub
A repository for research on medium sized language models.
☆537Jun 6, 2025Updated last year
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
vigilant-umbrella / wikiHowUnofficialAPI
View on GitHub
API to extract data from wikiHow
☆18Jul 10, 2021Updated 5 years ago
google-deepmind / codesembench
View on GitHub
☆16Mar 22, 2024Updated 2 years ago
facebookresearch / r-mae
View on GitHub
PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411
☆112Jun 9, 2023Updated 3 years ago
linfeng93 / Large-UniDet
View on GitHub
A practice for million-scale multi-domain universal object detection
☆28Jun 13, 2024Updated 2 years ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
amazon-science / bigdetection
View on GitHub
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
☆399Oct 23, 2024Updated last year
facebookresearch / agenthive
View on GitHub
AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.
☆36Jan 12, 2024Updated 2 years ago
RangiLyu / llama.mmengine
View on GitHub
Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!
☆40Apr 2, 2023Updated 3 years ago
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
baaivision / CapsFusion
View on GitHub
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆215Feb 27, 2024Updated 2 years ago
facebookresearch / EgoObjects
View on GitHub
[ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
☆86Oct 6, 2023Updated 2 years ago
facebookresearch / adaptive_scheduling
View on GitHub
Experimental scripts for researching data adaptive learning rate scheduling.
☆23Oct 18, 2023Updated 2 years ago
lbwbowenLi / Optical-Communication
View on GitHub
Former Research simulations and results
☆10Jul 24, 2017Updated 9 years ago
thu-coai / CritiqueLLM
View on GitHub
☆147Jul 1, 2024Updated 2 years ago
jedibobo / python-random-car-plate-generator
View on GitHub
可以随机生成制定数量的车牌号，因为用到停车场的虚假数据生成，所以地区集中在一个地方。支持各类车辆的生成，只需在注释的地方修改即可。
☆10May 30, 2021Updated 5 years ago
OpenGVLab / Awesome-LLM4Tool
View on GitHub
A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools
☆68Aug 22, 2023Updated 2 years ago