leloykun / mmsgLinks
Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.
☆27Updated 7 months ago
Alternatives and similar repositories for mmsg
Users that are interested in mmsg are comparing it to the libraries listed below
Sorting:
- Aioli: A unified optimization framework for language model data mixing☆25Updated 4 months ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆27Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- ☆25Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 8 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 months ago
- ☆79Updated 9 months ago
- ☆15Updated 6 months ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 8 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- Entailment self-training☆25Updated 2 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆10Updated last year
- A repository for research on medium sized language models.☆76Updated last year
- ☆31Updated last year
- Training code for Sparse Autoencoders on Embedding models☆38Updated 3 months ago
- Fork of Flame repo for training of some new stuff in development☆13Updated this week
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- ☆34Updated 11 months ago
- ☆21Updated 6 months ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 8 months ago
- Understanding how features learned by neural networks evolve throughout training☆34Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 4 months ago