swiss-ai / pretrain-data
Pretraining data reconstruction scripts for Apertus
☆87Updated 3 weeks ago
Alternatives and similar repositories for pretrain-data
Users who are interested in pretrain-data compare it to the repositories listed below
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆170Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago
- ☆67Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 5 months ago
- ☆49Updated 7 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆66Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 7 months ago
- Let's build better datasets, together!☆263Updated 9 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆177Updated last year
- ☆63Updated last year
- Chat Markup Language conversation library☆55Updated last year
- ☆135Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆109Updated 5 months ago
- ☆42Updated 2 weeks ago
- Lightweight tools for quick and easy LLM demos☆28Updated last year
- Code for ExploreTom☆86Updated 3 months ago
- ☆58Updated 4 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆116Updated last month
- Python library to use Pleias-RAG models☆62Updated 4 months ago
- Let's create synthetic textbooks together :)☆75Updated last year
- Aana SDK is a powerful framework for building AI-enabled multimodal applications.☆52Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆94Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆65Updated 10 months ago
- An introduction to LLM Sampling☆79Updated 9 months ago
- ☆124Updated 10 months ago
- ☆194Updated 2 months ago
- PyTorch implementation of models from the Zamba2 series.☆185Updated 8 months ago
- ☆119Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆40Updated last year
- Train, tune, and infer Bamba model☆132Updated 3 months ago