marin-community / marin
☆189 · Updated this week
Alternatives and similar repositories for marin
Users who are interested in marin are comparing it to the libraries listed below.
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆132 · Updated last month
- A MAD laboratory to improve AI architecture designs 🧪 ☆118 · Updated 5 months ago
- Understand and test language model architectures on synthetic tasks. ☆204 · Updated this week
- seqax = sequence modeling + JAX ☆155 · Updated 2 months ago
- ☆78 · Updated 11 months ago
- ☆133 · Updated 2 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆122 · Updated this week
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch ☆170 · Updated 5 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆73 · Updated 7 months ago
- PyTorch building blocks for the OLMo ecosystem ☆227 · Updated this week
- EvaByte: Efficient Byte-level Language Models at Scale ☆101 · Updated last month
- An extension of the nanoGPT repository for training small MOE models. ☆147 · Updated 3 months ago
- ☆269 · Updated 10 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆235 · Updated this week
- Minimal but scalable implementation of large language models in JAX ☆35 · Updated 7 months ago
- Language models scale reliably with over-training and on downstream tasks ☆97 · Updated last year
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ☆68 · Updated 10 months ago
- ☆94 · Updated 8 months ago
- ☆191 · Updated 3 months ago
- WIP ☆93 · Updated 9 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆237 · Updated 4 months ago
- A simple library for scaling up JAX programs ☆138 · Updated 7 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 · Updated 9 months ago
- nanoGPT-like codebase for LLM training ☆94 · Updated 3 weeks ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆127 · Updated last year
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆173 · Updated this week
- Experiments for efforts to train a new and improved t5 ☆77 · Updated last year
- Scalable toolkit for efficient model reinforcement ☆399 · Updated this week
- supporting pytorch FSDP for optimizers ☆80 · Updated 6 months ago
- Extract full next-token probabilities via language model APIs ☆248 · Updated last year