allenai / awesome-open-source-lms
Friends of OLMo and their links.
☆266 Updated 2 months ago
Alternatives and similar repositories for awesome-open-source-lms:
Users who are interested in awesome-open-source-lms are comparing it to the repositories listed below.
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆163 Updated 2 weeks ago
- Automatic evals for LLMs ☆304 Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆421 Updated 5 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆274 Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆301 Updated 2 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆210 Updated last week
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆107 Updated last week
- Solving data for LLMs - Create quality synthetic datasets! ☆145 Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆167 Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym ☆369 Updated this week
- awesome synthetic (text) datasets ☆263 Updated 4 months ago
- A compact LLM pretrained in 9 days by using high quality data ☆296 Updated 3 months ago
- Multimodal language model benchmark, featuring challenging examples ☆158 Updated 2 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for an outage of the service LLM. ☆301 Updated last week
- A simple unified framework for evaluating LLMs ☆199 Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆294 Updated 4 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆218 Updated last month
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆189 Updated this week
- PyTorch implementation of models from the Zamba2 series. ☆177 Updated last month
- AWM: Agent Workflow Memory ☆245 Updated last month