allenai / awesome-open-source-lmsView external linksLinks
Friends of OLMo and their links.
☆356Sep 15, 2025Updated 5 months ago
Alternatives and similar repositories for awesome-open-source-lms
Users that are interested in awesome-open-source-lms are comparing it to the libraries listed below
Sorting:
- [T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier☆54Dec 30, 2024Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Wonderful Matrices to Build Small Language Models☆44Feb 15, 2025Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Oct 15, 2025Updated 4 months ago
- AllenAI's post-training codebase☆3,573Updated this week
- Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a sing…☆182Jan 5, 2026Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆627Jan 29, 2026Updated 2 weeks ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆21Oct 28, 2024Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Dec 6, 2025Updated 2 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Aug 7, 2025Updated 6 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆17Updated this week
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆196Apr 6, 2024Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated this week
- ☆31Nov 18, 2025Updated 2 months ago
- ☆42Sep 15, 2025Updated 5 months ago
- PyTorch building blocks for the OLMo ecosystem☆801Updated this week
- A bibliography and survey of the papers surrounding o1☆1,212Nov 16, 2024Updated last year
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM☆90May 8, 2025Updated 9 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,754Jul 18, 2025Updated 6 months ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆213Jan 6, 2025Updated last year
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing☆44Sep 17, 2024Updated last year
- A playbook for effectively prompting post-trained LLMs☆898Jan 21, 2025Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆144Oct 13, 2025Updated 4 months ago
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆60Jan 17, 2026Updated 3 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆150Oct 2, 2025Updated 4 months ago
- ☆17Apr 9, 2025Updated 10 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆45Dec 25, 2025Updated last month