srush / LLM-TalkLinks
☆51Updated last year
Alternatives and similar repositories for LLM-Talk
Users that are interested in LLM-Talk are comparing it to the libraries listed below
Sorting:
- ☆38Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last month
- Exploration of automated dataset selection approaches at large scales.☆45Updated 3 months ago
- ☆29Updated 11 months ago
- ☆74Updated last year
- A toolkit for scaling law research ⚖☆49Updated 4 months ago
- Using FlexAttention to compute attention with different masking patterns☆44Updated 9 months ago
- ☆44Updated 10 months ago
- ☆64Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆44Updated 11 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 10 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆32Updated 3 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆76Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- ☆20Updated last year
- Long Context Extension and Generalization in LLMs☆57Updated 9 months ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆21Updated 2 weeks ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆22Updated 10 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆55Updated last month
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- ☆37Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 9 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated 10 months ago
- ☆83Updated 5 months ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆74Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆24Updated last year
- ☆68Updated 10 months ago