neuralwork / arxiver
Codebase for the arxiver dataset
☆13Updated 2 months ago
Alternatives and similar repositories for arxiver:
Users that are interested in arxiver are comparing it to the libraries listed below
- ☆31Updated 7 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 5 months ago
- Implementation of the Mamba SSM with hf_integration.☆56Updated 5 months ago
- Code repository for the c-BTM paper☆105Updated last year
- A repository for research on medium sized language models.☆76Updated 8 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆55Updated 4 months ago
- Utilities for Training Very Large Models☆57Updated 4 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year
- PyTorch building blocks for OLMo☆49Updated this week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 10 months ago
- ☆48Updated 2 months ago
- ☆49Updated 10 months ago
- Data preparation code for Amber 7B LLM☆84Updated 8 months ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 4 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Aioli: A unified optimization framework for language model data mixing☆19Updated last week
- ☆60Updated last year
- MEXMA: Token-level objectives improve sentence representations☆37Updated 3 weeks ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- ☆60Updated 3 months ago
- ☆42Updated last week
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 4 months ago
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- ☆22Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- Train, tune, and infer Bamba model☆80Updated 2 weeks ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 2 months ago
- ☆57Updated 4 months ago