neuralwork / arxiver
Codebase for the arxiver dataset
☆14Updated 4 months ago
Alternatives and similar repositories for arxiver:
Users that are interested in arxiver are comparing it to the libraries listed below
- ☆61Updated last year
- Supercharge huggingface transformers with model parallelism.☆76Updated 6 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- ☆63Updated 7 months ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- ☆22Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- ☆49Updated last year
- ☆20Updated 10 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆33Updated last month
- ☆37Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆29Updated last week
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆43Updated 7 months ago
- Utilities for Training Very Large Models☆58Updated 7 months ago
- ☆19Updated 2 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- ☆62Updated 3 weeks ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆60Updated 7 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- ☆48Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆58Updated 9 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆83Updated 4 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 3 weeks ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year