Aleph-Alpha / AtMan
☆25Updated 10 months ago
Related projects: ⓘ
- ☆87Updated last week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆38Updated 3 weeks ago
- The history files when recording human interaction while solving ARC tasks☆91Updated this week
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆77Updated 9 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆89Updated last week
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆115Updated last year
- ☆75Updated 3 weeks ago
- ☆85Updated 7 months ago
- A framework for few-shot evaluation of autoregressive language models.☆13Updated 7 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆40Updated 9 months ago
- a unified framework for leveraging LLMs☆50Updated this week
- ☆39Updated 2 months ago
- Draw more samples☆159Updated 2 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- ☆15Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- ☆48Updated 11 months ago
- ☆58Updated last week
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- ☆92Updated last year
- ☆91Updated last month
- ☆89Updated 11 months ago
- The Foundation Model Transparency Index☆65Updated 3 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆41Updated last month
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆105Updated last year
- ☆10Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆81Updated 6 months ago
- git extension for {collaborative, communal, continual} model development☆202Updated 3 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆209Updated this week