LLM360 / Analysis360Links
Open Implementations of LLM Analyses
☆107Updated last year
Alternatives and similar repositories for Analysis360
Users that are interested in Analysis360 are comparing it to the libraries listed below
Sorting:
- Pre-training code for CrystalCoder 7B LLM☆56Updated last year
- Data preparation code for Amber 7B LLM☆94Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Evaluating LLMs with fewer examples☆169Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- ☆129Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆113Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆100Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- FuseAI Project☆88Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆118Updated 2 years ago
- ☆55Updated last year
- a curated list of the role of small models in the LLM era☆111Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆120Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆142Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆95Updated 10 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- ☆86Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆62Updated last year
- ☆75Updated last year
- ☆78Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year