LLM360 / Analysis360
Open Implementations of LLM Analyses
☆102Updated 7 months ago
Alternatives and similar repositories for Analysis360:
Users that are interested in Analysis360 are comparing it to the libraries listed below
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- Data preparation code for Amber 7B LLM☆89Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆107Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆221Updated 6 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆112Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆120Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆74Updated last month
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- a curated list of the role of small models in the LLM era☆99Updated 7 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆104Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 4 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆108Updated 6 months ago
- Evaluating LLMs with fewer examples☆151Updated last year
- Reformatted Alignment☆115Updated 7 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- ☆97Updated 10 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated 10 months ago
- ☆227Updated 8 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆139Updated 6 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆88Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆134Updated 5 months ago
- FuseAI Project☆85Updated 3 months ago
- Pre-training code for Amber 7B LLM☆166Updated 11 months ago
- ☆150Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆121Updated last year