fblgit / model-similarity
Simple Model Similarities Analysis
☆21Updated 11 months ago
Alternatives and similar repositories for model-similarity:
Users that are interested in model-similarity are comparing it to the libraries listed below
- ☆46Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 11 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆115Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆118Updated 2 months ago
- ☆108Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- ☆31Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- The first dense retrieval model that can be prompted like an LM☆65Updated 4 months ago
- ☆74Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- ☆47Updated 4 months ago
- ☆24Updated last year
- ☆40Updated 8 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- ☆74Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆22Updated last month
- ☆115Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 2 months ago
- ☆65Updated 7 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆50Updated 3 months ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago