fblgit / model-similarity
Simple Model Similarities Analysis
☆21 Updated last year
Alternatives and similar repositories for model-similarity:
Users who are interested in model-similarity are comparing it to the repositories listed below.
- ☆48 Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- ☆31 Updated 8 months ago
- ☆74 Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆115 Updated last year
- ☆74 Updated last year
- ☆20 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 5 months ago
- A repository for research on medium sized language models. ☆76 Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆130 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 7 months ago
- ☆113 Updated 4 months ago
- ☆24 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on extends the model's context limit ☆63 Updated last year
- ☆47 Updated 5 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆57 Updated 11 months ago
- Code repository for the c-BTM paper ☆105 Updated last year
- Code for NeurIPS LLM Efficiency Challenge ☆55 Updated 10 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆35 Updated 9 months ago
- ☆53 Updated 8 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆67 Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆64 Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated 11 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆45 Updated last year
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆76 Updated last year