Simple Model Similarities Analysis
☆21Feb 3, 2024Updated 2 years ago
Alternatives and similar repositories for model-similarity
Users that are interested in model-similarity are comparing it to the libraries listed below
Sorting:
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated last year
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 8 months ago
- ☆32Jan 1, 2024Updated 2 years ago
- ☆67Mar 4, 2024Updated 2 years ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Apr 18, 2024Updated last year
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- ☆68May 26, 2024Updated last year
- ☆142Aug 20, 2025Updated 7 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Jul 6, 2023Updated 2 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated last year
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- ☆166Aug 8, 2025Updated 7 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 weeks ago
- All the world is a play, we are but actors in it.☆50Jul 21, 2025Updated 8 months ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Vite + Mantine + Vanilla extract template☆12Mar 14, 2026Updated last week
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- ☆14Mar 8, 2025Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- ☆10Nov 8, 2019Updated 6 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Jul 9, 2024Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated last year
- Download TikTok videos online with TikTok Video Downloader. Completely free.☆13Sep 17, 2025Updated 6 months ago
- ☆10Oct 2, 2024Updated last year
- ☆10Nov 14, 2022Updated 3 years ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆54Updated this week
- ☆138Aug 19, 2024Updated last year
- A simple web-app for generating glassmorphism UI effect!☆12Aug 5, 2023Updated 2 years ago
- Eval LLMs☆11May 12, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- The official implementation of Bi-Mamba☆14Oct 22, 2025Updated 4 months ago
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven☆12Dec 16, 2018Updated 7 years ago