joaopauloschuler / less-parameters-llm
This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report
☆29Updated 3 months ago
Alternatives and similar repositories for less-parameters-llm
Users who are interested in less-parameters-llm are comparing it to the libraries listed below:
- Set of scripts to finetune LLMs☆37Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆110Updated 2 weeks ago
- ☆66Updated last year
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- ☆43Updated 3 months ago
- Fine tune Gemma 3 on an object detection task☆46Updated this week
- Collection of autoregressive model implementations☆85Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆87Updated last month
- The first dense retrieval model that can be prompted like an LM☆73Updated 3 weeks ago
- ☆11Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- An introduction to LLM Sampling☆78Updated 5 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 9 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆33Updated 6 months ago
- ☆45Updated 4 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 6 months ago
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆56Updated 3 weeks ago
- ☆23Updated last year
- Pre-train Static Word Embeddings☆76Updated this week
- ☆123Updated 7 months ago
- ☆130Updated 9 months ago
- Train your own SOTA deductive reasoning model☆93Updated 3 months ago
- I learn about and explain quantization☆26Updated last year
- Torch-activation, a library of activation functions for PyTorch☆24Updated last month
- 🤝 Trade any tensors over the network☆30Updated last year