for-ai / llm-profiling-toolkit

☆16

Alternatives and similar repositories for llm-profiling-toolkit:

Users who are interested in llm-profiling-toolkit are comparing it to the libraries listed below.

MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆51 Updated 3 weeks ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆46 Updated last year
stephantul / unitoken
Tokenization across languages. Useful as preprocessing for subword tokenization.
☆22 Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆31 Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆38 Updated last month
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆77 Updated 3 months ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34 Updated last year
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆57 Updated 9 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆61 Updated 9 months ago
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆100 Updated last year
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34 Updated 10 months ago
teknium1 / transformers-gptq-quant
☆48 Updated last year
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44 Updated 7 months ago
chandar-lab / NeoBERT
☆43 Updated last month
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32 Updated this week
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆21 Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated last month
jackbandy / bookcorpus-datasheet
Documentation effort for the BookCorpus dataset
☆34 Updated 3 years ago
CarperAI / decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆25 Updated last year
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆77 Updated 11 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆76 Updated 5 months ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆26 Updated 11 months ago
UKPLab / on-emergence
Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"
☆33 Updated 2 months ago
CarperAI / treasure_trove
☆22 Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆59 Updated last year
huggingface / disaggregators
🤗 Disaggregators: Curated data labelers for in-depth analysis.
☆65 Updated 2 years ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆91 Updated 3 weeks ago
aryamanarora / causalgym
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆41 Updated 4 months ago
google-deepmind / mishax
☆124 Updated last week
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆41 Updated last week