for-ai / llm-profiling-toolkit
☆16Updated 8 months ago
Alternatives and similar repositories for llm-profiling-toolkit:
Users that are interested in llm-profiling-toolkit are comparing it to the libraries listed below
- Pre-train Static Word Embeddings☆51Updated 3 weeks ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year
- ☆38Updated last month
- An introduction to LLM Sampling☆77Updated 3 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 9 months ago
- NLP with Rust for Python 🦀🐍☆61Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34Updated 10 months ago
- ☆48Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 7 months ago
- ☆43Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated this week
- QLoRA for Masked Language Modeling☆21Updated last year
- Simple GRPO scripts and configurations.☆59Updated last month
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Experiments for efforts to train a new and improved t5☆77Updated 11 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 5 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated 11 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 2 months ago
- ☆22Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 3 weeks ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆41Updated 4 months ago
- ☆124Updated last week
- Stream of my favorite papers and links☆41Updated last week