for-ai / llm-profiling-toolkit
☆14Updated 6 months ago
Alternatives and similar repositories for llm-profiling-toolkit:
Users that are interested in llm-profiling-toolkit are comparing it to the libraries listed below
- Experiments for efforts to train a new and improved t5☆77Updated 9 months ago
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- utilities for loading and running text embeddings with onnx☆42Updated 5 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 5 months ago
- A dataset of alignment research and code to reproduce it☆73Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆65Updated 2 years ago
- Code for the paper "Fishing for Magikarp"☆140Updated this week
- An introduction to LLM Sampling☆75Updated last month
- ☆48Updated last year
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- ☆37Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- gzip Predicts Data-dependent Scaling Laws☆33Updated 7 months ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- Textbook on reinforcement learning from human feedback☆112Updated this week
- Training code for Sparse Autoencoders on Embedding models☆35Updated last month
- Pre-train Static Word Embeddings☆34Updated this week
- minimal pytorch implementation of bm25 (with sparse tensors)☆97Updated 10 months ago
- Understanding how features learned by neural networks evolve throughout training☆32Updated 2 months ago
- ☆20Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- Code for Zero-Shot Tokenizer Transfer☆120Updated this week
- Sphynx Hallucination Induction☆51Updated 5 months ago
- Lightweight tools for quick and easy LLM demo's☆26Updated 3 months ago