Cohere-Labs-Community / llm-profiling-toolkit
☆20 · Updated last year
Alternatives and similar repositories for llm-profiling-toolkit
Users who are interested in llm-profiling-toolkit are comparing it to the libraries listed below.
- An introduction to LLM Sampling ☆79 · Updated last year
- NLP with Rust for Python 🦀🐍 ☆70 · Updated 8 months ago
- lossily compress representation vectors using product quantization ☆59 · Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 · Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆41 · Updated last year
- Documentation effort for the BookCorpus dataset ☆34 · Updated 4 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 · Updated 2 years ago
- Code for the paper "Fishing for Magikarp" ☆179 · Updated 8 months ago
- utilities for loading and running text embeddings with onnx ☆45 · Updated 5 months ago
- ☆45 · Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 · Updated 2 years ago
- Sphynx Hallucination Induction ☆52 · Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆32 · Updated 3 months ago
- Pre-train Static Word Embeddings ☆94 · Updated 4 months ago
- A library for squeakily cleaning and filtering language datasets. ☆49 · Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs ☆119 · Updated last year
- Efficiently computing & storing token n-grams from large corpora ☆26 · Updated last year
- Experiments for efforts to train a new and improved t5 ☆76 · Updated last year
- ☆53 · Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 · Updated 3 weeks ago
- Simple GRPO scripts and configurations. ☆59 · Updated 11 months ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Updated 11 months ago
- Python library to use Pleias-RAG models ☆68 · Updated 9 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 · Updated last year
- ☆23 · Updated 2 years ago
- Your buddy in the (L)LM space. ☆64 · Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 · Updated last year
- ☆86 · Updated 2 years ago
- A demonstration of how a toy (but usable!) semantic search engine can be quickly built using Cohere's platform. ☆117 · Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated last year