Cohere-Labs-Community / llm-profiling-toolkitLinks
☆20Updated last year
Alternatives and similar repositories for llm-profiling-toolkit
Users that are interested in llm-profiling-toolkit are comparing it to the libraries listed below
Sorting:
- lossily compress representation vectors using product quantization☆59Updated 3 weeks ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- Code for the paper "Fishing for Magikarp"☆174Updated 6 months ago
- A library for squeakily cleaning and filtering language datasets.☆48Updated 2 years ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆49Updated 4 months ago
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 9 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆64Updated last month
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆27Updated 2 years ago
- Sphynx Hallucination Induction☆53Updated 9 months ago
- Pre-train Static Word Embeddings☆90Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated last month
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Latent Large Language Models☆19Updated last year
- ☆51Updated 9 months ago
- ☆45Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆66Updated 6 months ago
- Experiments for efforts to train a new and improved t5☆75Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 3 weeks ago
- utilities for loading and running text embeddings with onnx☆44Updated 3 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last year
- ☆58Updated this week
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated 2 years ago
- Model implementation for the contextual embeddings project☆36Updated 5 months ago
- ☆104Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆188Updated 4 months ago
- ☆86Updated last year
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year