for-ai / llm-profiling-toolkit
☆13Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-profiling-toolkit
- Small, simple agent task environments for training and evaluation☆16Updated 3 weeks ago
- ☆101Updated 3 months ago
- Experiments for efforts to train a new and improved t5☆76Updated 7 months ago
- ☆128Updated this week
- ☆24Updated 7 months ago
- RWKV-7: Surpassing GPT☆45Updated this week
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆84Updated this week
- ☆57Updated 11 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆128Updated last month
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆119Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language☆93Updated 3 months ago
- Long context evaluation for large language models☆190Updated this week
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆64Updated 2 years ago
- gzip Predicts Data-dependent Scaling Laws☆32Updated 5 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 6 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Normalized Transformer (nGPT)☆87Updated this week
- Code repository for the c-BTM paper☆105Updated last year
- Code for Zero-Shot Tokenizer Transfer☆115Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- ☆20Updated 2 weeks ago
- ☆41Updated 3 weeks ago
- ☆107Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆51Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- An introduction to LLM Sampling☆64Updated 2 weeks ago