thomasnormal / fewshotLinks
☆29Updated 3 months ago
Alternatives and similar repositories for fewshot
Users that are interested in fewshot are comparing it to the libraries listed below
Sorting:
- ☆25Updated 9 months ago
- ☆40Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- An introduction to LLM Sampling☆79Updated last year
- Latent Large Language Models☆19Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39Updated 11 months ago
- ☆67Updated 8 months ago
- ☆10Updated last year
- Chat Markup Language conversation library☆55Updated 2 years ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- Simple GRPO scripts and configurations.☆59Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 4 months ago
- Tools to make language models a bit easier to use☆64Updated last week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Updated last year
- Simple repository for training small reasoning models☆49Updated last year
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- ☆53Updated 11 months ago
- Structured Generation Evals☆14Updated last year
- PageRank for LLMs☆52Updated 4 months ago
- LLM training in simple, raw C/CUDA☆15Updated last year
- ☆59Updated 2 months ago
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆63Updated 4 months ago
- ☆134Updated 4 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆76Updated last week
- ☆45Updated 2 years ago
- Track the progress of LLM context utilisation☆55Updated 9 months ago