thomasnormal / fewshot
☆28 Updated last month
Alternatives and similar repositories for fewshot
Users who are interested in fewshot are comparing it to the repositories listed below.
- NLP with Rust for Python 🦀🐍 ☆64 Updated 2 months ago
- A framework for optimizing DSPy programs with RL ☆94 Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ☆32 Updated 3 months ago
- An introduction to LLM Sampling ☆79 Updated 7 months ago
- ☆38 Updated last year
- Simple GRPO scripts and configurations. ☆59 Updated 5 months ago
- ☆23 Updated 2 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆62 Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 5 months ago
- Storing long contexts in tiny caches with self-study ☆117 Updated this week
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆49 Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 Updated last year
- ☆47 Updated last year
- ☆9 Updated 9 months ago
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated 5 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆94 Updated 2 weeks ago
- Chat Markup Language conversation library ☆55 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆102 Updated last year
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 Updated last year
- ☆64 Updated 2 months ago
- ☆70 Updated 2 weeks ago
- ☆63 Updated 3 weeks ago
- Pre-train Static Word Embeddings ☆85 Updated 2 months ago
- ☆56 Updated 2 months ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 Updated last year
- ☆77 Updated last year
- ☆49 Updated 5 months ago
- A reading list of relevant papers and projects on foundation model annotation ☆27 Updated 5 months ago
- Simple repository for training small reasoning models ☆32 Updated 5 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆150 Updated 2 months ago