thomasnormal / fewshot
ā23Updated 2 weeks ago
Related projects: ā
- NLP with Rust for Python š¦šā57Updated 3 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learningā34Updated 6 months ago
- QLoRA for Masked Language Modelingā20Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.ā13Updated last week
- A place to store reusable transformer components of my own creation or found on the interwebsā43Updated 3 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).ā73Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsā49Updated 3 weeks ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataā26Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuningā54Updated last month
- ā38Updated this week
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pā¦ā33Updated last year
- ā18Updated 5 months ago
- Understanding how features learned by neural networks evolve throughout trainingā30Updated this week
- ā29Updated 2 weeks ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadā18Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsā22Updated 6 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.ā44Updated 3 months ago
- Mixtral finetuning