epfl-dlab / GPTurk
☆29 Updated last year
Alternatives and similar repositories for GPTurk:
Users who are interested in GPTurk are comparing it to the libraries listed below.
- Documentation effort for the BookCorpus dataset ☆34 Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago
- Embedding Recycling for Language models ☆38 Updated last year
- Few-shot Learning with Auxiliary Data ☆27 Updated last year
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition ☆22 Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆31 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆23 Updated 3 months ago
- NLP with Rust for Python 🦀🐍 ☆61 Updated 10 months ago
- Submission to the inverse scaling prize ☆23 Updated last year
- Based on the tree of thoughts paper ☆48 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- ☆22 Updated 3 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni… ☆29 Updated last year
- ☆19 Updated 2 years ago
- ☆44 Updated 5 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated 2 years ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ☆23 Updated last month
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models ☆17 Updated 2 weeks ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆33 Updated 10 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆23 Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆33 Updated 2 weeks ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated last year
- Minimum Description Length probing for neural network representations ☆19 Updated 2 months ago
- ☆28 Updated last week
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated last month