facebookresearch / LIGHT
LIGHT is a platform for text-situated dialogue research. We originally hosted LIGHT as a live game with dialogue models in a grounded setting. This repo contains all of the code to get the LIGHT game running, as well as reproducible code for the research projects along the way of getting LIGHT to where it was.
☆68Updated last year
Alternatives and similar repositories for LIGHT:
Users that are interested in LIGHT are comparing it to the libraries listed below
- ☆27Updated 2 weeks ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 4 months ago
- ☆64Updated 11 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 2 months ago
- Experiments with generating opensource language model assistants☆97Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L …☆42Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆66Updated 2 months ago
- Based on the tree of thoughts paper☆46Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆91Updated last year
- ☆37Updated last year
- ☆115Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- A repository for transformer critique learning and generation☆88Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆45Updated last year
- ☆35Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- ☆32Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆76Updated 7 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 11 months ago
- ☆24Updated last year
- ☆46Updated 2 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated 6 months ago
- SILO Language Models code repository☆81Updated 10 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago