IsaacRe / Syntactically-Constrained-Sampling
LLM sampling method for enforcing syntax adherence in generated output
☆21Updated last year
Related projects: ⓘ
- ☆24Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- One stop shop for all things carp☆58Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆20Updated last year
- ☆65Updated 2 months ago
- ☆22Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- 📜 [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswa…☆36Updated 10 months ago
- Code repository for the c-BTM paper☆105Updated 11 months ago
- Experiments for efforts to train a new and improved t5☆76Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 6 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆33Updated 6 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated 11 months ago
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- ☆34Updated last year
- ☆44Updated 2 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆33Updated last year
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Multi-Domain Expert Learning☆67Updated 7 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- ☆13Updated this week
- ☆48Updated 6 months ago
- Small, simple agent task environments for training and evaluation☆13Updated last week
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆41Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 6 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year