dylanashley / story-distiller
☆12Updated 5 months ago
Alternatives and similar repositories for story-distiller:
Users that are interested in story-distiller are comparing it to the libraries listed below
- ☆15Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- implementation of dualformer☆15Updated last month
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆56Updated 2 years ago
- Google Research☆46Updated 2 years ago
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆17Updated this week
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Updated 2 years ago
- ☆37Updated 8 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Latent Diffusion Language Models☆68Updated last year
- ☆9Updated last year
- Efficient Computation of d-Dimensional Earth Mover's Distance☆9Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆44Updated 2 years ago
- ☆33Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆48Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- ☆12Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 10 months ago
- ☆36Updated 2 years ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆61Updated 2 years ago
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- ☆21Updated 2 years ago
- ☆11Updated 2 years ago