dylanashley / story-distiller
☆12 Updated 2 months ago
Alternatives and similar repositories for story-distiller:
Users who are interested in story-distiller are comparing it to the libraries listed below.
- ☆11 Updated 2 years ago
- ☆14 Updated last year
- Official Implementation of "VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models" ☆25 Updated last week
- Official code for the paper: "Metadata Archaeology" ☆18 Updated last year
- 📰 Computing the information content of trained neural networks ☆21 Updated 3 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆17 Updated last year
- Minimum Description Length probing for neural network representations ☆18 Updated last week
- Official implementation for Sparse MetA-Tuning (SMAT) ☆16 Updated 6 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- ☆32 Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆47 Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated 7 months ago
- ☆28 Updated last year
- ☆35 Updated 2 years ago
- ☆11 Updated 3 years ago
- ☆29 Updated 7 months ago
- ☆12 Updated 4 months ago
- Google Research ☆46 Updated 2 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short. ☆48 Updated 7 months ago
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- ☆36 Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated 7 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 4 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated last year