☆24Sep 24, 2024Updated last year
Alternatives and similar repositories for w2s
Users that are interested in w2s are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- LLM play 20questions with itself☆13Mar 31, 2023Updated 3 years ago
- ☆12Aug 8, 2023Updated 2 years ago
- James' cookbook of evaluations and finetuning experiments☆27Feb 19, 2026Updated 3 months ago
- ☆24Dec 17, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Feb 10, 2025Updated last year
- Apertium tools☆20May 27, 2021Updated 4 years ago
- ☆14May 14, 2019Updated 7 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆43Feb 12, 2025Updated last year
- Yet another dynamic batch sampler for variable sequence data in PyTorch.☆13Dec 9, 2021Updated 4 years ago
- ☆11Aug 10, 2024Updated last year
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆17Mar 16, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- Official project page for Estimating the Rate-Distortion Function by Wasserstein Gradient Descent☆19Nov 2, 2023Updated 2 years ago
- ☆13Jul 8, 2023Updated 2 years ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆12May 27, 2025Updated 11 months ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts☆16Feb 26, 2024Updated 2 years ago
- PL Reading Group Website☆14Jan 12, 2026Updated 4 months ago
- Code used to create the Linked WikiText-2 dataset☆16May 22, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACM MM 24] GROOT:Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis☆20Mar 24, 2025Updated last year
- PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023☆14Dec 9, 2023Updated 2 years ago
- "We must know. We shall know." - David Hilbert☆21Sep 8, 2025Updated 8 months ago
- ☆15Jul 14, 2022Updated 3 years ago
- A Public repository for the COMeT model☆13Jul 25, 2024Updated last year
- ☆15Nov 3, 2022Updated 3 years ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆34Aug 12, 2024Updated last year
- Implementation of Boundary Attributions for Normal (Vector) Explanations☆11Aug 13, 2021Updated 4 years ago
- Conformal Bayes with importance sampling☆23Oct 25, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Dec 17, 2020Updated 5 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆17Jun 18, 2024Updated last year
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Tools for extracting/reverse engineering Bloodborne game files☆21Jan 27, 2018Updated 8 years ago
- Procedural data generators suite for synthetic pretraining and formal reasoning☆40Updated this week
- Commonsense Explanations for Commonsense Question Answering☆13Jun 27, 2019Updated 6 years ago
- 🤗 [ICLR 2024] Disentangling Time Series Representations via Contrastive based l-Variational Inference☆18Dec 11, 2025Updated 5 months ago