amazon-science / unique-batches
☆11Updated 6 months ago
Alternatives and similar repositories for unique-batches:
Users that are interested in unique-batches are comparing it to the libraries listed below
- ☆25Updated last year
- efficient query encoding for dense retrieval☆11Updated 6 months ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- Minimum Description Length probing for neural network representations☆18Updated 3 weeks ago
- ☆15Updated last year
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- ☆12Updated 5 months ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Updated last year
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- ☆21Updated 3 weeks ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 11 months ago
- Entailment self-training☆25Updated last year
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated last year
- Aioli: A unified optimization framework for language model data mixing☆20Updated last month
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Embedding Recycling for Language models☆38Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 8 months ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆17Updated 3 months ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 8 months ago
- Efficient Scaling laws and collaborative pretraining.☆14Updated 3 weeks ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 4 months ago
- ☆13Updated 7 months ago