amazon-science / unique-batches
☆11Updated 8 months ago
Alternatives and similar repositories for unique-batches:
Users that are interested in unique-batches are comparing it to the libraries listed below
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Implementation of N-Grammer in Flax☆17Updated 2 years ago
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Aioli: A unified optimization framework for language model data mixing☆23Updated 3 months ago
- ☆23Updated 2 months ago
- ☆13Updated 7 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 5 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 5 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- ☆15Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- ☆26Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- ☆25Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 10 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆50Updated 3 weeks ago
- efficient query encoding for dense retrieval☆11Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 7 months ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 11 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆15Updated 3 months ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- 🧪Create domain-adapted language models by distilling from many pre-trained LMs☆10Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago