☆21Sep 6, 2021Updated 4 years ago
Alternatives and similar repositories for se-pytorch-xla
Users that are interested in se-pytorch-xla are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- Cross language information retrieval pipeline☆19Jan 12, 2026Updated 4 months ago
- pytorch读取tfrecords,构造数据流☆18May 1, 2019Updated 7 years ago
- A simple framework for persisting and loading a lineage of payloads while tracking their corresponding lineage of parameters.☆11Sep 19, 2025Updated 8 months ago
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset☆13Sep 13, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Sep 26, 2022Updated 3 years ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Apr 21, 2023Updated 3 years ago
- Inquisitive Parrots for Search☆200Jun 5, 2025Updated 11 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆65Updated this week
- 6th Place Solution for the Google - Isolated Sign Language Recognition Kaggle Competition☆14May 4, 2023Updated 3 years ago
- docTTTTTquery document expansion model☆375Mar 25, 2023Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Multi-stage passage ranking: monoBERT + duoBERT☆110Nov 23, 2020Updated 5 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Jan 5, 2023Updated 3 years ago
- ☆13Apr 25, 2024Updated 2 years ago
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 11 months ago
- A library for open domain query facet extraction and generation☆16Apr 24, 2024Updated 2 years ago
- ☆14Jun 22, 2025Updated 11 months ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Oct 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Transformer-based models for Natural Language Processing in OCaml☆26May 10, 2021Updated 5 years ago
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- ☆19May 16, 2026Updated 2 weeks ago
- Bullseye Polytope Clean-Label Poisoning Attack☆18Nov 5, 2020Updated 5 years ago
- ☆13Jan 3, 2022Updated 4 years ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Oct 26, 2023Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆341Jul 6, 2023Updated 2 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 4 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆22Nov 28, 2024Updated last year
- 🔍 Code Search Tools & Experiments☆12May 18, 2026Updated last week
- Agent CLI☆13May 20, 2026Updated last week
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆19Aug 30, 2022Updated 3 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- An NLP-suite powered by deep learning☆19Mar 24, 2023Updated 3 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago