☆21Sep 6, 2021Updated 4 years ago
Alternatives and similar repositories for se-pytorch-xla
Users that are interested in se-pytorch-xla are comparing it to the libraries listed below.
- Read tfrecords in PyTorch and build a data stream☆18May 1, 2019Updated 6 years ago
- A simple framework for persisting and loading a lineage of payloads while tracking their corresponding lineage of parameters.☆11Sep 19, 2025Updated 7 months ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Sep 26, 2022Updated 3 years ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Apr 21, 2023Updated 2 years ago
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 10 months ago
- docTTTTTquery document expansion model☆374Mar 25, 2023Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Multi-stage passage ranking: monoBERT + duoBERT☆110Nov 23, 2020Updated 5 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆13Apr 25, 2024Updated last year
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 9 months ago
- A library for open domain query facet extraction and generation☆16Apr 24, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated last year
- ☆14Jun 22, 2025Updated 9 months ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Oct 11, 2024Updated last year
- Dense hybrid representations for text retrieval☆64Apr 3, 2023Updated 3 years ago
- ☆17Apr 3, 2026Updated 2 weeks ago
- Bullseye Polytope Clean-Label Poisoning Attack☆15Nov 5, 2020Updated 5 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆341Jul 6, 2023Updated 2 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 3 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Solution in KDD Cup2021 Multi-dataset Time Series Anomaly Detection Competition☆10Jul 15, 2021Updated 4 years ago
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆22Nov 28, 2024Updated last year
- ☆21Jan 23, 2024Updated 2 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- An NLP-suite powered by deep learning☆19Mar 24, 2023Updated 3 years ago
- Official GitHub repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- ☆11Sep 10, 2023Updated 2 years ago
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"☆43Dec 9, 2021Updated 4 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆99Jul 24, 2025Updated 8 months ago
- Spearmint uses Gaussian Processes to automatically optimize hyperparameters. This is a fork of Spearmint for the deep learning community.…☆11Nov 30, 2016Updated 9 years ago
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- ☆11Nov 15, 2020Updated 5 years ago
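- Code from the team ranked 6th in the preliminary round and 11th in the second round of the 2020 Tencent Advertising Algorithm Competition (wujie's code)☆12Apr 12, 2021Updated 5 years ago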
- Structured Gradient Tree Boosting☆25Nov 6, 2018Updated 7 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 3 years ago
- Template repository of a machine-learning Python project powered by FastAPI and PyTorch☆15Aug 26, 2021Updated 4 years ago