llyx97 / RositaView external linksLinks
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Oct 18, 2022Updated 3 years ago
Alternatives and similar repositories for Rosita
Users that are interested in Rosita are comparing it to the libraries listed below
Sorting:
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago
- Code of our IJCAI2021 paper: "Learning Class-Transductive Intent Representations for Zero-shot Intent Detection"☆15Sep 10, 2021Updated 4 years ago
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning"☆16Feb 22, 2023Updated 2 years ago
- ☆24Jan 20, 2021Updated 5 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Feb 1, 2023Updated 3 years ago
- The code for "Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Commonsense Know…☆21May 22, 2022Updated 3 years ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆65Sep 28, 2024Updated last year
- The code for the paper "Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Kn…☆27May 22, 2022Updated 3 years ago
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)☆75Jan 20, 2022Updated 4 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆13Jan 1, 2025Updated last year
- 使用多头注意力机制实现数字预测☆10May 10, 2022Updated 3 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- Compilation of ML/AI Resources for Members of MITxHarvard Women in AI☆11Mar 28, 2022Updated 3 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- Pytorch implementation of HCNAF: Hyper-Conditioned Neural Autoregressive Flow (CVPR 2020)☆15Jun 14, 2020Updated 5 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- ☆15Aug 19, 2024Updated last year
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- hakken is a coding agent which needs hell lot of context☆30Dec 4, 2025Updated 2 months ago
- This repository is the summary of all of our works for the XLA.☆11Jan 14, 2018Updated 8 years ago
- Machine Learning Reading Group☆11Sep 15, 2023Updated 2 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- a demo for how to execute bert_base_chinese based model in java☆10Mar 8, 2019Updated 6 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- ☆11Mar 19, 2023Updated 2 years ago
- ☆11Apr 19, 2021Updated 4 years ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time.☆14Aug 8, 2022Updated 3 years ago
- Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting☆12Mar 24, 2023Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- ⛔️ DEPRECATED GPU Ocean Python/CUDA codebase☆11Nov 9, 2023Updated 2 years ago