Calculating FLOPs of Pre-trained Models in NLP
☆18Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for Cal-FLOPs-for-PLM
Users that are interested in Cal-FLOPs-for-PLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 13, 2026Updated 4 months ago
- A gomoku AI based on Alpha Zero paper.☆12May 1, 2023Updated 3 years ago
- ☆41Mar 12, 2022Updated 4 years ago
- ☆24Jan 20, 2021Updated 5 years ago
- 日期时间实体识别☆11Sep 10, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch Implementation: Code for the paper "Generalizing to Unseen Domains via Adversarial Data Augmentation", NeurIPS 2018. Origin Tenso…☆14Sep 17, 2020Updated 5 years ago
- The code for Mask-based Decoupling-Fusing Network☆23Dec 14, 2020Updated 5 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆29Dec 31, 2021Updated 4 years ago
- A Python Bot for Scraping Conversations from Twitter☆16May 16, 2019Updated 7 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 💬A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems☆10Dec 5, 2019Updated 6 years ago
- ☆20Mar 3, 2025Updated last year
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆22Feb 7, 2025Updated last year
- [Work in progress] A reading list for machine commonsense reasoning☆34Apr 14, 2020Updated 6 years ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 8 months ago
- Few-Shot Preference Optimization (FSPO) personalizes LLMs by reframing reward modeling as a meta-learning problem, enabling rapid adaptat…☆16Feb 27, 2025Updated last year
- ☆13Apr 17, 2018Updated 8 years ago
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 8 months ago
- Contains the code for my Imperial College London Master's thesis on text summarization☆11Oct 25, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated last year
- Structural Pre-training for Dialogue Comprehension (ACL 2021)☆10Apr 25, 2022Updated 4 years ago
- ☆14Sep 7, 2022Updated 3 years ago
- The official implementation of the NAACL-HLT 2019 paper "Microblog Hashtag Generation via Encoding Conversation Contexts"☆29Jun 27, 2019Updated 7 years ago
- Code for SIGIR 2019 paper titled "Syntax-Aware Aspect-Level Sentiment Classification with Proximity-Weighted Convolution Network"☆25Nov 21, 2023Updated 2 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- Code to reproduce results of our experiments using LoRe☆18Jun 10, 2026Updated 2 weeks ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆14Jun 1, 2024Updated 2 years ago
- BIT IR Final Project 2019☆24Nov 15, 2019Updated 6 years ago
- Making of cuda kernel☆17May 27, 2025Updated last year
- The sources codes of the DR-BERT model and baselines☆37Nov 17, 2021Updated 4 years ago
- C++로 딥러닝, 머신러닝 모델 구현☆10Mar 25, 2021Updated 5 years ago
- Codes for coreference-aware machine reading comprehension☆13Mar 13, 2022Updated 4 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago