Calculating FLOPs of Pre-trained Models in NLP
☆18Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for Cal-FLOPs-for-PLM
Users that are interested in Cal-FLOPs-for-PLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of recent papers on knowledge-based machine reading comprehension.☆26Aug 4, 2020Updated 5 years ago
- ☆13Feb 13, 2026Updated 3 months ago
- A gomoku AI based on Alpha Zero paper.☆12May 1, 2023Updated 3 years ago
- ☆41Mar 12, 2022Updated 4 years ago
- ☆24Jan 20, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code for Mask-based Decoupling-Fusing Network☆23Dec 14, 2020Updated 5 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 4 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- 💬A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems☆10Dec 5, 2019Updated 6 years ago
- Structured Chemistry Reasoning with Large Language Models☆42May 4, 2024Updated 2 years ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆21Feb 7, 2025Updated last year
- Implementation of QKVAE☆11Feb 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.☆15Aug 25, 2023Updated 2 years ago
- Source code of FedPrompt☆16May 4, 2022Updated 4 years ago
- [Work in progress] A reading list for machine commonsense reasoning☆34Apr 14, 2020Updated 6 years ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 7 months ago
- Towards Efficient Shapley Value Estimation via Cross-contribution Maximization☆14Jul 8, 2022Updated 3 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆16Apr 21, 2025Updated last year
- Salinity and turbidity detection of water ponds using Satellite imagery☆15Nov 2, 2023Updated 2 years ago
- ☆13Apr 17, 2018Updated 8 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 8 months ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- Contains the code for my Imperial College London Master's thesis on text summarization☆10Oct 25, 2022Updated 3 years ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated last year
- Structural Pre-training for Dialogue Comprehension (ACL 2021)☆10Apr 25, 2022Updated 4 years ago
- Our unique contributions are in tools/train/benchmark.☆22Apr 14, 2025Updated last year
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 3 years ago
- ☆14Sep 7, 2022Updated 3 years ago
- Explicit Sentence Compression for Neural Machine Translation☆10May 12, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Large-scale exact string matching tool☆17Mar 7, 2025Updated last year
- The official implementation of the NAACL-HLT 2019 paper "Microblog Hashtag Generation via Encoding Conversation Contexts"☆29Jun 27, 2019Updated 6 years ago
- chineseocr lite android dbnet,超轻量级中文ocr android demo,支持竖排文字识别, 支持ncnn推理 ( dbnet+crnn+anglenet)☆11Jan 18, 2021Updated 5 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆14Jun 1, 2024Updated 2 years ago