Pytorch自动混合精度训练模板
☆18Apr 6, 2022Updated 3 years ago
Alternatives and similar repositories for amp-pytorch
Users that are interested in amp-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用numpy从零开始实现llama3的推理流程,并对其进行封装,对比GPU,CPU上的表现以及Lora微调。llama3 implemented from scratch using numpy and lora fine-tune.。☆12Jul 16, 2024Updated last year
- 本插件是将faiss集成到greenplum数据库中,以提供向量召回的能力。☆24Jul 18, 2022Updated 3 years ago
- FIGMENT☆15Jan 27, 2020Updated 6 years ago
- 大语言模型训练和服务调研☆37Aug 4, 2023Updated 2 years ago
- The source of MNER-MI.☆19Dec 17, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Clustering Analysis via Deep Generative Models With Mixture Models powered by@pytorch☆14Jan 6, 2022Updated 4 years ago
- ☆14Jun 20, 2022Updated 3 years ago
- ☆12Aug 22, 2021Updated 4 years ago
- Public implementation of ICML'19 paper "White-box vs Black-box: Bayes Optimal Strategies for Membership Inference"☆18May 28, 2020Updated 5 years ago
- An Implementation of Deep Exhaustive Model for Nested NER☆15Jul 19, 2019Updated 6 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- ☆23Mar 21, 2023Updated 3 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- ☆12Sep 22, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Unsupervised Deep Embedding for Clustering Analysis (DEC)☆27Oct 11, 2020Updated 5 years ago
- Flink 中文社区文章整理☆13Jun 3, 2020Updated 5 years ago
- Source code for NAACL 2022 paper: Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction☆17May 9, 2022Updated 3 years ago
- Deep Subspace Clustering + GAN☆23Feb 8, 2018Updated 8 years ago
- A simple home-made way of getting AI to help you out during a remote interview☆11Nov 30, 2023Updated 2 years ago
- [NLP] A english to chinese translator using Transformer structure☆15Jul 26, 2024Updated last year
- 通过观看尚硅谷的Flink实战视频,开了一个仓库,记录源码和一些所需要的数据文件,也欢迎大家积极讨论☆16Mar 1, 2021Updated 5 years ago
- 【CoG2020】Multi-source Data Multi-task Learning for Profiling Players in Online Games☆15Mar 29, 2022Updated 4 years ago
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- the benchmark for finance☆10Jul 4, 2023Updated 2 years ago
- 【TOIS2023】perCLTV: A General System for Personalized Customer Lifetime Value Prediction in Online Games☆11Apr 27, 2023Updated 2 years ago
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆32Jun 14, 2025Updated 9 months ago
- 基于A股市场建立机构持有网络,运用SNA算法计算出最具有投资价值的股票☆12Aug 14, 2019Updated 6 years ago
- ☆19Jun 25, 2024Updated last year
- Design data models, build data warehouses, data lakes & lakehouse, automate data pipelines - SQL | NoSQL | AWS | Spark | Airflow☆15Aug 19, 2023Updated 2 years ago
- 计算广告召回&模型&创意算法(A collection of research and application papers about Match, Ranking, Targeting and Creatives in Internet advertising.)☆18Jan 26, 2024Updated 2 years ago
- ☆29Nov 29, 2022Updated 3 years ago
- BBPE 底层实现☆38Apr 29, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆35May 18, 2025Updated 10 months ago
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆43Jan 7, 2024Updated 2 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆16Sep 20, 2023Updated 2 years ago
- Website for the KU Affective Communication and Computing Lab☆18Mar 18, 2026Updated last week
- Reproduction of the paper "Deep Attentive Learning for Stock Movement Prediction From Social Media Text and Company Correlations"☆12Jul 6, 2023Updated 2 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Stock Market Forecasting: Using Technical Indicators for Price Movement Predictions.☆16Apr 6, 2023Updated 2 years ago