☆25Jul 13, 2024Updated last year
Alternatives and similar repositories for LLaMA-Factory-docker
Users that are interested in LLaMA-Factory-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow☆14Apr 8, 2017Updated 8 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- ☆10May 6, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- Python repo for the XDK auto-generated code.☆25Feb 28, 2026Updated last month
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated last year
- multi task learning for multi-classification using keras☆13Feb 10, 2020Updated 6 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 3 years ago
- ☆18Apr 1, 2025Updated 11 months ago
- Covert Keras models to Pytorch☆12Dec 22, 2018Updated 7 years ago
- Comparaison of adversarial training algorithms (FreeLB, FreeAT and K-PGD) on natural language tasks☆13Feb 14, 2020Updated 6 years ago
- ☆10Jul 18, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 3 years ago
- 本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者…☆18Oct 17, 2022Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Numbeo Unofficial API☆16Oct 16, 2022Updated 3 years ago
- a game framework. warning: wip, dev, unstable, radiation hazard, defcon 3☆24May 10, 2015Updated 10 years ago
- ☆32Dec 12, 2025Updated 3 months ago
- An agent for CUDA compute-communication kernel co-design☆33Updated this week
- Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering, ACL 2025☆21Oct 28, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders…☆17Dec 11, 2018Updated 7 years ago
- 供AI训练的中文数据集(持续更新。。。)与AI公司图谱,目前的数据集餐饮行业8000问,百度知道,Alpaca中文数据集,计算机领域数据集,Vicuna数据集,RedPajama数据集,Wikipedia中文词条数据集,网站论坛问答数据集☆65Nov 29, 2023Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- bert-flat 简化版 添加了很多注释☆15Nov 25, 2021Updated 4 years ago
- 小飞机翻墙教程