A comparison of pretraining framework for LLM
☆22Feb 6, 2025Updated last year
Alternatives and similar repositories for LLMTrainer
Users that are interested in LLMTrainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆52Aug 6, 2025Updated 9 months ago
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- ☆10Apr 22, 2021Updated 5 years ago
- ☆10Nov 15, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2023] Learning Motion Refinement for Unsupervised Face Animation☆40Dec 3, 2023Updated 2 years ago
- ☆14Oct 30, 2021Updated 4 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- Mutual attention model for matching QA pairs in dialogues☆11Sep 20, 2020Updated 5 years ago
- Commonsense Knowledge Base Reasoning☆10Sep 3, 2018Updated 7 years ago
- ☆14Nov 8, 2024Updated last year
- 😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Apr 19, 2023Updated 3 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- ☆20Aug 11, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- Making large AI models cheaper, faster and more accessible☆15Apr 20, 2023Updated 3 years ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- Unofficial implementation of Sketch-Guided Text-to-Image Diffusion Models☆13Jun 19, 2023Updated 2 years ago
- ☆25Apr 6, 2026Updated last month
- analyse problems of AI with Math and Code☆29Jul 28, 2025Updated 9 months ago
- ☆16Jan 8, 2024Updated 2 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Oct 28, 2022Updated 3 years ago
- About Flutter 仿小红书项目,个人学习demo,无后端接口全是模拟数据,因为工程量有点大,所以很多交互只预留了接口并没有写入事件,不过核心功能已实现。Flutter RedBook Clone - Personal Learning Demo☆14Jul 17, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆11Apr 24, 2025Updated last year
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- Facial Reenactment from Sparse Landmarks using StyleGAN3☆13Aug 18, 2024Updated last year
- paper information spider; 论文信息爬虫☆14May 22, 2023Updated 3 years ago
- DataFountain第五届达观杯第4名方案☆11Dec 3, 2021Updated 4 years ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- 中英文神经网络机器翻译☆14Jan 17, 2021Updated 5 years ago
- ☆11Oct 8, 2022Updated 3 years ago
- ☆15Jul 4, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆21May 24, 2024Updated 2 years ago
- Implementation of paper Long-Term Effect Estimation with Surrogate Representation☆14Oct 20, 2020Updated 5 years ago
- Serialization and Deserialization for NLP Annotated Documents.☆17Dec 31, 2015Updated 10 years ago
- A query predictor pipeline and service to predict resource usages of Presto queries☆14May 2, 2023Updated 3 years ago
- ☆11Oct 9, 2021Updated 4 years ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 3 years ago