LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆50Mar 15, 2023Updated 2 years ago
Alternatives and similar repositories for llama-tune
Users that are interested in llama-tune are comparing it to the libraries listed below
- ☆43Dec 15, 2023Updated 2 years ago
- From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm☆11Jul 18, 2022Updated 3 years ago
- This project uses pretrained models such as BERT to implement multiple-choice reading comprehension (Multiple Choice MRC)☆16Jun 20, 2021Updated 4 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 2 years ago
- self complement of baike knowledge base info-box extraction by online analysis. Structured infobox extraction from entries on Hudong Baike, Baidu Baike, and Sogou Baike, with fusion of the encyclopedia knowledge☆37Mar 30, 2018Updated 7 years ago
- Intelligent design experiment tool: Artificial Intelligence for Graph Design☆20Dec 12, 2022Updated 3 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- [EMNLP 2020] PyTorch code of PRover: Proof Generation for Interpretable Reasoning over Rules☆19Jul 6, 2023Updated 2 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Mar 16, 2023Updated 2 years ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Apr 3, 2023Updated 2 years ago
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"☆20Feb 24, 2024Updated 2 years ago
- A collection of prompts for Llama☆102Mar 23, 2023Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Nov 21, 2023Updated 2 years ago
- AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension (ACL 2022)☆27May 20, 2022Updated 3 years ago
- Experimenting with langchain, FAISS, OpenAI Embeddings, and GPT-3☆26Feb 10, 2023Updated 3 years ago
- ChatGLM on multiple GPUs using DeepSpeed and☆408Jul 8, 2024Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Dec 12, 2021Updated 4 years ago
- ☆19Feb 27, 2026Updated last week
- Knowledge graph extraction from text using OpenAI ChatGPT for graph extraction and Neo4j for DB storage☆11Feb 26, 2024Updated 2 years ago
- Example of Alpaca-LoRA with llama index.☆31Mar 30, 2023Updated 2 years ago
- Reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM☆27Sep 19, 2023Updated 2 years ago
- LSTM-based dependency graph parser with Bi-LSTM Subtraction and Incremental Tree-LSTM☆28Dec 13, 2017Updated 8 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.☆30Jan 12, 2026Updated last month
- A retrieval-based dialogue system solution built on vector recall: dense retrieval, FAQ, …☆35Nov 10, 2021Updated 4 years ago
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Apr 26, 2021Updated 4 years ago
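- A data development platform contributed by APEX and built on big-data platform capabilities. It helps enterprises connect data at minimal cost, build and consolidate data warehouse models, lower the barrier to data applications, and accumulate data value.☆12Oct 31, 2024Updated last year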
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
- Code to reproduce "GPT-too: A Language-Model-First Approach for AMR-to-Text-Generation"☆38Sep 17, 2025Updated 5 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆38Oct 20, 2022Updated 3 years ago
- OAQA Biomedical Question Answering (BioASQ) System☆38May 16, 2017Updated 8 years ago
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Mar 18, 2024Updated last year
- A simple tool to help get information in NKU-EAMIS(NKU Education Affairs Management Information System).☆10Jul 27, 2020Updated 5 years ago
- A Multi Layer Perceptron (MLP) Artificial Neural Network (ANN) Framework Developed in C for Machine Learning (ML) and Deep Learning (DL)☆11May 4, 2025Updated 10 months ago
- ☆30May 5, 2014Updated 11 years ago
- Official Implementation for "Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation"☆14May 6, 2025Updated 10 months ago
- Transferability of Natural Language Inference to Biomedical Question Answering☆12Mar 25, 2021Updated 4 years ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago