lxe / llama-tuneView external linksLinks
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆50Mar 15, 2023Updated 2 years ago
Alternatives and similar repositories for llama-tune
Users that are interested in llama-tune are comparing it to the libraries listed below
Sorting:
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- ☆43Dec 15, 2023Updated 2 years ago
- From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm☆11Jul 18, 2022Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- 本项目采用BERT等预训练模型实现多项选择型阅读理解任务(Multiple Choice MRC)☆16Jun 20, 2021Updated 4 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 2 years ago
- self complement of baike knowledge base info-box extraction by online analysis.基于互动百科,百度百科,搜狗百科的词条infobox结构化信息抽取,百科知识的融合☆37Mar 30, 2018Updated 7 years ago
- [EMNLP 2020] PyTorch code of PRover: Proof Generation for Interpretable Reasoning over Rules☆19Jul 6, 2023Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- A Keras implementation of the AAAI21 paper "a lightweight neural model for biomedical entity linking"☆53Jul 24, 2022Updated 3 years ago
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"☆20Feb 24, 2024Updated last year
- A collection of prompts for Llama☆102Mar 23, 2023Updated 2 years ago
- AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension (ACL 2022)☆27May 20, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- Experimenting with langchain, FAISS, OpenAI Embeddings, and GPT-3☆26Feb 10, 2023Updated 3 years ago
- chatglm多gpu用deepspeed和☆409Jul 8, 2024Updated last year
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Mar 21, 2022Updated 3 years ago
- Knowledge graph extraction from text using OpenAI ChatGPT for graph extraction and Neo4j for DB storage☆11Feb 26, 2024Updated last year
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated 3 weeks ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- Example of Alpaca-LoRA with llama index.☆31Mar 30, 2023Updated 2 years ago
- LSTM-based dependency graph parser with Bi-LSTM Subtraction and Incremental Tree-LSTM☆28Dec 13, 2017Updated 8 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.☆30Jan 12, 2026Updated last month
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Apr 26, 2021Updated 4 years ago
- 基于向量召回的检索式对话系统解决方案,dense retrieval,FAQ……☆35Nov 10, 2021Updated 4 years ago
- Annotating Columns with Pre-trained Language Models☆34Jun 10, 2022Updated 3 years ago
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
- 是APEX贡献的一个基于大数据平台能力的数据开发平台,帮助企业以最小成本实现链接数据,构建和沉淀数仓模型,降低数据应用门槛,沉淀数据价值。☆12Oct 31, 2024Updated last year
- Code to reproduce "GPT-too: A Language-Model-First Approach for AMR-to-Text-Generation"☆38Sep 17, 2025Updated 4 months ago
- Contextual Bandit Spectral Representation Learner☆12Oct 25, 2022Updated 3 years ago
- Only for real elders who really want to experience old AI Dungeon experience.☆10Nov 8, 2020Updated 5 years ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Oct 28, 2024Updated last year
- Real-time multi-language unit test generation tool via LSP☆31Updated this week
- LangReact 是一个配置化的 Planning Agent 应用开发工具,通过配置、插件,能快速为你的 GPT 应用提供 Planning 功能。☆12Apr 23, 2024Updated last year
- Official Implementation for "Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation"☆14May 6, 2025Updated 9 months ago
- Persistent memory system for agentic AI via MCP - remember, recall, forget with semantic search with knowledge graph☆24Updated this week
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago