mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆14 Updated 2 years ago
Alternatives and similar repositories for simple-toolformer
Users who are interested in simple-toolformer are comparing it to the libraries listed below.
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 Updated last year
- ROUGE for multilingual Summarization ☆25 Updated 3 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆22 Updated last year
- A collection of instruction data and scripts for machine translation. ☆20 Updated last year
- An Experiment on Dynamic NTK Scaling RoPE ☆64 Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated this week
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]; ☆39 Updated last year
- Large-scale query-focused multi-document Summarization dataset ☆10 Updated 3 years ago
- ☆20 Updated last year
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆39 Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 3 weeks ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective ☆62 Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs ☆17 Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection. ☆20 Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none" ☆29 Updated 2 years ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆41 Updated 7 months ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie… ☆26 Updated 2 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆38 Updated 3 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval ☆25 Updated last year
- LGEB: Benchmark of Language Generation Evaluation ☆16 Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI ☆57 Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 Updated 5 months ago
- ☆68 Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models ☆85 Updated last year
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation. ☆20 Updated 3 years ago
- ☆35 Updated last year
- ☆31 Updated 2 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 Updated 9 months ago