mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆15Updated 2 years ago
Alternatives and similar repositories for simple-toolformer:
Users that are interested in simple-toolformer are comparing it to the libraries listed below
- An Experiment on Dynamic NTK Scaling RoPE☆63Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Updated last year
- ROUGE for multilingual Summarization☆24Updated 3 years ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- ☆10Updated 3 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated last year
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆36Updated last month
- ☆35Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 7 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 2 months ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 2 years ago
- Transformers at any scale☆41Updated last year
- ☆34Updated 10 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 7 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- Source code for SIGIR 2022 paper.☆15Updated 3 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"☆20Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆85Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Updated 2 years ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 3 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆20Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆53Updated last year
- ☆98Updated 6 months ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆20Updated this week
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 5 months ago