mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆15Updated last year
Related projects: ⓘ
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆32Updated 8 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- ☆33Updated 3 months ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆79Updated 6 months ago
- Reasoning by Communicating with Agents☆19Updated last month
- Code implementation of synthetic continued pretraining☆13Updated this week
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆65Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- LMTuner: Make the LLM Better for Everyone☆33Updated last year
- Transformers at any scale☆39Updated 8 months ago
- A repository sharing the literatures about large language models☆19Updated last month
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆58Updated 2 months ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆18Updated 3 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆72Updated 8 months ago
- ☆52Updated 7 months ago
- ☆45Updated 7 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆23Updated 9 months ago
- ☆18Updated 3 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences☆62Updated 2 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆45Updated 6 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆39Updated 2 months ago
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Updated last year
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆28Updated 8 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆58Updated 5 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆25Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated last week