Data preparation code for Amber 7B LLM
☆93 · May 10, 2024 · Updated last year
Alternatives and similar repositories for amber-data-prep
Users who are interested in amber-data-prep are comparing it to the libraries listed below.
- Pre-training code for Amber 7B LLM · ☆172 · May 10, 2024 · Updated last year
- Pre-training code for CrystalCoder 7B LLM · ☆57 · May 10, 2024 · Updated last year
- Open Implementations of LLM Analyses · ☆107 · Oct 8, 2024 · Updated last year
- ☆26 · May 30, 2023 · Updated 2 years ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023. · ☆10 · Nov 12, 2023 · Updated 2 years ago
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam · ☆28 · Mar 25, 2024 · Updated last year
- ☆77 · Apr 29, 2024 · Updated last year
- Terminal Image Viewer for iTerm2 · ☆12 · Jul 6, 2019 · Updated 6 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding" · ☆16 · Oct 24, 2022 · Updated 3 years ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on … · ☆13 · Mar 25, 2024 · Updated last year
- GPT-J 6B inference on TensorRT with INT-8 precision · ☆11 · Apr 5, 2023 · Updated 2 years ago
- Planning feature for Superdesk · ☆12 · Updated this week
- TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross Attention Control" for Stable Diffusion · ☆16 · Mar 25, 2023 · Updated 2 years ago
- ☆52 · Jun 6, 2024 · Updated last year
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. · ☆1,410 · Apr 21, 2025 · Updated 10 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… · ☆35 · Aug 15, 2023 · Updated 2 years ago
- Documenting large text datasets 🖼️ 📚 · ☆14 · Dec 17, 2024 · Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars · ☆988 · Jul 23, 2024 · Updated last year
- ☆13 · Oct 20, 2022 · Updated 3 years ago
- codebase release for EMNLP2023 paper publication · ☆19 · Sep 18, 2025 · Updated 5 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection. · ☆22 · Mar 11, 2024 · Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions · ☆29 · Oct 9, 2025 · Updated 4 months ago
- ☆67 · Aug 14, 2025 · Updated 6 months ago
- LMTuner: Make the LLM Better for Everyone · ☆38 · Sep 21, 2023 · Updated 2 years ago
- ☆20 · May 30, 2024 · Updated last year
- ☆17 · Feb 19, 2024 · Updated 2 years ago
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board. · ☆21 · Aug 12, 2022 · Updated 3 years ago
- Package and scripts used to build a dataset of Wikipedia articles in Markdown. · ☆20 · Sep 11, 2023 · Updated 2 years ago
- Hugging Face and Pyserini interoperability · ☆19 · May 18, 2023 · Updated 2 years ago
- ☆25 · Jun 10, 2025 · Updated 8 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization · ☆81 · Dec 25, 2025 · Updated 2 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking" · ☆19 · Jan 25, 2025 · Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] · ☆39 · May 28, 2024 · Updated last year
- ☆22 · Jul 27, 2023 · Updated 2 years ago
- Pile Deduplication Code · ☆18 · May 15, 2023 · Updated 2 years ago
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- ☆37 · Oct 10, 2024 · Updated last year
- ☆33 · Jun 3, 2025 · Updated 8 months ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics · ☆20 · May 1, 2023 · Updated 2 years ago