Data preparation code for Amber 7B LLM
☆95 · May 10, 2024 · Updated 2 years ago
Alternatives and similar repositories for amber-data-prep
Users that are interested in amber-data-prep are comparing it to the libraries listed below.
- Data preparation code for CrystalCoder 7B LLM ☆45 · May 10, 2024 · Updated 2 years ago
- Pre-training code for Amber 7B LLM ☆174 · May 10, 2024 · Updated 2 years ago
- Pre-training code for CrystalCoder 7B LLM ☆59 · May 10, 2024 · Updated 2 years ago
- Open Implementations of LLM Analyses ☆109 · Oct 8, 2024 · Updated last year
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,438 · Apr 30, 2026 · Updated 3 weeks ago
- Quickly and securely turn any Linux box into a build and deployment assistant ☆25 · Oct 3, 2024 · Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 · Aug 15, 2023 · Updated 2 years ago
- ☆77 · Apr 29, 2024 · Updated 2 years ago
- ☆35 · Jun 3, 2025 · Updated 11 months ago
- Reproducible and flexible LLM evaluations for scientific reasoning. ☆28 · Jul 23, 2025 · Updated 9 months ago
- LMTuner: Make the LLM Better for Everyone ☆38 · Sep 21, 2023 · Updated 2 years ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆986 · Jul 23, 2024 · Updated last year
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding" ☆16 · Oct 24, 2022 · Updated 3 years ago
- ☆207 · Apr 19, 2025 · Updated last year
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,497 · Nov 5, 2025 · Updated 6 months ago
- ☆56 · Jun 6, 2024 · Updated last year
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- All-in-one Full-Featured Python/Flet/Flutter Application to make the most of all the latest Open-Source AI Art Generators in an intuitive… ☆16 · May 30, 2025 · Updated 11 months ago
- ☆15 · Apr 14, 2026 · Updated last month
- ☆56 · Jun 26, 2025 · Updated 10 months ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023. ☆10 · Nov 12, 2023 · Updated 2 years ago
- ☆94 · Oct 5, 2023 · Updated 2 years ago
- codebase release for EMNLP2023 paper publication ☆19 · Sep 18, 2025 · Updated 8 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,679 · Mar 8, 2024 · Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,058 · May 6, 2026 · Updated 2 weeks ago
- Documenting large text datasets 🖼️ 📚 ☆14 · Dec 17, 2024 · Updated last year
- ☆13 · Oct 20, 2022 · Updated 3 years ago
- A collection of CLI LLM tools that I built and use daily ☆15 · Aug 7, 2024 · Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆604 · Nov 17, 2023 · Updated 2 years ago
- Hugging Face and Pyserini interoperability ☆19 · May 18, 2023 · Updated 3 years ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models ☆25 · Jul 22, 2024 · Updated last year
- Awesome latest models, datasets and benchmarks on streaming/online video understanding. ☆28 · Oct 19, 2025 · Updated 7 months ago
- Official repository for paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆227 · May 6, 2026 · Updated 2 weeks ago
- Language models scale reliably with over-training and on downstream tasks ☆101 · Apr 2, 2024 · Updated 2 years ago
- ☆415 · Nov 2, 2023 · Updated 2 years ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆39 · May 28, 2024 · Updated last year
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours) ☆12 · May 13, 2026 · Updated last week
- Package and scripts used to build a dataset of Wikipedia articles in Markdown. ☆20 · Sep 11, 2023 · Updated 2 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation ☆11 · Mar 7, 2023 · Updated 3 years ago