Data preparation code for CrystalCoder 7B LLM
☆45 May 10, 2024 Updated 2 years ago
Alternatives and similar repositories for crystalcoder-data-prep
Users that are interested in crystalcoder-data-prep are comparing it to the libraries listed below.
- Pre-training code for CrystalCoder 7B LLM ☆59 May 10, 2024 Updated 2 years ago
- Data preparation code for Amber 7B LLM ☆95 May 10, 2024 Updated 2 years ago
- Pre-training code for Amber 7B LLM ☆174 May 10, 2024 Updated 2 years ago
- Open Implementations of LLM Analyses ☆109 Oct 8, 2024 Updated last year
- ☆239 May 10, 2024 Updated 2 years ago
- A list where most values will be None (or default) ☆11 Apr 11, 2026 Updated last month
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University. ☆64 Oct 12, 2023 Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 Jun 27, 2024 Updated last year
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training ☆14 Oct 25, 2024 Updated last year
- This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep… ☆10 Nov 16, 2023 Updated 2 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness ☆14 Nov 10, 2025 Updated 6 months ago
- ☆10 Apr 15, 2023 Updated 3 years ago
- ☆13 Oct 11, 2024 Updated last year
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents ☆28 Jul 31, 2025 Updated 9 months ago
- Minimum Description Length probing for neural network representations ☆20 Jan 28, 2025 Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆40 Nov 11, 2024 Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models ☆19 Mar 16, 2023 Updated 3 years ago
- ☆20 Dec 14, 2024 Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆178 Sep 15, 2023 Updated 2 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆37 Apr 24, 2022 Updated 4 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement … ☆35 Updated this week
- ☆26 Jun 10, 2025 Updated 11 months ago
- ☆19 Aug 23, 2025 Updated 8 months ago
- Hyperparameter tuning via uncertainty modeling ☆51 May 3, 2024 Updated 2 years ago
- 🔮✍🏻 Automatically organize, analyze, and augment the quality of your obsidian.md notes with AI. ☆17 Aug 28, 2024 Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 Apr 10, 2026 Updated last month
- OpenSource deployment made easy ☆10 Jun 13, 2015 Updated 10 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)" ☆18 Sep 4, 2022 Updated 3 years ago
- Implementation of the content-aware image resizing algorithm presented in the paper "Seam carving for content-aware image resizing" ☆13 Jul 22, 2019 Updated 6 years ago
- This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023). ☆14 Jul 23, 2023 Updated 2 years ago
- An MCP server for Raindrop.io (bookmarking service) ☆20 Apr 10, 2025 Updated last year
- ☆11 Apr 10, 2023 Updated 3 years ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆47 Jul 17, 2025 Updated 10 months ago
- A Framework for Machine Learning on Encrypted Data ☆12 Feb 10, 2022 Updated 4 years ago
- Generate text images for training deep learning ocr model ☆10 Oct 22, 2018 Updated 7 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ☆43 Jan 15, 2024 Updated 2 years ago
- ☆30 Apr 29, 2026 Updated 3 weeks ago
- ☆10 Dec 10, 2024 Updated last year
- ☆27 Mar 13, 2024 Updated 2 years ago