Data preparation code for CrystalCoder 7B LLM
☆43May 10, 2024Updated last year
Alternatives and similar repositories for crystalcoder-data-prep
Users that are interested in crystalcoder-data-prep are comparing it to the libraries listed below
Sorting:
- Pre-training code for CrystalCoder 7B LLM☆57May 10, 2024Updated last year
- Data preparation code for Amber 7B LLM☆93May 10, 2024Updated last year
- Pre-training code for Amber 7B LLM☆172May 10, 2024Updated last year
- Open Implementations of LLM Analyses☆107Oct 8, 2024Updated last year
- ☆235May 10, 2024Updated last year
- This repository provides installation scripts and configuration files for deploying the CSGHub instance, includes Helm charts and Docker…☆19Updated this week
- A list where most values will be None (or default)☆11Jul 19, 2023Updated 2 years ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Oct 12, 2023Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- A collection of CLI LLM tools that I built and use daily☆15Aug 7, 2024Updated last year
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- ☆12Jul 25, 2023Updated 2 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep…☆10Nov 16, 2023Updated 2 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 4 months ago
- ☆10Apr 15, 2023Updated 2 years ago
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆24Jul 31, 2025Updated 7 months ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- ☆16Oct 2, 2024Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- Text-2-SQL☆19Feb 21, 2025Updated last year
- Say hi to anyone, for humans and agents. An Inbox Zero product☆20Jul 8, 2025Updated 8 months ago
- An open-source framework for building monolithic or distributed agentic systems, ranging from simple LLM calls to compositional workflows…☆26Jan 14, 2026Updated 2 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- ☆20Dec 14, 2024Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Sep 15, 2023Updated 2 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆34May 2, 2025Updated 10 months ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Jun 7, 2023Updated 2 years ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated 8 months ago
- ☆19Aug 23, 2025Updated 6 months ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Sep 4, 2022Updated 3 years ago
- Implementation of the content-aware image resizing algorithm presented in the paper "Seam carving for content-aware image resizing"☆13Jul 22, 2019Updated 6 years ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 3 months ago
- This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).☆14Jul 23, 2023Updated 2 years ago
- ☆26Jan 5, 2026Updated 2 months ago
- 我的微信公众号 ”aber的个人号“☆12May 9, 2024Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 8 months ago