Repository for analysis and experiments in the BigCode project.
☆127Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for bigcode-analysis
Users that are interested in bigcode-analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆492Aug 15, 2024Updated last year
- ☆19Aug 10, 2024Updated last year
- A framework for the evaluation of autoregressive code generation language models.☆1,040Jul 22, 2025Updated 9 months ago
- ☆26Mar 6, 2024Updated 2 years ago
- Scaling Data-Constrained Language Models☆343Jun 28, 2025Updated 10 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Locality Sensitive Hashing☆80Jul 12, 2023Updated 2 years ago
- All-in-one text de-duplication☆753Mar 9, 2026Updated last month
- Hugging Face Download (Cache) Manager☆22Aug 7, 2022Updated 3 years ago
- Source of the website of the BigCode project.☆22Apr 9, 2026Updated 2 weeks ago
- This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …☆22Apr 15, 2025Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆67Oct 10, 2023Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Nov 9, 2023Updated 2 years ago
- ☆23Jul 10, 2023Updated 2 years ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 6 months ago
- User-friendly viewer for Parquet files☆11Mar 7, 2026Updated last month
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆104Jan 15, 2024Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆131Apr 11, 2024Updated 2 years ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Run evaluation on LLMs using human-eval benchmark☆430Sep 12, 2023Updated 2 years ago
- Operations Research Algorithms☆19Mar 20, 2024Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Fine-tune SantaCoder for Code/Text Generation.☆195Apr 11, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- FlexiTokens☆20Dec 27, 2025Updated 4 months ago
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti…☆25Nov 18, 2022Updated 3 years ago
- Generative model for code infilling and synthesis☆313Sep 9, 2023Updated 2 years ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆478Feb 5, 2025Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- ☆86Jun 13, 2023Updated 2 years ago
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An open collection of methodologies to help with successful training of large language models.☆558Feb 15, 2024Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,482May 1, 2025Updated 11 months ago
- DARPA Cyber Grand Challenge Linux source code☆18Jul 9, 2015Updated 10 years ago
- Make triton easier☆50Jun 12, 2024Updated last year
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- Development containers for triton and triton-cpu☆27Apr 17, 2026Updated last week
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year