Ongoing research project for code and math LLMs
☆27Jul 4, 2025Updated 7 months ago
Alternatives and similar repositories for swallow-code-math
Users who are interested in swallow-code-math compare it to the repositories listed below
- ☆12Feb 26, 2025Updated last year
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Sep 24, 2025Updated 5 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated 9 months ago
- ☆34May 9, 2025Updated 9 months ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆48Jul 29, 2025Updated 7 months ago
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- This is the code for an agentic RAG method with a dynamic workflow.☆12Jan 22, 2026Updated last month
- ☆18Mar 2, 2025Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- ☆11Jun 12, 2024Updated last year
- Example code for the NNGeometry PyTorch library☆10Aug 20, 2025Updated 6 months ago
- scene detect and auto cut implementation☆11Mar 15, 2024Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- ☆17Dec 23, 2025Updated 2 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 4 months ago
- Graphical user interface for text-guided face editing☆11Jan 18, 2023Updated 3 years ago
- Some commonly used functions and modules☆10Jan 15, 2024Updated 2 years ago
- ☆12Mar 1, 2025Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- ☆14Jun 24, 2024Updated last year
- ACL24☆11Jun 7, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- The code for "T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval"☆20Jul 30, 2025Updated 7 months ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- Implementation of various generative models☆14Oct 1, 2018Updated 7 years ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Jul 29, 2023Updated 2 years ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Implementation for "L-CoDer: Language-based Colorization with Color-object Decoupling Transformer"☆13Jan 20, 2024Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆26Jan 4, 2026Updated last month
- ☆11Mar 12, 2019Updated 6 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Sep 22, 2025Updated 5 months ago
- Fine-tuning-free Shapley value (FreeShap) for instance attribution☆14May 29, 2024Updated last year
- ☆13Nov 22, 2024Updated last year
- MPI Code Generation through Domain-Specific Language Models☆14Nov 19, 2024Updated last year