A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.
☆162Mar 4, 2026Updated 2 weeks ago
Alternatives and similar repositories for Frontier-CS
Users that are interested in Frontier-CS are comparing it to the libraries listed below
Sorting:
- The official repository of ALE-Bench☆170Updated this week
- Preview Code for Continuum Paper☆51Updated this week
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆39Mar 13, 2026Updated last week
- ☆12Jan 25, 2026Updated last month
- ☆14Nov 12, 2025Updated 4 months ago
- ☆20May 14, 2025Updated 10 months ago
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆130Feb 10, 2026Updated last month
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- ☆30Dec 23, 2025Updated 2 months ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆67Oct 2, 2025Updated 5 months ago
- Accelerating MoE with IO and Tile-aware Optimizations☆606Feb 27, 2026Updated 3 weeks ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆219Oct 12, 2025Updated 5 months ago
- AI-Driven Research Systems (ADRS)☆128Dec 17, 2025Updated 3 months ago
- ☆14Nov 2, 2022Updated 3 years ago
- HPC Python Workshop at RSECon22☆14Oct 17, 2022Updated 3 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆23Feb 7, 2026Updated last month
- Simple Fortran parallel IO benchmark for teaching and benchmarking purposes☆11Nov 25, 2025Updated 3 months ago
- Cray Lustre is HPE's curated Lustre distro for HPE ClusterStor, Cray EX, and other HPE/Cray clients☆18Updated this week
- Competition instructions for the Center for High Performance Computing (CHPC) 2024 Student Cluster Compettion (SCC). Which is hosted by t…☆16Mar 12, 2026Updated last week
- Code for the paper "Bounce: Reliable High-Dimensional Bayesian Optimization for Combinatorial and Mixed Spaces"☆15Apr 30, 2024Updated last year
- Explaining neural decisions contrastively to alternative decisions.☆24Mar 18, 2021Updated 5 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- Diagrams as text tool for visualizing concurrent operation histories☆23Feb 12, 2025Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated last year
- Cray System Management☆11Updated this week
- Initial commit☆13Aug 14, 2023Updated 2 years ago
- The repo of "BugLens"☆39Nov 12, 2025Updated 4 months ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Keras tutorial code for the SC18 tutorial on Deep Learning at Scale☆12Nov 12, 2018Updated 7 years ago
- ☆167Dec 13, 2025Updated 3 months ago
- ☆18Apr 24, 2024Updated last year
- ☆23Aug 1, 2025Updated 7 months ago
- ☆22Dec 25, 2025Updated 2 months ago
- OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆120Jul 13, 2025Updated 8 months ago
- Pytorch implementation of “MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures” (NeurIPS 2020 spotlight)☆13Jul 22, 2021Updated 4 years ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆26Updated this week
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- CLI that uses DSPy to interact with MCP servers.☆24Mar 10, 2025Updated last year