Fullstack bench: Evaluating LLMs as full stack coder

Published in arXiv preprint, 2024

Seed-Foundation-Code Team, ByteDance.

A benchmark for end-to-end full-stack coding capabilities of large language models.