GDPVAL: Evaluating AI Model Performance On Real-World Economically Valuable Tasks – Paper Summary
TL;DR GDPval is an OpenAI evaluation of real professional “knowledge-work” on computers. It contains 1,320 expert-authored tasks spanning 44 occupations ...














