HongShan, formerly Sequoia China, has open-sourced two datasets from its xBench benchmark to help developers test reasoning and search skills in real-world
AI tasks. The ScienceQA and DeepSearch sets feature high-difficulty, regularly updated challenges for LLMs and AI agents.