Models
OpenAI o1/o3 vs Claude Opus Extended Thinking: Reasoning Bench
Misaki Hayashi, AI Research Analyst | 2026-04-21 | 12 min read
o1, o3, Extended Thinking, Reasoning, Benchmarks
This article is written in Japanese. A summary follows:
OpenAI o1/o3 and Claude Opus 4.5/4.7 Extended Thinking are measured on AIME 2025, GPQA Diamond, ARC-AGI 2 and SWE-bench Verified, with per-task strengths broken down.