AI DailyAI Daily – 2025-05-04(Evening)AI model benchmarkingClaude CodeClaude Code programming assistantLangGraphLangGraph Agent applicationsQwen3 series modelsQwen3-235B-A22B performanceRunway Gen-4Runway Gen-4 References featureSimpleBench benchmarking