Berita AI - 2025-08-07(Edisi malam)

Kata Kunci：AI hidrogel, Robot bedah otonom, Mikroskop cerdas, GPT-5, Kompetisi catur model besar, Generasi video AI, Robot pendamping AI, Etika AI, Desain AI untuk hidrogel perekat super, Robot bedah Da Vinci melakukan kolesistektomi otonom, Prediksi salah lipat protein dengan pembelajaran mendalam, Kemampuan penalaran GPT-5 melampaui manusia, Kinerja AI catur Grok 4

🔥 Spotlight

AI-Created Hydrogel Adheres to Everything : Breakthroughs have been made in AI-assisted material design, with Nature featuring an AI-designed super-adhesive hydrogel on its cover. This hydrogel, developed by analyzing natural adhesive protein sequences, achieves strong adhesion in wet environments and boasts long-term stability and biocompatibility. This technology is poised to revolutionize biomedical applications such as prosthetic coatings, wearable biosensors, and underwater repair materials, opening up a new end-to-end data-driven path for soft material design and demonstrating AI’s immense potential in material science. (Source: 36氪)

Autonomous Surgical Robot Successfully Removes Gallbladder : Johns Hopkins University and other institutions have developed a system called SRT-H, enabling the da Vinci surgical robot to autonomously complete critical steps of gallbladder removal without continuous human intervention. The system trains high-level planners and low-level motion generators through imitation learning and can self-correct errors during operation, demonstrating the immense potential of autonomous surgery. Although currently tested only on ex-vivo tissue and slower than human surgeons, its natural language interface and interpretability lay a crucial foundation for future safe autonomous surgery. (Source: DeepLearning.AI Blog)

Open Agentic LLMs Proliferate, Robot Removes Gallbladders, Reasoning Models Boost Emissions, OpenAI Re-Opens

Smart Microscope Predicts Protein Misfolding Aggregation : EPFL researchers have developed a smart microscope using deep learning that can track and analyze the aggregation process of misfolded proteins associated with neurodegenerative diseases in real-time, even predicting it before it begins. The system combines image classification algorithms with Brillouin microscopy, automatically triggering analysis upon detecting protein aggregation, significantly improving imaging efficiency and reducing fluorescent labeling. This breakthrough is crucial for understanding the biomechanical mechanisms of neurodegenerative diseases and for drug discovery, marking the immense potential of smart microscopes in life sciences. (Source: aihub.org)

Smart microscope captures aggregation of misfolded proteins

🎯 Trends

Silicon Valley AI Giants Intensively Release New Models : Silicon Valley AI giants have recently launched new advancements, accelerating AI competition. OpenAI, after six years, re-released its open-source model gpt-oss, including 120B and 20B versions, emphasizing local deployment and Agent applications, with performance approaching o4-mini. Google unveiled Genie 3, enabling text-to-minute-level interactive 3D virtual worlds, seen as a key step towards AGI. Anthropic updated Claude Opus 4.1, achieving new SOTA in AI programming capabilities, further solidifying its lead in the programming domain. These releases signal accelerated AI competition in open-source, world models, and vertical applications. (Source: 36氪, DeepLearning.AI Blog, 量子位, 36氪)

GPT-5 Information Leaks Extensively Ahead of Release : OpenAI has announced the GPT-5 launch event, with significant information leaks. GPT-5 is reportedly launching in standard, mini, nano, and chat versions, supporting tiered access, allowing free users to experience the basic version. Internal tests show its excellent performance in reasoning, programming, mathematics, and scientific problem-solving, with reasoning capabilities surpassing human average for the first time. Concurrently, Sam Altman has issued huge bonuses to employees, and OpenAI’s valuation is expected to reach $500 billion, indicating its confidence in GPT-5 and market anticipation. (Source: 36氪, 36氪)

First Large Model Chess Championship: Grok 4 and o3 Advance to Finals : Google’s Kaggle platform hosted the first AI International Chess Championship, pitting eight top LLMs against each other. In the first round, domestic models like DeepSeek R1 and Kimi K2 Instruct were unfortunately eliminated. In the semifinals, xAI’s Grok 4 and OpenAI’s o3 defeated their opponents to advance to the finals. The competition rules restricted models from calling external tools, aiming to purely test their reasoning abilities, exposing deficiencies in AI models’ context understanding and tactical execution. However, Grok 4’s performance received high praise from Elon Musk, attracting widespread attention. (Source: 36氪, 36氪, 36氪)

爆冷，首届大模型争霸，Grok 4下出“神之一手”？DeepSeek、Kimi惨遭淘汰

Overview of China’s AI Large Model Platform Progress in July : July saw a vibrant Chinese large model market. The WAIC conference focused on embodied AI, emphasizing AI’s shift from “screen to reality.” Multi-agent systems emerged as a new trend, with 360 Nano AI launching L4 multi-agent swarms for complex task collaboration. Leading manufacturers open-sourced their latest models, such as Alibaba’s Qwen3 series, Moonshot AI’s Kimi K2, and Zhipu AI’s GLM-4.5, fostering the nascent “ecosystem” of domestic large models, continuously enhancing their technical strength, and dominating international rankings. (Source: 36氪, 量子位, DeepLearning.AI Blog, 量子位)

Explosion of AI Video Generation Models and Rise of Agentic Web Concept : The AI video generation field is experiencing explosive growth. Following Sora’s technical breakthrough, Runway Gen-3, Luma Dream Machine, Kuaishou Keling, and others have successively launched, significantly reducing video production costs. The market landscape remains unsettled, with domestic manufacturers like ByteDance, Kuaishou, MiniMax, and Aishi Technology showing strong performance. Concurrently, the Agentic Web concept is emerging, proposing a next-generation internet driven by AI agents, where agents will become the primary operators of the Web, automating tasks and signaling a complete restructuring of the internet’s underlying logic. (Source: 36氪, 36氪, 36氪)

AI Glasses Enable ‘Grasping Objects from Afar’ for New Mixed Reality Interaction : Researchers, including Zhejiang University alumni, have proposed Reality Proxy technology, empowering AI glasses with an “air-grabbing” function. Users can select and interact with real-world objects via gestures, greatly enhancing the mixed reality experience. This technology abstracts real objects into digital proxies, supporting browsing, multi-object selection, attribute filtering, and semantic grouping. It is expected to be applied in daily information retrieval, architectural navigation, and drone control, representing a significant advancement in embodied AI and human-computer interaction. (Source: 36氪)

🧰 Tools

Nokia 3210 Features DeepSeek AI: New Feature Phone Experience : HMD has launched a revamped Nokia 3210 feature phone, now with built-in DeepSeek AI. This phone, priced at a low 429 yuan, offers AI voice assistant capabilities with fast and accurate speech recognition, providing concise and humorous replies, even humorously responding to “smash walnuts.” Despite limited AI capabilities, its “good enough” philosophy and user-friendliness for elderly users offer a new approach to AI popularization in low-cost terminals, demonstrating the potential for inclusive AI. (Source: 36氪)

Tencent AI Lab Open-Sources Deep Research Agent Framework Cognitive Kernel-Pro : Tencent AI Lab has open-sourced Cognitive Kernel-Pro, a fully open-source, multi-module, hierarchical deep research agent framework. This framework uses Python code as its action space, minimizing external dependencies, and aims to improve knowledge discovery and problem-solving efficiency. It performs excellently in GAIA benchmarks, approaching paid tool agents, and enhances performance through innovative training methods, providing a reproducible solution for AI agent development and training. (Source: 量子位)

Claude Code Launches Automated Security Review Feature : Anthropic’s Claude Code now features automated security review, allowing users to run security checks directly from the terminal and integrate them into GitHub Actions for automatic review of every new PR. This feature can identify and fix vulnerabilities like SQL injection, XSS, and authentication flaws. Anthropic has internally used it to discover and fix real vulnerabilities, demonstrating AI’s potential in enhancing software development security and efficiency, though community discussions on its trustworthiness persist. (Source: Reddit r/ClaudeAI)

OpenWebUI User Experience Issues : The Reddit community is discussing issues with OpenWebUI running Ollama and LiteLLM in a Proxmox LXC environment, specifically the inability to use tools, functions, and pipeline features, with users seeking successful configurations. Additionally, users are interested in how to hide or expand the Chain-of-Thought (CoT) output of gpt-oss models (run via llama.cpp-server) within OpenWebUI. These issues highlight the challenges in deploying and configuring AI tools in specific virtualization environments and optimizing user experience. (Source: Reddit r/OpenWebUI, Reddit r/OpenWebUI)

Demand for Open-Source, Lightweight, CPU-Friendly Word Alignment AI Model : A Reddit user is seeking an open-source, lightweight, CPU-friendly AI model for language translation that takes source and target language sentences as input and returns an array of word alignment indices, similar to simalign but without its accuracy issues. This reflects developers’ specific needs for model performance, deployment environment, and open-source customizability in certain NLP tasks to achieve efficient language processing in resource-constrained scenarios. (Source: Reddit r/deeplearning)

📚 Learning

LLM ‘Soft Thinking’ Capability and Reasoning Optimization : Research explores the “soft thinking” capability of large reasoning models, finding that LLMs primarily rely on the most influential parts of soft inputs during subsequent decoding, leading to a degradation of “soft thinking” into greedy decoding. By introducing Dirichlet resampling and Gumbel-Softmax techniques, randomness can be effectively introduced to unleash the potential of “soft thinking,” achieving excellent performance across eight reasoning benchmarks and revealing new directions for enhancing LLM reasoning capabilities. (Source: Reddit r/MachineLearning)

Book Recommendation: ‘Mastering Modern Time Series Forecasting’ : “Mastering Modern Time Series Forecasting” continues to rank first in Leanpub’s machine learning, time series, and forecasting categories. The book comprehensively covers classic methods like ARIMA and Prophet, as well as modern ML/DL models such as LightGBM and Transformer, emphasizing Python practice, production deployment, interpretability, and uncertainty quantification. It aims to provide data scientists, ML engineers, and researchers with a resource that balances theory and practice. (Source: Reddit r/deeplearning)

Qwen3’s New Paradigm GSPO: Addressing DeepSeek GRPO Model Collapse Issues : The Qwen team has proposed the GSPO (Group Sequence Policy Optimization) algorithm, aiming to solve the stability issues, particularly model collapse in MoE models, encountered by DeepSeek GRPO (Group Relative Policy Optimization) during large language model training. GSPO significantly reduces variance and eliminates reliance on auxiliary policies by elevating importance sampling from token-level to sequence-level, potentially becoming a new standard for LLM post-training reinforcement learning and crucial for improving model reasoning capabilities. (Source: 36氪, Reddit r/MachineLearning)

Frontier Research in Reinforcement Learning : Recent advancements in reinforcement learning include the HyCodePolicy framework, which enhances the robustness and efficiency of embodied agent manipulation policies through code synthesis, geometric localization, perceptual monitoring, and iterative repair. Sotopia-RL improves LLM social intelligence training effectiveness via discourse-level, multi-dimensional reward design. The EARL model, combining RL and VLM validators, excels in image editing tasks with less training data. Meanwhile, community discussions indicate that Bayesian deep learning methods still face training challenges in achieving SOTA performance, with most successful cases being “Bayesianized” non-Bayesian models. (Source: HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, Reddit r/MachineLearning)

Research on LLM Behavior and Optimization Mechanisms : Multiple studies focus on LLM behavior and optimization. AttnTrace proposes an attention-weight-based context backtracking method for long-context LLMs, enhancing trustworthiness and prompt injection detection. LeanK significantly reduces memory and accelerates decoding through KV cache channel pruning. However, research finds that LLM’s Chain-of-Thought (CoT) reasoning is fragile when exceeding training data distribution, potentially being a “mirage.” The Sculptor framework mitigates interference and improves long-context task reasoning reliability through active context management tools. Web-CogReasoner enhances Web agents’ knowledge content learning and cognitive processes through knowledge-driven Chain-of-Thought reasoning. (Source: HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers)

Progress in Multimodal Models and Generative Technologies : Recent advancements in multimodal AI include UniEgoMotion, which proposes a unified model for first-person human motion reconstruction, prediction, and generation, opening new possibilities for AR/VR applications. AI agents’ purchasing behavior in e-commerce was evaluated, showing model preferences similar to humans but varying in degree. The BLiM framework improves text-to-video retrieval performance by combining query and candidate likelihood. HPSv3 provides a new human preference evaluation standard for text-to-image generation models and optimizes image quality through CoHP. The 3D Occupancy Grounding Benchmark and GroundingOcc model enhance spatial perception capabilities in autonomous driving. Additionally, Gaussian Splatting Diffusion Models achieve high-quality video-to-4D content generation. (Source: HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers, HuggingFace Daily Papers)

💼 Business

Differences in China-US AI Investment and Profit Models : US tech giants Meta, Microsoft, Google, and Amazon are projected to spend up to $400 billion on AI capital expenditure this year, with AI revenue growing rapidly; OpenAI and Anthropic’s annualized revenue is expected to reach $29 billion by year-end. In contrast, China’s AI industry faces commercialization challenges, with slow revenue and profit growth, and some AI innovation products and talent accelerating outflow. This disparity stems from fundamental differences in China-US internet paradigms: the US SaaS model drives AI applications with an “interface” mindset, while China relies on a “portal” mindset, limiting AI commercialization returns and highlighting the critical link between capital investment and business logic. (Source: 36氪, 36氪, 36氪, DeepLearning.AI Blog)

Meitu’s AI Transformation Achieves Profitability and Growth : Meitu Inc. has achieved business transformation through AI technology, with net profit expected to grow significantly in the first half of 2025. It has built the visual large model MiracleVision, an AI open platform, and multiple AIGC products for C/B-sides, driving VIP subscription revenue as its main growth engine and expanding into overseas markets and B-side productivity scenarios. Despite still trailing professional design tools like Figma, Meitu has successfully emerged from years of losses, reshaping its profit model through AI, and forming a strategic partnership with Alibaba to further explore the B-side market, demonstrating AI’s immense potential in empowering traditional enterprises. (Source: 36氪)

AI Companion Robots Emerge as a Hot New Sector : With aging populations and the rise of the single economy, “loneliness” is driving the AI companion robot market, which is projected for rapid global growth. Tech leaders like Lei Jun, Richard Liu, Zhu Xiaohu, and Yu Minhong are entering the field through investments or product launches. The sector features diverse business models, including hardware sales, subscription services, scenario-based solutions, and data monetization, though high return rates and user expectation discrepancies remain challenges. The industry is transitioning from a technology validation phase to rapid commercialization, signaling an explosive growth in emotional tech products. (Source: 36氪)

🌟 Community

ChatGPT Privacy Leak Controversy and User Trust Crisis : Over 70,000 ChatGPT private chat contents were publicly indexed in Google search results due to a design flaw in the “sharing” feature, sparking user privacy concerns and widespread controversy. OpenAI acknowledged the design issue and urgently removed the “discoverable” option, but this incident exacerbated user trust issues regarding AI chat privacy and OpenAI’s data governance, drawing criticism for treating users as “guinea pigs.” The event highlights the importance of clearly informing users about data processing methods in AI product design. (Source: 36氪)

超7万条ChatGPT私聊内容在网上“裸奔”，遭用户怒骂后，OpenAI急忙下架：这不是Bug，而是个“实验”

AI’s Impact on the Job Market and Career Transformation : Microsoft research indicates that AI will replace many human jobs, listing 40 high-risk professions (e.g., interpreters, journalists) and 40 low-risk ones (e.g., surgeons, construction workers). AI is reshaping the developer’s role from “code writer” to “AI manager,” requiring core competencies like AI literacy and agent collaboration. While AI won’t cause complete human unemployment, it will drive structural reshaping of the labor market, necessitating educational system reforms to adapt to the AI era. For example, the Reddit community is also discussing AI’s impact on resumes and hiring. (Source: 36氪, 36氪, Reddit r/ArtificialInteligence)

AI’s Impact on Human Cognition and Mental Health : The Reddit community is discussing whether AI makes humans “dumber,” with MIT research suggesting excessive reliance on ChatGPT may reduce brain activity and impair critical thinking. Concurrently, ChatGPT launched an “anti-addiction mode” to address potential mental health issues from prolonged use, reflecting concerns about AI over-reliance. Elon Musk’s Grok “AI girlfriends” Ani and Valentine sparked ethical controversy, as their emotional companionship model challenges the boundaries between AI tools and emotions, raising alarms about social atomization and emotional manipulation. (Source: Reddit r/ChatGPT, Reddit r/ArtificialInteligence, 量子位, 36氪, 36氪)

Societal Discussion on AI Ethics and Governance : The Reddit community is discussing the necessity of AI governance, with students seeking interviewees for theses on macro, meso, and micro levels of AI governance. Public concern over Duolingo’s “AI-first” policy’s profit model is growing, with worries about environmental damage, job displacement, and weakening human connection, leading to calls for boycotts. Concurrently, discussions on LLM data leak risks emphasize the importance of responsible API usage and local models, urging stronger AI data privacy protection and ethical review. (Source: Reddit r/ArtificialInteligence, Reddit r/artificial, Reddit r/ArtificialInteligence)

Philosophical Reflections on AI’s Impact on Social Structure and Human Meaning : Scholar Zhang Xiaoyu proposes “emergence principle,” “human equivalent,” “algorithmic judgment,” and “civilization contract” as key concepts to understand AI’s comprehensive transformation of society. He believes AI will mass-produce intelligence at extremely low cost, potentially widening social divides to a “species-level,” necessitating “universal basic work” and recommendation algorithms for balanced distribution. AI will become an “impartial third-party judge,” forcing humanity to contemplate justice and the meaning of existence, urging humans to abandon “anthropocentrism” and adapt to the AI era. DeepMind head Demis Hassabis also believes the AI revolution will bring a world of “extreme abundance” but requires addressing resource allocation and unemployment. (Source: 36氪, 36氪, 36氪)

Humor and Reflection in the AI Community : The Reddit community features numerous humorous discussions about AI, such as ChatGPT’s “precise” explanations of professions causing user self-doubt, Claude Opus 4.1’s “kicking out” image when solving problems, and playful jabs at OpenAI’s “open-source” naming and Qwen models’ “personalization.” These discussions reflect users’ lighthearted reflections on AI’s limitations, ethical boundaries, and future development in daily use, as well as a community culture that uses humor to alleviate tech anxiety. (Source: Reddit r/ChatGPT, Reddit r/ClaudeAI, Reddit r/LocalLLaMA, Reddit r/LocalLLaMA, Reddit r/LocalLLaMA, Reddit r/ChatGPT)

Sustainability Challenges of AI Conference Models : A HuggingFace paper argues that the current centralized AI conference model is unsustainable due to rapid expansion, facing scientific (excessive publication rates), environmental (carbon footprint), psychological (negative emotions, mental health issues), and logistical (venue capacity) pressures. The research proposes a “Community Federated Conferences (CFC)” model, separating peer review, presentations, and networking through global coordination and local organization, to achieve more sustainable, inclusive, and resilient AI research development, addressing new challenges brought by the rapid growth of the AI field. (Source: HuggingFace Daily Papers)

💡 Other

Interview with Unitree Robotics Founder Wang Xingxing: Pragmatic Idealism in Embodied AI : An unreleased interview by Vertex Ventures with Unitree Robotics founder Wang Xingxing reveals his profound insights into quadruped/biped robots and AI’s role in embodied AI. Wang Xingxing emphasizes “slow is fast,” insisting on independent R&D of core components, pursuing low-cost and high-performance, and is optimistic about AI’s long-term prospects in robotics. The interview showcases his pragmatic and long-term entrepreneurial philosophy, and his extreme pursuit of technical rationality and product implementation, seen as a microcosm of top entrepreneurs in the era of technological innovation. (Source: 36氪)

New Directions in Finance and Management Education in the AI Era : Shanghai Advanced Institute of Finance (SAIF) at Shanghai Jiao Tong University has upgraded its 2026 EMBA program, for the first time deeply integrating AI technology and legal rules into finance and management education, creating “Finance × AI” and “Finance × Law” specializations. The program launched the “Talent Cultivation Special Scholarship for a Strong Science and Technology Nation,” offering full or half scholarships to outstanding scientific and technological innovation talents, aiming to cultivate interdisciplinary talents with global vision and local insights, and support enterprise development in the AI era. (Source: 量子位)

Tencent 2026 Campus Recruitment Launched, Focusing on AI Product Manager Trainees : Tencent has officially launched its 2026 campus recruitment, opening over 70 types of positions across five major categories including technology, product, and design, to university students worldwide. This recruitment will significantly increase investment in AI-related positions and introduce an “AI Product Manager Trainee” program for top product talents, aiming to attract outstanding young individuals to deeply participate in AI technological transformation and build a talent pipeline for Tencent’s AI business development, demonstrating leading enterprises’ strong demand for and commitment to cultivating AI talent. (Source: 量子位)

🔥 Spotlight

🎯 Trends

🧰 Tools

📚 Learning

💼 Business

🌟 Community

💡 Other

Tag Terkait

Related Posts

Berita AI – 2025-10-30(Edisi malam)

Berita AI – 2025-10-30(Edisi pagi)

Berita AI – 2025-10-29(Edisi pagi)