
Alibaba pushes robotics forward with open-source RynnBrain foundation model
Recommended for you

Alibaba, ByteDance and Kuaishou Unveil Next-Gen Robotics and Video AI
Chinese technology leaders released distinct AI models this week: Alibaba introduced a robotics-focused model for real-world object interaction, ByteDance launched an improved text-to-video generator, and Kuaishou rolled out a paywalled video model with longer outputs. These releases sharpen competition with Western labs on robotics, video synthesis, and agentic capabilities while raising consent and commercialisation questions.

Intrinsic pushes AI-driven robotics to reshape manufacturing
Intrinsic, led by CEO Wendy Tan White, is advancing adaptable, software-first robotics control and has partnered with Foxconn to pilot real factory deployments. The move reflects a broader industry inflection—driven by advances in simulation, compute and orchestration—that favors modular, updatable robotics platforms and could enable partial reshoring for higher-wage regions if integration, standards and workforce retraining keep pace.

Inside Physical Intelligence: Betting patient capital on general-purpose robot brains
A San Francisco startup led by Lachy Groom and academic founders is training general-purpose robotic foundation models using inexpensive arms and diverse real-world data rather than chasing immediate commercial deployments. Its research-first, compute-heavy strategy sits against an industry pivot toward rapid commercialization and infrastructure concentration, creating both a potential long-term advantage if models generalize and a near-term risk that revenue-led competitors entrench customers and data flywheels.

Agile Robots signs research pact with Google DeepMind to embed Gemini models
Agile Robots has sealed a multi-year research pact with Google DeepMind to integrate Gemini Robotics models into industrial robots and to share field telemetry with the model owner. The deal coincides with Google's consolidation of robotics software (Flowstate/Intrinsic) into its central Cloud and AI organization — a move that eases model serving and commercial bundling but heightens contractual questions about telemetry, model updates and enterprise controls.

Nvidia unveils DreamDojo — a robot world model trained on 44,000 hours of human video
Nvidia and academic partners released DreamDojo, a two-stage world model trained on 44,000 hours of egocentric human video to teach robots physical interaction via observation and targeted post-training. The system delivers real-time, action-conditioned simulation at roughly 10 frames per second and aims to shrink the data and cost barriers for deploying humanoid robots in messy real-world settings.

ABB accelerates robot training with NVIDIA simulation libraries
ABB and NVIDIA are integrating high-fidelity simulation to narrow the gap in robot behavior between digital training and factory floors, with Foxconn piloting camera-guided assembly and a planned product launch in H2 2026. The move sits inside a broader industry shift: Alphabet’s Intrinsic is also piloting Foxconn collaborations but emphasizes continuous, field-driven adaptation, highlighting two competing strategies for production-ready robotics.

Hark Rewires Consumer AI with Model–Hardware Stack
Hark, backed by $100M from founder Brett Adcock, is building tightly coupled multimodal models and custom interfaces to push consumer-grade persistent intelligence. The startup plans a GPU ramp in April and has hired design lead Abidur Chowdhury, signaling a bet on productized AI beyond apps, though that timetable is exposed to industry-wide memory, DRAM and allocation constraints that could affect April capacity targets.

Alibaba Qwen3.5: frontier-level reasoning with far lower inference cost
Alibaba’s open-weight Qwen3.5-397B-A17B blends a sparse-expert architecture and multi-token prediction to deliver large-context, multimodal reasoning at sharply lower runtime cost and latency. The release, published under the permissive Apache 2.0 license along with hosted options offering up to a 1M-token window, pushes enterprises to weigh on-prem self-hosting, in-region hosting, and new procurement trade-offs around cost, sovereignty and operational maturity.