
Apple rebuilds Siri on a 1.2-trillion-parameter Gemini model and opens iOS 27 to third-party AI defaults
Apple has rebuilt Siri from scratch on a custom Google Gemini model with 1.2 trillion parameters, at roughly $1B per year. The heaviest queries are routed to Google Cloud on Nvidia Blackwell B200s, while simpler requests are handled on-device or through Private Cloud Compute, delivering the AI overhaul Apple promised two years ago. iOS 27 also lets users designate Claude, ChatGPT, or Grok as the default AI for Siri and Writing Tools, opening Apple's billion-device installed base to third-party providers for the first time. Tim Cook steps down as CEO on September 1, handing the role to hardware chief John Ternus.
techtimes.com →
NVIDIA and SK hynix commit to co-develop memory for Vera Rubin, RTX Spark, and Jetson Thor
NVIDIA and SK hynix signed a multiyear agreement to co-develop advanced memory for Vera Rubin AI supercomputers, RTX Spark PCs, and Jetson Thor robotics platforms, securing supply at the design stage across three distinct NVIDIA markets, not just at procurement. The deal landed during Jensen Huang's Seoul visit, which also brought four more Korean partnerships (Naver's 55 MW sovereign AI data center, SK Telecom's gigawatt-scale cloud by 2027, LG humanoids, and Hyundai autonomous manufacturing), marking Korea as a central node in NVIDIA's global AI infrastructure buildout.
nvidianews.nvidia.com →
OpenAI says 'chat is dead' as it rebuilds ChatGPT into an agent superapp
OpenAI is merging ChatGPT, Codex, and partner integrations (Canva, Booking) into a superapp in the biggest overhaul since the 2022 launch, with CPO Thibault Sottiaux describing the target as a personal agent handling tasks across work and life. Codex has grown sixfold since February to 5 million weekly users, and business tools now account for 40% of revenue against a target of 50% by year-end, which marks this as product consolidation already underway rather than a roadmap promise.
the-decoder.com →
Perplexity's Search as Code lets agents write custom search scripts, cutting tokens 85% on a CVE research task
Perplexity's Search as Code (SaC) replaces fixed search API calls with agent-generated Python scripts that run in a sandbox. On an internal task tracking 200 critical CVEs, token usage fell 85%, and the approach outscored Anthropic and OpenAI systems on four of five self-reported benchmarks. Agents on Perplexity's Agent API or Perplexity Computer can now write custom retrieve-filter-deduplicate-rerank pipelines, running parallel queries and scripting their own filters instead of accepting whatever a fixed API returns.
the-decoder.com →
Cognition ships Devin Desktop with ACP, an open protocol for coordinating AI coding agents
Devin Desktop wraps Windsurf with a multi-agent dashboard. ACP, an open protocol, lets Codex, Claude Agent, OpenCode, and custom-built agents share context in one workspace via project-grouped 'Spaces' that persist across pull requests. Engineering teams running multiple AI coding tools can consolidate them under one interface today; Cognition has not published pricing, deployment terms, or independent benchmarks for the claimed 30% efficiency gains in the rebuilt Devin Local.
techedt.com →
SpaceX signs $920M/month Google deal for 110,000 Nvidia chips ahead of its $1.75T IPO
SpaceX signed a $920M/month compute deal with Google for ~110,000 Nvidia chips through June 2029. The capacity was originally built by Musk for xAI, which lagged, and is now leased to a second hyperscaler alongside SpaceX's existing $1.25B/month Anthropic contract. The two-tenant structure converts a stranded infrastructure bet into locked-in hyperscaler revenue, and both deals surface in SpaceX's IPO S-1 ahead of its June 12 Nasdaq debut at a $1.75T target valuation.
the-decoder.com →
Uber opens UK interest list for Wayve robotaxis as Waymo tests 100 vehicles across London
Uber opened a UK interest list for Wayve autonomous rides: Ford Mustang Mach-Es with human safety operators, at standard fares, launching in the coming months pending regulatory approval. Waymo, meanwhile, is testing roughly 100 Jaguar I-PACEs across 100 square miles of London, making this the first city where Uber's two AV relationships compete directly. The $300M tranche of Wayve's $1.5B raise, contingent on London deployment, reframes Uber as an equity stakeholder whose payout depends on Wayve taking ground Waymo is already testing.
techcrunch.com →