AI agents stuck in pilot mode as banks demand trust, auditability, and on-chain primitives

Banks are keeping AI agents in “pilot mode” because they still do not trust delegated automation in high-stakes finance. Citi is the latest example: on April 22, it launched “Citi Sky,” an AI wealth assistant built with Google Cloud and Google DeepMind. Citi’s wealth tech head, Dipendra Malhotra, said the key limitation is “memory”: how long a client conversation can continue before hallucinations appear, which is an operational risk for advisory and portfolio execution.

Industry adoption is uneven. A Deloitte poll of more than 3,300 finance professionals found that 80.5% expect agentic AI (AI agents and GenAI chatbots) to become standard within five years, but only 13.5% say their organizations already use AI agents. McKinsey estimates that 50% to 60% of bank operations are within reach of AI agents, yet experts warn of “pilot purgatory,” where proofs of concept run without changing operating models.

Ethereum offers one possible infrastructure layer. The draft standards ERC-8004 and ERC-8183 aim to add on-chain primitives for agent identity, reputation, validation, and escrow/evaluator attestation. The goal is to support verifiable delegation: who the agent is, what it did, and how jobs are funded and completed.

Key questions remain unresolved: who is responsible for losses caused by AI agents, whether reputation can be trusted (agents can inflate signals at machine speed), who has control at scale, and what regulatory framework applies when an agent acts outside its scope. The article frames these issues as the central barrier to scaling AI agents in finance.
Neutral
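This news is broadly neutral: it is not a direct driver of crypto prices but a discussion of financial institutions' trust in AI agents and of the availability of supporting infrastructure, in particular Ethereum on-chain standards. In the short term, traders are more likely to see narrative-level attention (AI combined with on-chain identity, escrow, and reputation mechanisms) than a clear event that immediately shifts capital flows.

In terms of market mechanics, a "pilot stage" like this is common early in a new technology's rollout: while regulation, auditing, and liability rules are not yet systematic, markets tend to price in concept hype first, then wait for verifiable products and institutional progress before repricing. The issues the article raises, including "pilot purgatory," liability, and reputation that can be manipulated, suggest that scaled deployment may be slower than expected, which would limit how long short-term sentiment can keep strengthening.

Over the long term, if standards such as ERC-8004 and ERC-8183 do advance on-chain identity, escrow, and verifiable execution, they could strengthen the trust framework for auditable delegation and support adoption expectations for related infrastructure. In the short term, however, regulatory frameworks and the restructuring of enterprise operating models still need to progress in parallel. For the overall market, this is therefore a narrative update with both risks and opportunities, not a one-directional bullish or bearish signal.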