A separate undertaking, Agent Evals, was introduced to allow the dependable transport of brokers. This undertaking was born out of inside expertise the place brokers have been discovered to be non-deterministic, creating a powerful want for reliability and confidence. Agent Evals gives tooling to benchmark brokers by leveraging open requirements like OpenTelemetry. It collects real-time metrics and tracing because the agent runs to attain efficiency and inference high quality, producing a report that helps customers perceive their agent’s reliability. This evaluation is essential for figuring out the extent of human intervention required, whether or not absolutely autonomous, human-in-the-loop, or human-outer-loop. Agent Evals works together with different observability instruments that assist OpenTelemetry requirements.
Shifting past particular person developer laptops into full manufacturing requires strong safety and governance. Solo is addressing this by fixing issues resembling securing agent communication with LLMs and MCP instruments. The Agent Gateway gives a vital answer, providing centralized coverage, enforcement, safety, and observability for visitors. This contains “context layer enforcement,” which may be configured to place guardrails on responses, as an illustration, stripping out delicate information like bank card or checking account numbers as visitors travels by way of the gateway. Moreover, Agent Gateway is being built-in into Istio as an experimental information aircraft possibility in Istio Ambient mode, serving to mediate agent visitors with out requiring modifications to the brokers or MCP instruments themselves.
Collectively, these instruments—Agent Registry for governance, Agent Evals for reliability, and Agent Gateway for safety—are filling within the puzzles wanted to run agentic AI in manufacturing with confidence. Nevertheless, for vital work, human involvement stays a vital part, because the philosophy suggests viewing the agent like a rising co-worker that also advantages from supervision and peer assessment.
“I’m all the time occupied with the agent as like an individual,” Lin advised SD Occasions. “Even together with your coworker, you don’t all the time belief their work. You want a peer assessment of the work, to iterate and make it higher. So, at this stage of the agent, possibly it’s extra like from toddler to kindergarten. It’s rising, proper? However even when the agent turns into an grownup, like my son simply turned 18, you continue to must sort of supervise somewhat little bit of offering some insights.”







![[Webinar] Eradicate Ghost Identities Earlier than They Expose Your Enterprise Knowledge](https://techtrendfeed.com/wp-content/uploads/2026/04/ghost-120x86.jpg)