Aditya Bansal

Machine Learning Engineer

Adobe

Aditya Bansal is a Machine Learning Engineer at Adobe, where he builds production AI systems powering creative and enterprise experiences, spanning predictive modeling, LLM-based agentic workflows, and scalable ML pipelines. His work bridges applied research and real-world deployment, with publications at top ML venues and conferences. With experience across AI safety and large-scale infrastructure, Aditya is driven by the challenge of making AI systems both reliable and equitable at scale.

20 May 2026 12:15 - 12:45

From data to deployment: Evaluating enterprise AI in production

Enterprise AI systems built on large language models are rapidly evolving beyond static question-answering into workflows involving retrieval, tool use, and multi-step agentic behavior. Deploying these systems reliably remains challenging, because performance depends heavily on data quality, context construction, and how the systems behave in production. This session presents a practical framework for building reliable enterprise AI systems using operational data signals and structured evaluation. By decomposing quality into retrieval relevance, generation faithfulness, and agent-level behavior, rather than collapsing it into a single metric, teams can turn operational data into a foundation for continuous improvement. The session draws on recent advances in evaluating RAG systems and agentic workflows to illustrate both the promise and the limitations of automated evaluation pipelines. It also explores how production signals can be systematically fed back through targeted test sets and human calibration to keep systems reliable as models and data change over time.