PyConMY 2025

PyConMY 2025

kalyan

Kalyan is a Lead Data and AI scientist with a background as a former data science and analytics manager, effectively balancing both academia and industry. He has presented talks at various PyCon's, Data Science and AI conferences, showcasing his expertise. As a community leader, Kalyan currently serves as the one of the Chair for PyConf Hyderabad 2025 and has held the role of Co-chair for PyCon India 2023 and PyConf Hyderabad from 2022 to 2024. In addition to these leadership positions, he is an active contributor to numerous Python, data science, and scientific communities worldwide.


Session

11-02
09:00
45min
How I Use Evals to Keep My AI Apps From Falling Apart
kalyan

As Large Language Model apps become more powerful and widely used, one challenge keeps surfacing: how do we know if they’re actually working well? LLM apps often fail silently with hallucinations, bugs, and inconsistent outputs going unnoticed. Manual reviews don’t scale. This talk introduces a practical 3-part evaluation framework using code checks, golden sets, and LLMs as judges to catch failures early, improve output quality, and help you build more reliable AI applications.

Hall 1