Lim Jyy Bing
Jyy Bing's story began in analytics, where she mastered the art of creating comprehensive master datasets to map the entire customer journey, from initial profiling to on-page user behavior. This early work quickly pushed her beyond traditional analytics, leading her to pioneer a hands-on approach to DevOps. She took on the challenge of self-hosting her own Airflow instance and learned to configure CI/CD pipelines with YAML, ultimately mastering deployments to Kubernetes.
Today, Jyy Bing applies this unique blend of skills in the green energy sector as a Data Engineer at Ørsted. She manages over 600 datasets, engineering high-performance data pipelines with Python, SQL, and other tools to process sensor data from above and below the sea. Her work is critical to supporting engineers in maintaining the health of wind turbines, a powerful example of data's impact in the real world.
As a passionate advocate for advancing Python’s role in modern data engineering, Jyy Bing is continually exploring tools that promise faster and more reliable data manipulation. This relentless curiosity has led her to the heart of her PyCon talk: a comparative deep dive into Pandas, DuckDB, and Polars. She is excited to share her extensive knowledge, providing attendees with a clear look at their strengths, trade-offs, and real-world performance considerations to empower them to choose the right tool for their next large-scale data workflow.
Session
This session compares three popular Python data processing tools. Pandas, DuckDB, and Polars, focusing on their performance, scalability, and best-use scenarios.