Using Python on SPRM Offenders Data
08-25, 13:40–14:20 (Asia/Kuala_Lumpur), JC 3

SPRM (the Malaysian Anti-Corruption Commission) lists offenders on its website. The database is for public use, particularly to help others conduct background checks. The data includes the offenders' images and other details: personal information, a summary of the offense, the penalty, and employer information.

Using a crawler written in Python, we converted the data into a machine-readable format that will be made publicly available. This dataset alone supports many use cases: contributing to the OpenSanctions database, matching names against other persons-of-interest databases such as CIDB and ICIJ, and using the images for facial recognition in security checks or other applications.
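
As a rough illustration of the two steps described above (turning listing pages into machine-readable records, then matching names against another list), here is a minimal sketch using only the Python standard library. It is not the team's actual crawler: the HTML markup, the class names, the field names, the sample people, and the 0.85 similarity threshold are all assumptions made up for this example, and the real SPRM page structure may look quite different.

```python
import json
from difflib import SequenceMatcher
from html.parser import HTMLParser

# Hypothetical markup and fictional people; the real page layout is not
# described in this abstract, so treat every tag and class as an assumption.
SAMPLE_HTML = """
<div class="offender">
  <span class="name">Example Person One</span>
  <span class="offense">Accepting a bribe</span>
  <span class="penalty">Fine</span>
  <span class="employer">Example Agency</span>
</div>
<div class="offender">
  <span class="name">Sample Individual Two</span>
  <span class="offense">Offering a bribe</span>
  <span class="penalty">Fine</span>
  <span class="employer">Example Company</span>
</div>
"""

FIELDS = {"name", "offense", "penalty", "employer"}


class OffenderParser(HTMLParser):
    """Collects one dict per <div class="offender"> block."""

    def __init__(self):
        super().__init__()
        self.records = []
        self._field = None  # field currently being read, if any

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "div" and css_class == "offender":
            self.records.append({})
        elif tag == "span" and css_class in FIELDS and self.records:
            self._field = css_class
            self.records[-1][css_class] = ""

    def handle_data(self, data):
        # Only keep text that sits inside a recognised field.
        if self._field is not None:
            self.records[-1][self._field] += data

    def handle_endtag(self, tag):
        if tag == "span" and self._field is not None:
            record = self.records[-1]
            record[self._field] = record[self._field].strip()
            self._field = None


def normalize(name):
    """Lowercase a name and collapse repeated whitespace."""
    return " ".join(name.lower().split())


def match_names(name, candidates, threshold=0.85):
    """Return candidates whose normalized form is similar to `name`."""
    target = normalize(name)
    return [
        candidate
        for candidate in candidates
        if SequenceMatcher(None, target, normalize(candidate)).ratio() >= threshold
    ]


parser = OffenderParser()
parser.feed(SAMPLE_HTML)
records = parser.records

# Machine-readable output: one JSON object per offender.
print(json.dumps(records, indent=2))

# Fuzzy lookup of a scraped name in another (hypothetical) list of names.
other_list = ["EXAMPLE  PERSON ONE", "Someone Else"]
print(match_names(records[0]["name"], other_list))
```

Character-level similarity is only a starting point. Malay, Chinese, and Indian naming conventions (patronymics such as "bin" and "binti", honorifics, varying romanization) mean that real matching would likely need name-specific normalization and manual review of any hits.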


Outline of presentation

  • Introduction
  • How the team crawled and cleaned the data
  • Key insights
  • Recommendations of use cases
  • Conclusion

Siti Nurliza is a Technologist and Data Analyst at Sinar Project, a civic tech organization in Malaysia that works on open data, open government, and digital rights.

Aissatou is an IT Business Analyst and Project Management graduate, currently interning with Sinar Project. With strong analytical skills, she works on initiatives that enhance transparency and accountability. She is passionate about impactful projects and continuous professional growth.

Trinidad, originally from Chile, moved to Canada as a teenager. She recently graduated from McGill University in International Management and is now interning with Sinar Project. She has a keen interest in international cooperation, public policy, sustainability, and languages.