I'm currently a Data Engineer at Sea (NYSE: SE). My interest lies in building scalable data engineering solutions and implementing data-driven solutions to complex business problems. Through my various experiences, I have had the chance to demonstrate my ability to optimize large-scale data infrastructure to support real-time and batch processing at scale, work with cross-functional teams, and present key business insights to stakeholders. Open to global opportunities across high-impact data teams.
You can click here to find out about the latest happenings in my professional life.
Age: 27
Currently In: Singapore
I speak: English (Native)
Chinese (Bilingual)
Japanese (JLPT N3, working towards N2)
Contact Me: seansljh@gmail.com
Part of Corporate Data Team - Data Platform Group
• Led a cross-team PySpark ETL optimization project, improving processing speed by 50% and meeting all milestones
• Architected and executed the deprecation of Apache Sqoop to Spark JDBC, orchestrating efforts across 8 members. This improved data throughput, reduced maintenance overhead, and improved platform reliability
• Developed a Flink Java Framework to scale 18 real-time CDC jobs, enabling seamless data ingestion and warehouse expansion
Part of Corporate Data Team - Data Platform Group
• Optimising the ETL jobs of over 100 daily tables through code and log level analysis, resulting in a 25 to 50% improvement in run time
• Collaborating with upstream sources and downstream users to ensure key business requirements are met when handling ingestion requests from sources such as Kafka, REST APIs and MySQL
• Pushing for greater data service reliability by defining 2 Service Level Agreements with data producers and end users, implementing new file delivery monitoring mechanisms and escalation procedures
• Liaised with multiple teams to design the Kafka message protocol of a self-service ingestion tool built from cross-functional collaboration and technologies, enabling new feature implementation
• Initiate and assist with internal process improvements (setting documentation standards, performing code reviews, the addition of code comments to legacy scripts, coding standards, etc)
Part of Product Monetisation Team
• Built a dashboard for 25 stakeholders to monitor AI chatbot performance, eliminating all manual data pulling efforts
• Reduced time taken to generate reports by up to 90% through writing Python scripts to process customer ticket data from 3 sources
Part of Technology Solutions Team
• Interpreted software usage statistics & explained to 40 members of diverse teams (up to and including Director level) how the insights can influence product road-maps
• Deeply unpacked the workflow of different users (Portfolio Managers, Traders, Data Specialists, etc) and used this understanding to deliver 5 user specific software demonstrations to great response
Part of Corporate Labs Data Team
• Developed a dynamic set of timezone transformation functions that have been successfully integrated into over 150 daily ETL pipelines, thus preventing the transformation of timestamp columns via hardcoded methods
• Liaised with 5 stakeholders and effectively reflected their business requirements to 60 Data Models & Reports that are being used by over 200 local finance users for financial reconciliation purposes
• Fixed and refactored historical data & table ingestion checker scripts, which allowed colleagues to have a smoother daily batch job monitoring process
Part of Grab Financial Group Data & Insights Team
• Led deep-dive on data to propose fresh growth sources for GrabPay and presented the actionable business insights to numerous cross-functional teams (50+ stakeholders, including management level)
• Suggested, conceptualized and created a new market trends Tableau Dashboard used by over 80 users for competitor analysis, market size estimation viral web search tracking
• Repaired Tableau Dashboards by identifying discrepancies in Data Tables and SQL scripts, allowing rewards vertical to make more accurate business decisions
Part of Fraud Detection Team
• Established Google Data Studio dashboards by building data pipelines from multiple sources to monitor NLP model performance in 8 markets
• Investigated and scoped potential projects through Proof-of-Concepts (POC) and Statistical Analysis (t-test, ANOVA test, etc), saving resources by disproving projects with lower potential
Part of Brand Analytics Team
• Automated and optimized sales report generation using Python, Spark and Bash scripts, reducing generation time by 80%
• Improved C2C sales estimation accuracy by up to 50% through writing Python scripts to quantify keyword search volume
• Leveraged on insights derived from data analysis and cleaning to devise data-driven recommendations to brands
For BT1101 Introduction to Business Analytics (Taught in R)
• Graded 80 R-Script submissions on a weekly basis (Regression, Hypothesis Testing, Clustering, Time Series Analysis, etc)
• Conducted weekly coaching sessions and lecture hours support, helping to clarify doubts regarding module material
Part of Applications Support Team
• Reduced monthly financial report generation time by 4 hours through writing Excel VBA scripts
• Corresponded with 7 vendors for bank’s Digital Transformation Project, ensuring accurate communication of information
Double Degree Programme - BSc (Hons) Business Analytics & B.A. Economics
GPA: 4.88/5.0 (BSc Hons), 5.0/5.0 (B.A.)
Awarded TM Asia Life Medal & Prize for coming in as the Top Student in the graduating BSc Business Analytics Cohort
You can click here to find the coursework I've completed!
If you would like to find out more about me, hit me up for coffee, or would like to share with me something that your are incredibly passionate about, please do feel free to contact me via the various means below!