Summary
Overview
Work History
Education
Skills
Certification
Timeline
Accomplishments
Personal Strengths
AccountManager

Chih Han Yu

Data Engineering, Software Engineering
Taichung District

Summary

A data engineer, fast-learner and self-motivated with comprehensive problem-solving abilities. Providing data as products, design and build-up large-scale automation data processing system for AI and data science usage.

Overview

8
8
years of professional experience
5
5
years of post-secondary education
3
3
Certificates
3
3
Languages

Work History

Senior Data Engineer / Data Engineer

Micron
Taichung
12.2017 - Current

Global Data Warehouse Automated Ingestion Platform (Sep. 2019 to Current)

  • Web platform can automatically deploy batch "EL" pipelines on Apache NiFi, ingest data from RDBMS/File to GCP BigQuery, Snowflake and Hive.
  • Performed both backend services (Java) and frontend development (Angular) for features to support multiple sources and target types.
  • Managed ~9000 ETLs built by this platform to Snowflake/GCP/Hive which processes over than terabytes of data per day across global facilities.
  • Designed with detailed SA/SD documentation, CICD, solid code review, version control and change management process.
  • Performed training and demo documents and sessions for better user coworking and communication.
  • Additional Roles: Scrum Master / Operation Chief / Team Coach

Global Data Warehouse Migration - GCP Migration / Hadoop Retirement (Dec. 2020 to Current)

  • Designed data landing structure and peformed POC to adapt new platform (GCP) for GDW Automated Ingestion Platform.
  • Redesigned ETLs for Manufacturing Photo image calculation with Spark in Scala, Cloud Composer and Cloud DataProc.

Global Data Warehouse Migration - Teradata Retirement (Mar. 2019 to Aug. 2019)

  • Performed existing ETLs migration from Teradata to Snowflake with no impact.

Global Manufacturing SCADA/IoT Data Ingestion Project (Dec. 2017 to Aug. 2019)

  • Designed real-time ETLs on Apache NiFi for handling large volume of sensors data to Hadoop Hive.
  • Performed requirements clarification, ETL refactor, performance tuning and operation maintenance.

Data Engineer

TripleOneTech
Taipei
05.2017 - 12.2017

Gaming Log Collection Platform

  • Designed and built-up ELK Stack (Elasticsearch / Logstash / Kibana) environment within 1 month.
  • Designed and build-up log collecting system from RDBMS to Elasticsearch by Logstash.
  • Open Source Project Contribution: Apache Spark (Bugfix), Elastalert (ElasticSearch 3rd Party Module Bugfix)

Gaming Log Analysis Platform

  • Designed and developed ETLs to process, transfer and clean data with Python.
  • Built-up data visualization dashboard with Grafana and Kibana for developer team and company business customer.

Software Engineer

Inventec Appliances Corp.
Taipei
10.2016 - 05.2017

Manufacturing Big Data Platform Project

  • Designed and built-up Big Data Platform POC environment in on-prem machine alone within 2 weeks, including Hadoop ecosystem and SMACK ecosystem (Spark, Mesos, Akka, Cassandra, Kafka).
  • Designed and developed ETL for manufacturing data preprocess and analysis by Spark Streaming, Kafka and Cassandra.

IoT Data Acquisition Project

  • Developed Web RESTful API with Spring Boot and Cassandra.

IT Engineer

TSMC
Tainan
08.2014 - 09.2016

300mm Fab In-line Quality Control MES System Administrator

  • Managed Quality Control systems, including server, databases, ETLs and web application.
  • User requirements coordinator between Fab users and system developers.
  • Executed fast troubleshooting, server support, fast recovery in high pressure environment urgently.
  • Maintained service release schedule for MES related system.
  • Performed scheduled MES system work, testing and repairs.
  • Offered automation tools for fast recovery archiving data to support Customer complaint event, processing speed is 24 times faster than before.

Education

MBA - Information Management

National Central University
Taiwan
09.2012 - 07.2014

BBA - Information Management

Chang Gung University
Taiwan
09.2008 - 02.2012

Skills

Data Engineering

undefined

Certification

Google Cloud Certified - Professional Data Engineer

Timeline

Google Cloud Certified - Professional Data Engineer

11-2021

Senior Data Engineer / Data Engineer

Micron
12.2017 - Current

Data Engineer

TripleOneTech
05.2017 - 12.2017

Software Engineer

Inventec Appliances Corp.
10.2016 - 05.2017

IT Engineer

TSMC
08.2014 - 09.2016

MBA - Information Management

National Central University
09.2012 - 07.2014

Oracle Database SQL Certified Expert, OCE

02-2011

Sun Certified Java Programmer, SCJP

08-2010

BBA - Information Management

Chang Gung University
09.2008 - 02.2012

Accomplishments

Top 1 in 2022 Enablement Brainstorming Competition.

  • Department Internal Brainstorming competition, encourage members to promote their ideas with POC results to improve current system.
  • Topic: Centralized Schema Management by GCP Data Catalog

Personal Strengths

  • Fast learner and self-motivated.
  • Comprehensive problem-solving Abilities.
  • Motivated to work quickly and efficiently. “Work smart rather than hard!”
  • Willing to work effectively in a Team.
  • Aggressive learning in new technologies.
Chih Han YuData Engineering, Software Engineering