Experienced in data engineering, data warehousing, and data modeling, specializing in optimizing Spark ETL and building end-to-end pipelines for batch processing and CDC. Skilled in driving efficient data processing, storage, and retrieval, with a focus on resolving performance bottlenecks for seamless integration. Dedicated to delivering solutions that enable data-driven decision-making.
Programming: Proficient in Python for data engineering and automation
Big Data Batch Processing: Expertise in Spark for large-scale processing and analytics for historical records
CDC & ETL: Experience with Debezium integrated Kafka for real-time data ingestion and dbt for data transformation
Data Warehousing & Modeling: Strong in data warehouse architecture design, and data modeling for high data quality service
Query Optimization: Advanced SQL skills, with experience in ClickHouse for high-performance and queries
Containerization: Skilled in Docker for application deployment in Linux environments
Collaboration & Team Leadership: Worked closely with stakeholders to align data strategies with business goals, and contributed to the expansion and upskilling of the data analytics team