AirAsia — Data Engineer
August 2022 – April 2026. Joined as Data Engineer I, promoted to Data Engineer II in August 2024. 3.5 years building ETL pipelines, LLM tooling, and operational dashboards.
Python SQL BigQuery Postgres Airflow GCP Tableau LLM APIs (OpenAI / Gemini)
I joined AirAsia as a Data Engineer and have spent the time building and scheduling ETL pipelines that move data between operational systems, vendor sources, and our data warehouse — and increasingly, owning end-to-end projects that put that data in front of the people who make daily operational decisions.
🛠 Selected projects
-
Cabin Crew Operational App. Integrated data from Google Sheets and BigQuery into a centralised application serving 5000+ flight crew members — putting key performance metrics directly in the hands of the people they describe. Built the multi-source ingestion pipelines (→ Postgres) and the Airflow DAGs that keep them current.
Python BigQuery Postgres Google Sheets Airflow
-
Automated Training Tracker. Built an end-to-end workflow that monitors compliance and training schedules for 60+ operations staff and automatically emails escalations when training is overdue. Eliminated the manual chase entirely.
Python BigQuery Email automation Airflow
-
Data Democratisation via LLM Tools. Spearheaded internal LLM tooling that integrates 6+ core operational modules (ancillary sales, aircraft availability, etc.) so non-technical staff can self-serve analytics. Built the supporting pipelines and AppScript automation in Google Sheets.
Python LLM Google Sheets AppScript Airflow
-
Aircraft On Ground (AOG) Recovery Foundation. Built the data foundation feeding clean datasets into the operations research team’s heuristic recovery models — directly supporting how AirAsia mitigates critical operational disruptions.
SQL ETL Airflow
-
NPS Dashboard. Built the ETL pipeline behind the Net Promoter Score dashboard the customer happiness team uses to track passenger sentiment. Used the Google Translation API for multi-language reviews and Gemini for sentiment + keyword extraction (powering the dashboard’s word cloud).
Python Google Translation API Gemini API Airflow
-
Ground Operations Dashboards & Bot. Built an automation that logs into Tableau, screenshots configured charts (e.g. flight load factor), and emails them to stakeholders on a daily cadence for operational decision-making.
Python Tableau Email automation Airflow
-
Automated Crew Bag Tag. Designed the SQL filtering logic — coordinating with the data science and data analytics teams — that the software engineering team consumed to automate the layover check-in process for crew.
SQL Cross-team
-
Navitaire Logging Pipeline. ETL pipeline that extracts data from BigQuery and lands it in GCS for audit purposes.
Python BigQuery GCS Airflow
-
Disaster Recovery Exercise. Annual DR participation for a major operational platform — validating recovery procedures across the data pipelines and dependent systems.
DR Operations
- Google Sheets → BigQuery ingestion with Python ETL on a schedule.
- Database → Warehouse pipelines — Postgres, MS SQL, and other operational databases into the data warehouse.
- Stored Procedure → BigQuery — execute SP on source, transfer result.
- SFTP ingestion + reverse exports — pick up files, transform, load to BigQuery; reverse direction (BigQuery → SFTP CSV) for downstream consumers.
- Airflow scheduling and monitoring for all of the above.
- LLM experimentation — exploring how ChatGPT and Gemini speed up ETL development (boilerplate, schema debugging, doc drafting).
🤝 Other contributions
- Mentored new team joiners. Helped them ramp on the codebase, internal tooling, and team conventions — pairing on early pipelines and reviewing first PRs so they could ship confidently from day one.
- Led 10+ knowledge-sharing sessions across the department, covering pipeline patterns, tooling decisions, and lessons learned from completed projects — turning individual learning into shared team capability.
- Documented every shipped project on Confluence as a default practice, not an afterthought — the goal being that anyone joining six months later can pick up the context without needing me in the room.
- Co-organised department townhalls — helping shape the agenda, coordinate logistics, and curate content so each session genuinely served the team rather than ticking a calendar box.
Outside of work I also build personal projects to go deeper on the tooling — see Projects.