AirAsia — Data Engineer
AirAsia — Data Engineer
Since August 2022. Started as Data Engineer I, promoted to Data Engineer II in August 2024.
Python SQL BigQuery Postgres Airflow GCP Tableau LLM APIs (OpenAI / Gemini)
I joined AirAsia as a Data Engineer and have spent the time building and scheduling ETL pipelines that move data between operational systems, vendor sources, and our data warehouse — and increasingly, owning end-to-end projects that put that data in front of the people who make daily operational decisions.
🛠 Selected projects
-
Cabin Crew Operational App. Integrated data from Google Sheets and BigQuery into a centralised application serving 5000+ flight crew members — putting key performance metrics directly in the hands of the people they describe. Built the multi-source ingestion pipelines (→ Postgres) and the Airflow DAGs that keep them current.
Python BigQuery Postgres Google Sheets Airflow -
Automated Training Tracker. Built an end-to-end workflow that monitors compliance and training schedules for 60+ operations staff and automatically emails escalations when training is overdue. Eliminated the manual chase entirely.
Python BigQuery Email automation Airflow -
Data Democratisation via LLM Tools. Spearheaded internal LLM tooling that integrates 6+ core operational modules (ancillary sales, aircraft availability, etc.) so non-technical staff can self-serve analytics. Built the supporting pipelines and AppScript automation in Google Sheets.
Python LLM Google Sheets AppScript Airflow -
Aircraft On Ground (AOG) Recovery Foundation. Built the data foundation feeding clean datasets into the operations research team’s heuristic recovery models — directly supporting how AirAsia mitigates critical operational disruptions.
SQL ETL Airflow -
NPS Dashboard. Built the ETL pipeline behind the Net Promoter Score dashboard the customer happiness team uses to track passenger sentiment. Used the Google Translation API for multi-language reviews and Gemini for sentiment + keyword extraction (powering the dashboard’s word cloud).
Python Google Translation API Gemini API Airflow -
Ground Operations Dashboards & Bot. Built an automation that logs into Tableau, screenshots configured charts (e.g. flight load factor), and emails them to stakeholders on a daily cadence for operational decision-making.
Python Tableau Email automation Airflow -
Automated Crew Bag Tag. Designed the SQL filtering logic — coordinating with the data science and data analytics teams — that the software engineering team consumed to automate the layover check-in process for crew.
SQL Cross-team -
Navitaire Logging Pipeline. ETL pipeline that extracts data from BigQuery and lands it in GCS for audit purposes.
Python BigQuery GCS Airflow -
Disaster Recovery Exercise. Annual DR participation for a major operational platform — validating recovery procedures across the data pipelines and dependent systems.
DR Operations
⚙️ Core platform work
- Google Sheets → BigQuery ingestion with Python ETL on a schedule.
- Database → Warehouse pipelines — Postgres, MS SQL, and other operational databases into the data warehouse.
- Stored Procedure → BigQuery — execute SP on source, transfer result.
- SFTP ingestion + reverse exports — pick up files, transform, load to BigQuery; reverse direction (BigQuery → SFTP CSV) for downstream consumers.
- Airflow scheduling and monitoring for all of the above.
- LLM experimentation — exploring how ChatGPT and Gemini speed up ETL development (boilerplate, schema debugging, doc drafting).
🤝 Other contributions
- Mentored new team joiners. Helped them ramp on the codebase, internal tooling, and team conventions — pairing on early pipelines and reviewing first PRs so they could ship confidently from day one.
- Led 10+ knowledge-sharing sessions across the department, covering pipeline patterns, tooling decisions, and lessons learned from completed projects — turning individual learning into shared team capability.
- Documented every shipped project on Confluence as a default practice, not an afterthought — the goal being that anyone joining six months later can pick up the context without needing me in the room.
- Co-organised department townhalls — helping shape the agenda, coordinate logistics, and curate content so each session genuinely served the team rather than ticking a calendar box.
Outside of work I also build personal projects to go deeper on the tooling — see Projects.