Khaqan Azam Expert in SQL Databases, Python, Data Engineer
9 Reviews

Senior Data Engineer in Saudi Arabia | AI & Data Expert | Python, SQL, Big Data, Cloud, LLMs, Generative AI, AI Agentic and RAG Specialist | 6+ Years Industry & 8+ Teaching Experience | Google Cloud Certified | 500+ Students Mentored Worldwide(UK, USA, Australia, Canada, India, Germany, KSA, Dubai, Pakistan)

I am a Senior Data Engineer based in Riyadh, Saudi Arabia, working with one of the region’s largest telecom companies. Over the last 6+ years, I have delivered impactful data and AI solutions for STC, HBL, and Easypaisa, while mentoring students and professionals globally in Python, SQL, Data Engineering, Cloud and AI technologies( LLMs, Generative AI, AI Agentic and RAG Specialist).

My Expertise Includes:

Programming & Data Engineering: Python, Shell Scripting/Documentation, SQL, Scala, PySpark, Hadoop, Kafka, ETL Pipelines, Data Warehousing, APIs(Django)
Cloud & Big Data: Google Cloud (GCP), AWS, Databricks, Spark, HDFS
Artificial Intelligence: LLMs, Generative AI, RAG, AgenticAI, Prompt Engineering, AI Chatbot Development
Analytics & Visualization: Power BI, Google Analytics, Firebase
Automation & Web Scraping: Selenium, BeautifulSoup, API Integration
Career Preparation: Interview training, HackerRank prep, portfolio projects

I offer:

Tutoring (Beginner to Advanced)
Assignments & Projects (Minor/Major/Dissertation)
Exam Preparation (University, Technical, and Interview Readiness)
Industry-based Practical Training
HackerRank & Coding Test Support

Whether you want to master AI(LLMs, Generative AI, AI Agentic and RAG Specialist).), build data pipelines, or prepare for a career in data and cloud technologies and if you are a student aiming for top grades or a professional upskilling for your next big opportunity, I will help you learn, apply, and excel with the confidence to work on global scale projects.

Let’s start your journey to becoming a future ready AI & Data professional!

Subjects

  • Python Beginner-Expert

  • PySpark Beginner-Expert

  • Database Fundamentals Beginner-Expert

  • ETL SQL Beginner-Expert

  • Google Cloud Platform Beginner-Expert

  • Databricks PySpark Beginner-Expert

  • Data engineer Beginner-Expert

  • Advanced Database Management System Beginner-Expert

  • Dimensional Data Warehouse Beginner-Expert

  • Shell scripting Beginner-Expert

  • Project Documentation Beginner-Expert

  • SQL and Data Warehousing Beginner-Expert

  • AWS and strong Python scripting Beginner-Expert

  • Assignment correction Beginner-Expert

  • GitHub Beginner-Expert

  • Backend Development (Django) Beginner-Expert

  • AWS ETL Beginner-Expert

  • AI and Data Analytics Beginner-Expert

  • ETL Data warehouse Beginner-Expert

  • Large Language Models LLMs Beginner-Expert


Experience

  • Senior Data Engineer (Sep, 2024Present) at Suadi Telecom Company, Riyadh, Saudia Arabia
    • Worked with STC to build a single Data Access layer for the business and the analytical user. To provide cross platform integration and ease of data insights without data movement.
    • Designed and implemented ETL pipelines for the Customer Experience Management (CEM) project, enabling efficient data extraction from Kafka topics and loading into HDFS, resulting in a 40% improvement in data ingestion and processing efficiency.
    • Architected and managed the ETL structure for STC’s Corporate Customer Experience (CCEx) department, ensuring seamless 24/7 data ingestion into staging directories and automated processing into the semantic layer for next-day reporting.
    • Developed and configured Trino to access and query both databases, including Teradata and HDFS. Managed & created access polices for Trino users on Ranger reducing response time for ad-hoc reporting requests by 70%.
    • Built monitoring scripts to oversee operational activities, improving system reliability and reducing manual intervention by 60%.
    • Created an ETL control system to manage failures in real-time data streams, addressing/resolving issues like Kafka checkpoint unavailability, HDFS downtime, and container errors. The system ensures 100% data integrity, prevents data loss, and provides real-time alerts for proactive monitoring.
    • Processed and normalized raw data into business-ready facts, enabling the creation of downstream tables for semantic reporting and reducing manual workloads by 50-60%.
    • Automated daily fact ETL pipelines using crontab to process and load data into the semantic layer, ensuring timely reporting for business teams and 100% SLA compliance.
    • Developed CLI automation tool to streamline external table creation on HDFS. This tool accepts inputs such as file path, table name, and database, then automatically generates the required SQL, executes it and repairs the table.
    • Developed Incremental Hive migration (35 reports) on Cloudera using shell scripting with dynamic SQL, partition checks, retry logic, logging, and dependency handling for data retention.
  • Manager Data Engineer (Feb, 2024Aug, 2024) at Habib Bank Limited, Islamabad
    • Designed and implemented dimensional data models (conceptual → logical → physical) to support internal audit and compliance reporting; deployed these models as new aggregated tables on a separate analytics cluster for Power BI self-service dashboards.
    • Queried enterprise Data Lake (Cloudera Hadoop-based) using Impala via Hue, and connected via ODBC in Python to automate data extraction, transformation, and loading into the reporting layer.
    • Automated sampling and audit workflows using Python scripts and SQL, reducing manual processing time from days to hours and improving audit team efficiency by 90%.
    • Built and maintained daily ETL jobs to populate business-aligned tables with high accuracy and timeliness, enabling delivery of 10+ key audit projects and KRIs under tight SLAs.
    • Authored comprehensive business metadata documentation for all created models, ensuring field-level traceability, data definitions, and alignment with internal audit governance.
    • Supported stakeholder reporting requirements with ad-hoc queries and Power BI visuals, aligned with compliance, risk, and audit needs.
    • oped Incremental Hive migration (35 reports) on Cloudera using shell scripting with dynamic SQL, partition checks, retry logic, logging, and dependency handling for data retention.
    Business or Sector: Banking| Department: Internal Audit | Website: https://www.hbl.com/
    Address: HBL Tower Jinnah Avenue, 44000, Islamabad, Pakistan
  • Data Development Engineer (Aug, 2022Feb, 2024) at Easypaisa Digital Bank Islamabad
    Built and managed end-to-end pipelines on a Big Data platform Hadoop ecosystem (HDFS, YARN & Spark for resource management), processing of structured and semi-structured data from sources in (JSON, Parquet, CSV, and MySQL).
    • Ingested data from various external tools and platforms, including:
    - Firebase & Appsflyer logs via GCP BigQuery & Google Cloud Storage
    - Internal & partner sources via SFTP/FTP/SSH (e.g., Ericsson logs, 1Link, Raast)
    - On-premise application databases via direct pulls (MySQL/Oracle)
    • Used PySpark and Shell scripts to extract data from GCP (Firebase, Appsflyer), chunk and compress it, and load into partitioned Linux paths for ingestion into the on-prem Hadoop-based EDW via DAAS (DataGo, DeepInsight, LDAP).
    • Developed reusable ETL control frameworks in Shell and Python for file validation, flag checking, and automated alerting — ensuring downstream DAAS job success.
    • Designed and orchestrated multi-layered pipelines using DataGo, the data integration platform within DAAS:
    - ODS Layer: Raw data from Linux servers/filesystems
    - DWD Layer: Cleansed and dimensionally linked datasets
    - ADM Layer: Aggregated, business-ready datasets used for ad-hoc reports and dashboarding
    • Scheduled data flows using Crontab and DAAS job scheduling, implementing dependency chaining (e.g., job runs only after flag file appears or upstream job succeeds).
    • Built reporting pipelines from DAAS to dedicated reporting servers, allowing business teams to fetch daily dashboards and custom extracts.
    • Administered user access to schema, table, and column levels via DAAS LDAP service, managing L1/L2 roles and ensuring secure platform access.
    • Worked closely with stakeholders and Ops teams to onboard new data sources, define DDLs, monitor data consistency, and troubleshoot platform issues.
    • Performed RCA on failures, tuned Spark jobs, and optimized DAAS pipelines for high-volume processing.
  • Data Engineer/Python developer (Jun, 2020Jul, 2022) at Taleemabad, Islamabad
    • Taleemabad was never hired before data engineer. Designed & executed ETL pipelines using Python & SQL, managing AWS-hosted data sources (MongoDB, PostgreSQL, GCP BigQuery, Google Analytics logs.
    • Collaborated closely with analysts to streamline data transformation & modeling for diverse sources, and enhanced data accuracy and reliability.
    • Automated reconciliation, data validation, & ETL pipelines via Crontab on AWS EC2, reducing manual intervention by 70% and improving operational efficiency by 90%.
    • Configured Google Analytics (LMS events log data) data flow into Google Cloud Platform (GCP) Bigquery, enabling data transformation from pivoted to unpivoted or nested formats for storage.
    • Defined the data governance practices, comprising access controls, and archival or deletion data on GCP.
    • Managed Google BigQuery warehouse & provided timely ad-hoc reports for urgent needs.

Education

  • Database SQL Certification (Jun, 2021Jun, 2021) from Oracle Certified Professional (SQL Developer)
  • Python Certification (Apr, 2021May, 2021) from Microsoft certified Professional(in python programming)
  • BS Computer Science (Oct, 2017Jun, 2021) from University of Engineering and Technology, Taxila

Fee details

    Rs4,50015,000/hour (US$16.1653.85/hour)

    Vary on learning level I will charge according to task or learning level of student/Project Manager


9 Reviews
5 out of 5

User Photo April 8, 2026
Payment verified US$ 18 (1000 Coins)

Great Mentor for learning Data engineering from scratch

I am currently a Power BI developer and recently started learning data engineering from scratch.
Khaqan is a well organized and professional mentor with strong expertise in Data Engineering and well teaching experience Python & SQL, and his teaching reflects 25+ years of experience in the data domain. He explains technical topics using real world scenarios, which makes complex concepts easy to understand.

I feel proud to have found such a knowledgeable mentor and highly recommend him for data engineering learning.


User Photo March 25, 2026
Payment verified US$ 18 (1000 Coins)

Python knowledge expert

I strongly recommend Khaqan Azam as an AI & Data Engineering mentor. I come from a Payroll Specialist and MBA background and currently work for a UK based company and I am transitioning into AI & Data Engineering with his guidance. What makes him stand out it is his ability to explain complex concepts in a very clear and structured way, making it easy to understand.


User Photo March 12, 2026
Payment verified US$ 22 (1100 Coins)

Data Engineering Mentor – Clear Concepts and Practical Learning

I highly recommend Khaqan Azam as a Data Engineering mentor. I come from a Telecommunications and Networking background and currently work for a New Zealand based company and I am transitioning into Data Engineering with his guidance.

Khaqan has strong expertise in Python, SQL, PySpark, Databases, Big Data Engineering, RAG and LLM technologies. What makes him stand out is his ability to explain complex concepts in a very clear and structured way. What impressed me the most is his ability to start from the absolute fundamentals. Instead of jumping directly into tools, he first helped me understand the core concepts of data, such as what data is, how it is generated, how it is stored in real systems, and how modern data pipelines work in real-world environments. This strong conceptual foundation makes learning advanced topics much easier.

His teaching approach combines solid theoretical understanding with practical, real-world examples and hands on exercises, which helps in understanding how things actually work in the industry. The sessions are highly interactive, and his communication skills make learning technical concepts much easier.

If you are looking for someone with strong industry knowledge, excellent teaching skills and deep expertise in modern data engineering technologies, I strongly recommend learning from Khaqan Azam.


User Photo March 20, 2024
Payment verified US$ 1 (50 Coins)

Highly Recommended Teacher

Khaqan Azam is a awesome teacher and has clear communication through out my SQL classes. I am new to Sequel or SQL language and have no knowledge on this language. He has given me the confidence and showed me how easy it is to navigate through this language. He was referred to me via my family as a great educator and that was definiteley great recommendation.


User Photo March 20, 2024
Payment verified US$ 1 (50 Coins)

Best python and sql tutor

Thank you to Khaqan for helping me to excel from 0 knowledge in python and sql to being very confident in my day to day work as a data engineer, only within 3 months. He has lots of knowledge in coding languages, databases and ETL processes. His teaching is very clear to understand, he can communicate both in English, Urdu and Hindi as needed. He is responsive and available 24/7 I can ask him any question he will immediately reply. Thanks again Khaqan for your invaluable support and guidance! Looking forward to continuing to learn from you.


User Photo March 9, 2024
Payment verified US$ 1 (50 Coins)

Simply the best

I was a complete Fresher but Azam Bhai took me under his wing and trained me into becoming a fierce professional. He taught me everything from scratch. All the way from SQL, Python, Data Warehousing, ETL and Databases and now most recently in downstream tools like Power BI. He is always there for me on WhatsApp and responds to my questions immediately even though he is in Pakistan and I am in the U.S. If and when he is not available he creates artifacts/documents for me that give me step by step instructions on how to resolve the issue. I am so lucky to call him my mentor, friend and brother. You can hire him for your work without any hesitation. I am now earning six figures and my career has taken an upward swing. Long live Azam Bhai!


User Photo June 25, 2022
Payment verified US$ 1 (50 Coins)

depth knowledge in python, etl

khaqan has depth knowledge in python,etl and sql . he can understand our requirements and he can explain topics according to our requirements.


User Photo June 24, 2022
Payment verified US$ 1 (50 Coins)

best python teacher

wish you good luck and a successful journey in your teaching work, you have great expertise in Python and database. secondly, I appreciate your teaching style you are the perfect teacher.Lastly you are great with time management


User Photo June 23, 2022
Payment verified US$ 1 (50 Coins)

Quick-turn-around, Python Programming, and Communication

Azam has strong knowledge on python and its libraries. He got my task done in a fast manner. He organizes code well. Will come back for future tasks.

He also tutor’s python from beginner to pro