Suraj Jaiswal
Data Engineer, Spark, PySpark, Python, SQL
No reviews yet

I am a Data Engineer with 4.5+ years of industry experience, currently working at Adidas and previously at Wipro. I have built and managed production-grade data pipelines, worked with large datasets, and optimized end-to-end data workflows used by real businesses.

As a teacher, I focus on helping students truly understand data engineering and modern AI, not just memorize tools. I teach core concepts like Python, SQL, Spark, PySpark, ETL pipelines, Databricks, Airflow, and cloud-based data systems, and then connect these foundations to real industry use cases.

In addition to data engineering, I actively work with modern AI systems, including:
- AI Agents for workflow automation and decision support
- RAG (Retrieval-Augmented Generation) for combining LLMs with structured data
- MCP-based architectures to build reliable, modular AI solutions
This allows me to guide students toward future-ready Data + AI roles.

My teaching approach:
- Explain concepts from basics to production level
- Use practical examples and real project scenarios
- Focus on problem-solving, clarity, and strong fundamentals
- Share insights on career paths, interviews, and industry expectations

Whether you are a beginner, student, or working professional, my goal is to help you build clear understanding, confidence, and job-relevant skills through structured and practical learning.

Subjects

  • SQL Beginner-Expert

  • PySpark Beginner-Expert

  • Python and AI Beginner-Expert

  • AI Agents & Workflows Beginner-Expert

  • Guidance for career transformation to data engineering Beginner-Expert


Experience

  • Data Engineer (Jul, 2024 - Present) at Adidas
    ● Data Pipelines with Databricks: Using Databricks as the primary engine to orchestrate and manage data pipelines, transforming and processing large datasets read from AWS S3, ensuring data quality and efficiency.
    ● Utilized historical sales and operational data to support forecasting decentralization, enabling more accurate, data-driven demand predictions across regions and product segments.
    ● Developing PySpark UDFs: Creating and optimizing User-Defined Functions (UDFs) in PySpark to enhance data transformation and processing tasks, improving performance and enabling complex data operations.
    ● Implemented Advanced Data Transformations: Leveraged PySpark and Pandas for sophisticated data manipulation and transformation, contributing to the streamlined preparation and analysis of datasets for actionable insights.
    ● Collaborating on Sales Forecasting Project: Actively contributing to the SFD (Sales Forecast Decentralization) project by working closely with the data science team, using historical sales data to develop predictive models for future sales trends.
    ● Optimized Data Processing Workflows: Improved data processing workflows through efficient Spark configurations and optimizations, resulting in reduced processing times and cost savings for the data engineering team.
  • Data Engineer (Aug, 2021 - Jul, 2024) at Wipro, Gurugram
    ● Extraction, Transformation, and Loading (ETL) of customer network data to prepare Key Performance Indicators (KPIs).
    ● Cell metrics anomaly detection using scripts written in Spark and PySpark.
    ● Preparing data as per the custom requirement of the business.
    ● Subscriber grid-level and cell-level mapping with IMSI enrichment of Groundhog data.
    ● Scheduling and monitoring of jobs/workflows using Apache Airflow.
    ● Transfer of data from one location to another and from server to server using Apache NiFi.
    ● Data visualization, interpretation, and understanding using Grafana.
    ● Batch, stream, interactive, and graph processing, with jobs monitored through the YARN UI.

Education

  • B. Tech in Computer Science and Engineering (Jul, 2017 - Jun, 2021) from IMS ENGINEERING COLLEGE, GHAZIABAD, scored 8 CGPA

Fee details

    ₹600 - 1,000/hour (US$6.32 - 10.53/hour)

    Fee Structure:
    - Recurring classes: ₹600 per hour
    - One-time / ad-hoc sessions: ₹1000 per hour
    Fees may vary based on session frequency and requirements.

