IN-Sr Associate_Databricks Engineer _D&A _Advisory _Mumbai
Line of Service
Advisory
Industry/Sector
FS X-Sector
Specialism
Data, Analytics & AI
Management Level
Senior Associate
Job Description & Summary
At PwC, our people in data and analytics focus on leveraging data to drive insights and make informed business decisions. They utilise advanced analytics techniques to help clients optimise their operations and achieve their strategic goals.
In business intelligence at PwC, you will focus on leveraging data and analytics to provide strategic insights and drive informed decision-making for clients. You will develop and implement innovative solutions to optimise business performance and enhance competitive advantage.
Responsibilities:
-Minimum 5 years of professional experience in a Data and Analytics role with a global organization
-5 to 8 years of experience in working with Databricks tech stacks
-Experience leading the development of Data and Analytics products, from the requirement-gathering stage through to driving user adoption
-Develop and optimize ETL processes using Databricks and related tools like Apache Spark
-Design efficient data processing systems and pipelines using Databricks, APIs, and other cloud services
-Strong data transformation experience with Unity Catalog, Delta tables, and DLT (Delta Live Tables)
-Strong proficiency in writing and optimizing SQL queries and working with databases
-Ability to acquire specialized domain knowledge required to be more effective in all work activities
-BI & Data-warehousing concepts are a must.
-Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on Databricks.
-Ingest and transform data from multiple structured and unstructured sources including cloud storage (Azure Data Lake, AWS S3, etc.).
-Optimize Spark jobs for performance and cost-efficiency on the Databricks platform.
-Collaborate with data scientists, analysts, and stakeholders to understand data requirements and deliver high-quality solutions.
-Implement best practices in data engineering, including modular coding, unit testing, and version control (e.g., Git).
-Automate data workflows and schedule jobs using Databricks Workflows or external orchestration tools (e.g., Airflow, Azure Data Factory).
-Ensure data quality, integrity, and governance in all data pipelines.
-Participate in code reviews, performance tuning, and system monitoring.
-Document solutions, processes, and configurations.
Mandatory skill sets:
PySpark, Databricks
Preferred skill sets:
Databricks, Apache Spark
Years of experience required:
7 – 10 yrs
Education qualification:
B.Tech/MBA/MCA
Education (if blank, degree and/or field of study not specified)
Degrees/Field of Study required: Bachelor of Engineering, MBA (Master of Business Administration)
Degrees/Field of Study preferred:
Certifications (if blank, certifications not specified)
Required Skills
GCP Dataflow
Optional Skills
Accepting Feedback, Active Listening, Analytical Thinking, Applied Macroeconomics, Business Case Development, Business Data Analytics, Business Intelligence and Reporting Tools (BIRT), Business Intelligence Development Studio, Communication, Competitive Advantage, Continuous Process Improvement, Creativity, Data Analysis and Interpretation, Data Architecture Development, Database Management System (DBMS), Data Collection, Data Pipeline, Data Quality, Data Science, Data Visualization, Embracing Change, Emotional Regulation, Empathy, Geopolitical Forecasting {+ 24 more}
Desired Languages (If blank, desired languages not specified)
Travel Requirements
Available for Work Visa Sponsorship?
Government Clearance Required?
Job Posting End Date
April 12, 2026