Data Engineer

Commvault · Bangalore, India

Recruitment Fraud Alert

We’ve learned that scammers are impersonating Commvault team members—including HR and leadership—via email or text. These bad actors may conduct fake interviews and ask for personal information, such as your social security number.

What to know:

Commvault does not conduct interviews by email or text.
We will never ask you to submit sensitive documents (including banking information, SSN, etc) before your first day.

If you suspect a recruiting scam, please contact us at [email protected]

About Commvault

Commvault (NASDAQ: CVLT) is the gold standard in cyber resilience. The company empowers customers to uncover, take action, and rapidly recover from cyberattacks – keeping data safe and businesses resilient. The company’s unique AI-powered platform combines best-in-class data protection, exceptional data security, advanced data intelligence, and lightning-fast recovery across any workload or cloud at the lowest TCO. For over 25 years, more than 100,000 organizations and a vast partner ecosystem have relied on Commvault to reduce risks, improve governance, and do more with data.

Data Engineer

The Opportunity:

The Data Engineer is a critical member of the Business Intelligence and Analytics organization, responsible for architecting, building, and scaling enterprise data platforms that power advanced analytics and decision-making. This role focuses on designing and operationalizing robust, cloud-native data pipelines and data stores, ensuring reliable, high-quality, and governed data is available across the enterprise.

The Data Engineer partners closely with analytics, data science, and business teams to transform raw data into trusted, consumable assets. By leveraging modern data technologies and best practices, this role drives data accessibility, performance optimization, and scalability while enforcing strong data governance and quality standards.

What you’ll do…

Data Engineering & Pipeline Development

Design, build, and maintain scalable, reliable ELT/ETL pipelines across multiple data sources
Develop data ingestion frameworks for batch and near real-time data processing
Ensure data integrity through validation, monitoring, and error handling mechanisms
Optimize pipelines for performance, scalability, and cost efficiency

Data Modeling & Architecture

Design and implement logical and physical data models for analytics and reporting
Build and maintain data warehouses, data marts, and Lakehouse structures
Apply best practices in schema design (e.g., star/snowflake models)
Support enterprise data architecture initiatives and standards

Cloud Platform & Tools

Develop and optimize solutions using Azure Synapse, Azure Databricks, SQL Server, and related cloud services
Support and enhance data platform scalability, reliability, and performance
Leverage distributed processing frameworks (e.g., Spark) for large-scale data transformation

Data Quality, Governance & Reliability

Implement data quality checks, monitoring, and alerting for critical data assets
Ensure alignment with enterprise data governance, metadata, and security standards
Maintain data lineage and support auditability and compliance requirements
Uphold data accuracy, consistency, and availability SLAs

Analytics & BI Enablement

Deliver curated, modeled datasets to support BI tools such as Power BI
Partner with analytics teams to enable dashboards, reporting, and self-service analytics
Ensure data structures are optimized for performance and usability in reporting environments

Performance Optimization

Tune SQL queries, data models, and pipelines for optimal performance
Identify and resolve performance bottlenecks across data systems
Recommend improvements to data infrastructure and processing workflows

Cross-Functional Collaboration

Collaborate with business areas and BSA teams to align data requirements
Translate business needs into scalable and maintainable data solutions
Support project planning, requirements gathering, and solution design
Identify opportunities for process improvement and data-driven innovation

Engineering Best Practices

Follow best practices for version control, testing, and deployment of data pipelines
Contribute to CI/CD processes for data engineering workflows
Document data flows, models, and processes for maintainability and knowledge sharing

AI / ML & Advanced Analytics Enablement

Awareness of emerging AI/GenAI capabilities and their data requirements
Familiarity of integration patterns for Model Context Protocol (MCP) or similar tool-based interfaces, enabling AI agents and copilots to interact with enterprise data
Support integration of AI/ML solutions by enabling reliable data flows between enterprise data platforms and downstream applications
Support data requirements for model explainability, lineage, and governance
Exposure to tools such as Python (Pandas, NumPy), MLflow, or similar
Experience working with large-scale datasets for predictive or AI-driven applications

Who you are?

2-4+ years of experience in data engineering, BI/analytics engineering, or data architecture

Strong proficiency in SQL, data modeling, and performance tuning, aligning with the original expectations for SQL expertise.

Experience building and maintaining ETL/ELT pipelines
Hands-on experience with cloud data platforms (Azure preferred, including Synapse and Databricks)
Knowledge of data warehousing, data modeling, and database design
Experience with distributed data processing (e.g., Spark)
Experience with Power BI or similar BI tools

Education

Bachelor’s degree in Computer Science, Engineering, Information Systems, Data Science, or a related field
Master’s degree in a relevant discipline is a plus

Certifications (Preferred)

Microsoft Azure Data Engineer Associate or similar cloud certification
Databricks or Spark-related certifications
Relevant certifications in data engineering, cloud platforms, or analytics

You’ll love working here because...

Continuous professional development, product training, and career pathing
Annual health check-ups, Tuition Reimbursement
An inclusive company culture, an opportunity to join our Community Guilds
Personal accident and Term life coverage

#LI-VK

Commvault is an equal opportunity workplace and is an affirmative action employer. We are always committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status and we will not discriminate against on the basis of such characteristics or any other status protected by the laws or regulations in the locations where we work.

Commvault’s goal is to make interviewing inclusive and accessible to all candidates and employees. If you have a disability or special need that requires accommodation to participate in the interview process or apply for a position at Commvault, please email [email protected] For any inquiries not related to an accommodation please reach out to [email protected].

Commvault's Privacy Policy

Data & ML pay context

Based on 1,495 disclosed Data & ML salaries on RoleSuite, the role pays a median of $162K/year, with most offers between $127K and $203K (10th–90th percentile: $106K–$245K).

See the full Data & ML salary breakdown →

Apply →

Data Engineer

Data & ML pay context

Other roles at Commvault

More Data & ML roles