About Us
Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories, dedicated to uplifting everyone, everywhere by being the best way to pay and be paid.
At Visa, you'll have the opportunity to create impact at scale — tackling meaningful challenges, growing your skills and seeing your contributions impact lives around the world.
Join Visa and do work that matters – to you, to your community, and to the world. Progress starts with you.
Job Description
Key Responsibilities:
- Big Data Pipeline Development: Design, implement, and optimize robust data pipelines using PySpark in Hadoop environments to extract, transform, and load data at scale from diverse sources
- Data Quality Management: Develop comprehensive data quality frameworks with automated validation checks, anomaly detection, and reconciliation processes to ensure data accuracy and integrity throughout the entire data lifecycle
- Analytics Automation: Create scalable scripts and workflows to automate complex reporting, visualization, and insights generation to drive business decision-making
- Cross-functional Collaboration: Partner with data scientists, business analysts, and stakeholders to translate business requirements into technical specifications and deliver data solutions that meet evolving needs
- Performance Optimization: Troubleshoot performance bottlenecks, implement best practices for distributed computing, and maintain high-availability data systems with proactive monitoring and maintenance
Visa requires at least 3 days in office, expectations of these days will be confirmed by your Hiring Manager.
Qualifications
Technical Proficiency:
- Strong programming skills in Python with expertise in PySpark and Spark SQL
- Advanced SQL knowledge for complex data manipulation and analytics
- Experience with Hadoop ecosystem components (HDFS, Hive, MapReduce, YARN)
- Familiarity with data orchestration tools (Airflow, Oozie, or similar)
Big Data Experience:
- Proven track record building and maintaining data pipelines processing TB-scale datasets
- Experience with distributed computing concepts and optimization techniques
- Knowledge of data modeling, ETL design patterns, and performance tuning
Additional Technical Skills:
- Version control systems (Git) and CI/CD practices
- Experience with cloud platforms (AWS/Azure/GCP) and their data services
- Understanding of data governance and security principles
Soft Skills:
- Strong problem-solving abilities and analytical thinking
- Excellent communication skills to explain complex technical concepts
- Self-motivated with ability to work independently and as part of a team
Education: Bachelor's or Master's degree in Computer Science, Data Engineering, Information Systems, or related technical field
Experience: 3-5 years of hands-on experience in data engineering with at least 2 years working specifically with PySpark and Hadoop technologies
Certifications: Professional certifications in relevant technologies (Cloudera, Databricks, cloud platform certifications) preferred but not required
Visa is an EEO Employer
Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.