IN_Manager_ Azure DE__Data and Analytics_Advisory_Kolkata

47 Minutes ago • 4-7 Years

Data Analysis

Job Description

At PwC, data and analytics engineers design and develop robust data solutions, transforming raw data into actionable insights. This role focuses on building and optimizing scalable ETL processes using PySpark, Scala, or SparkSQL for large datasets. Responsibilities include implementing cloud-based data ingestion, processing, and storage solutions on Azure (Databricks, Data Lake Storage, Synapse Analytics), developing data models, monitoring pipeline performance, and ensuring data security and compliance.

Good To Have:

Exposure to big data frameworks and tools like Hadoop, Hive, and HBase.
Amazon Web Services (AWS)
Apache Airflow
Azure Data Factory
Data Anonymization
Data Architecture
Database Administration
Database Management System (DBMS)
Database Optimization
Database Security Best Practices
Databricks Unified Data Analytics Platform
Data Engineering Platforms
Data Infrastructure
Data Integration
Data Lake
Data Modeling

Must Have:

Build and optimize scalable ETL processes using PySpark, SQL, and Apache Spark.
Implement data ingestion, processing, and storage solutions on Azure (Databricks, Data Lake Storage, Synapse Analytics, SQL Database).
Proficiency in Python, PySpark, and SQL.
Experience with software development practices, version control, and CI/CD pipelines.
Familiarity with data warehousing concepts, dimensional modeling, and relational databases.
Monitor and optimize data pipeline performance for scalability, reliability, and cost-efficiency.
Enforce data security protocols and compliance measures.

Perks:

Inclusive benefits
Flexibility programmes
Mentorship
Support for wellbeing

Add these skills to join the top 1% applicants for this job

team-management

data-analytics

game-texts

agile-development

mysql

postgresql

html

hbase

aws

azure

hadoop

spark

amazon-web-services

microsoft-azure

ci-cd

python

scala

sql

At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth.

In data engineering at PwC, you will focus on designing and building data infrastructure and systems to enable efficient data processing and analysis. You will be responsible for developing and implementing data pipelines, data integration, and data transformation solutions.

Why PWC

At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for

our clients and communities. This purpose-led and values-driven work, powered by technology in an environment that drives innovation, will enable you to make a tangible impact in the real world. We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life. Together, we grow, learn, care, collaborate, and create a future of infinite experiences for

our each other. Learn more about us

At PwC, we believe in providing equal employment opportunities, without any discrimination on the grounds of gender, ethnic background, age, disability, marital status, sexual orientation, pregnancy, gender identity or expression, religion or other beliefs, perceived differences and status protected by law. We strive to create an environment where each one of our people can bring their true selves and contribute to their personal growth and the firm’s growth. To enable this, we have zero tolerance for any discrimination and harassment based on the above considerations.

Job Description & Summary: -

Design and Develop Data Pipelines: Build and optimize scalable ETL processes using PySpark or Scala or SparkSQL to handle large volumes of structured and unstructured data from diverse sources. -Cloud-Based Data Solutions: Implement data ingestion, processing, and storage solutions on the Azure cloud platform, utilizing services such as Azure Databricks, Azure Data Lake Storage, and Azure Synapse Analytics. -Data Modeling and Management: Develop and maintain data models, schemas, and metadata to ensure efficient data access, high query performance, and support for analytics requirements. -Pipeline Monitoring and Optimization: Monitor the performance of data pipelines, troubleshoot issues, and enhance workflows for scalability, reliability, and cost-efficiency. -Security and Compliance: Enforce data security protocols and compliance measures to safeguard sensitive information and meet regulatory standards.

Responsibilities:

-Experience: Proven track record as a Data Engineer with expertise in building and optimizing data pipelines using PySpark, SQL, and Apache Spark. -Cloud Proficiency: Hands-on experience with Azure cloud services, including Azure Databricks, Azure Data Lake Storage, Azure Synapse Analytics, and Azure SQL Database. -Programming Skills: Strong proficiency in Python ,PySpark, and SQL with experience in software development practices, version control systems, and CI/CD pipelines. -Data Warehousing Knowledge: Familiarity with data warehousing concepts, dimensional modeling, and relational databases such as SQL Server, PostgreSQL, and MySQL. -Big Data Technologies: Exposure to big data frameworks and tools like Hadoop, Hive, and HBase is a plus.

Mandatory skill sets:

-Hands-on experience with Azure cloud services, including Azure Databricks, Azure Data Lake Storage, Azure Synapse Analytics, and Azure SQL Database. -Monitor the performance of data pipelines, troubleshoot issues, and enhance workflows for scalability, reliability, and cost-efficiency.

Preferred skill sets:

-Hands-on experience with Azure cloud services, including Azure Databricks, Azure Data Lake Storage, Azure Synapse Analytics, and Azure SQL Database.

Years of experience required:

-4-7 years experience req.

Education qualification:

-B.Tech / M.Tech (Computer Science, Mathematics & Scientific Computing etc.)

Education

Degrees/Field of Study required: Bachelor of Technology

Degrees/Field of Study preferred:

Certifications

Required Skills

Microsoft Azure Cloud Services, Microsoft Azure Databricks

Optional Skills

Accepting Feedback, Accepting Feedback, Active Listening, Agile Scalability, Amazon Web Services (AWS), Analytical Thinking, Apache Airflow, Apache Hadoop, Azure Data Factory, Coaching and Feedback, Communication, Creativity, Data Anonymization, Data Architecture, Database Administration, Database Management System (DBMS), Database Optimization, Database Security Best Practices, Databricks Unified Data Analytics Platform, Data Engineering, Data Engineering Platforms, Data Infrastructure, Data Integration, Data Lake, Data Modeling {+ 32 more}