Merck Group

Associate Data Steward / Data Engineer

30 May 2024
Deadline date:
£90000 - £158000 / year

Job Description

Work Your Magic with us! 

 

Ready to explore, break barriers, and discover more? We know you’ve got big plans – so do we! Our colleagues across the globe love innovating with science and technology to enrich people’s lives with our solutions in Healthcare, Life Science, and Electronics. Together, we dream big and are passionate about caring for our rich mix of people, customers, patients, and planet. That’s why we are always looking for curious minds that see themselves imagining the unimaginable with us.

 

Job Title: Data Engineer / Associate Data Steward

Job Location: Bangalore

 

In this role, you will be part of a growing, global team of data engineers who collaborate in a DevOps mode to equip Merck’s Life Science business with state-of-the-art technology, so that it can leverage data as an asset and make better-informed decisions.

 

The Life Science Data Engineering Team is responsible for designing, developing, testing, and supporting automated end-to-end data pipelines and applications on Life Science’s data management and analytics platform (Palantir Foundry, AWS and other components).

 

The Foundry platform comprises multiple different technology stacks, which are hosted on Amazon Web Services (AWS) infrastructure. Developing pipelines and applications on Foundry requires: 

 

  • Proficiency in Python and SQL
  • Proficiency in PySpark for distributed computation
  • Familiarity with common databases (e.g., Oracle, MySQL, SQL Server); not all types are required
  • Familiarity with any cloud infrastructure/tools with respect to data engineering and data visualization
  • Familiarity with HTML, CSS, TypeScript, and JavaScript, plus basic design/visual competency
  • Familiarity with Postgres and Elasticsearch

In this position, you may be required to work across multiple use cases and data analytics products, utilizing an agile project methodology.

 

Roles & Responsibilities: 

  • Data ingestion engineering work for new ingestions or enhancements, mainly supporting Data Stewards globally
  • Helping clients design and establish data pipelines inside and outside of Foundry
  • Stakeholder management by directly engaging with clients
  • Self-managing demands from capture through to delivery
  • Implement new processes like Self-Service data ingestion using SharePoint (document and evaluate connection and transformation logic, including code examples for loading external data into Foundry)
  • Managing data connections and transformations (e.g., API calls)
  • Participate in the end-to-end project lifecycle, from requirements analysis to go-live and operations of an application
  • Building solutions ranging from data pipelines in PySpark code to UI development using low-code and no-code application-building services in Foundry
  • Ensuring code quality and overall project quality as per defined maturity standards
  • Direct interaction with business stakeholders and product managers to clarify requirements.
  • Review code developed by other data engineers and check against platform-specific standards, coding and configuration standards.
  • Document technical work in a professional and transparent way. Create high quality technical documentation
  • Work out the best possible balance between technical feasibility and business requirements (the latter can be quite strict)
  • Develop data pipelines by ingesting various data sources – structured and unstructured – into Palantir Foundry
  • Besides working on projects, provide support for critical live applications; analyze and resolve complex incidents/problems.
  • Work closely with business users, product managers, solution architects, and data scientists/analysts.

 

Education 

  • B.Sc., BE (or higher) degree in Computer Science, Engineering, Mathematics, or related fields

Professional Experience  

  • 5+ years of experience in software engineering and application development
  • 3+ years of experience in data and analytics.

Skills

Hadoop General

Deep knowledge of big data, distributed file system concepts, MapReduce principles, and distributed computing. Knowledge of Spark and the differences between Spark and MapReduce. Familiarity with encryption and security in a Hadoop cluster.

Spark

Deep understanding of the Apache Spark framework and proficiency in building Spark pipelines.

Programming 

Must be proficient in data engineering tasks using Python and Spark.

XML/JSON knowledge

Experience working with REST APIs

SQL 

Must be an expert in manipulating database data using SQL. Familiarity with views, functions, stored procedures, and exception handling.

Application Development

Familiarity with HTML, CSS, and JavaScript and basic design/visual competency. Experience with any data visualization tool like Tableau is a plus.

AWS 

General knowledge of the AWS stack (EC2, S3, Glue, Lambda, Athena)

Git

Must be experienced in the use of source code control systems such as Git

ETL 

Experience developing ELT/ETL processes, including loading data from enterprise-sized RDBMS systems such as Oracle, DB2, MySQL, etc.

Authorization

Basic understanding of user authorization and authentication

IT Process Compliance

SDLC experience and formalized change controls

Working in DevOps teams, based on Agile principles (e.g., Scrum)

ITIL knowledge (especially incident, problem, and change management)

Languages 

Fluent English skills

 

Specific information related to the position:

  • Physical presence in primary work location (Bangalore)
  • Flexibility to work in CEST and US EST time zones (according to team rotation plan)
  • Willingness to travel to Germany, the US, and potentially other locations (as per project demand)

Behavioural Competencies:

  • Results-driven

We are an equal opportunity employer that values workforce diversity. We want everyone to be able to bring their best self to work every day, which is why equality and inclusion are at the forefront of all our activities. We are dedicated to a policy of non-discrimination in employment on any basis including race, caste, creed, color, religion, sex, age, disability, marital status, sexual orientation, and gender identity.

 

What we offer: We are curious minds that come from a broad range of backgrounds, perspectives, and life experiences. We celebrate all dimensions of diversity and believe that it drives excellence and innovation, strengthening our ability to lead in science and technology. We are committed to creating access and opportunities for all to develop and grow at their own pace. Join us in building a culture of inclusion and belonging that impacts millions and empowers everyone to work their magic and champion human progress!

Apply now and become a part of our diverse team!