Novo Nordisk
Senior Data Engineer – Data Management and Informatics
Job Description
About the Department
Novo Nordisk Data Management and Informatics, within the Digital Science and Innovation Organization, provides informatics solutions, data products and analysis support to the research organization in Novo Nordisk. Data Management and Informatics has established a data products organization across our research sites. Staff are co-located at one of our global sites: Seattle, WA; Fremont, CA; Lexington, MA; Oxford, UK; and Denmark.
The Position
As a Sr Data Engineer, you will be an individual contributor with expertise in the various types of data used for target selection and validation. These data come from numerous sources, including internal data generators and external data providers. Ultimately, you will set the direction and deliver analysis-ready data products via appropriately architected data pipelines to solve complex scientific problems. This role focuses on providing findable, accessible, interoperable, and reusable (FAIR) data within our therapeutic areas, which include diabetes, obesity, cardiovascular and rare diseases. You will support both data generators, such as laboratory scientists, and data consumers, including computational scientists, data scientists, and bots. You will use your knowledge of digital technologies to speed up the ability of lab and data scientists to access and work with high-dimensional data.
You will use hands-on expertise to provide solutions using preferred tooling and technologies, and will work with other data engineers, product owners and specialists across Data Management & Informatics (DMI) and Global IT to address larger needs. You will also support data visualization needs, be well versed in the best practices required to streamline data handling, and represent the local needs of the sites within the global Data Management and Informatics organization.
Do you believe that the digitalization journey in Research and Early Development (R&ED) is crucial for the success of pharmaceutical companies in the future? Then apply to become part of the next wave of scientific discovery by joining Digital Science & Innovation (DSI).
Relationships
This position will report to the Senior Director, Data and Analytics Engineering, Targets and Translational. The role will serve as a key Data and Solution Engineer for our strategic initiatives on the digitalization of R&ED, with a special focus on the development of our data products.
Essential Functions
- Develop, implement, and maintain data models.
- Establish data pipelines from raw data sources to cloud service publication.
- Identify and establish storage solutions for historical data and align with existing data models.
- Understand what is in the data to design more efficient data generation methods in the future.
- Gather/organize large, complex data sets and develop transformations to move data through the processing pipeline. This will involve profiling, cleansing, transforming, and developing data structures, schemas, and dictionaries to create more efficient workflows.
- Build automated monitoring mechanisms to ensure compliance and integrity of the pipelines and database.
- Be proficient in DevOps concepts, including continuous integration and continuous delivery.
- Ensure scientists and data scientists are aware of available data and can access, integrate and query it in a performant manner.
- Enable streamlined sharing of rich data with collaborators through cloud, accelerated compute and AI/ML approaches.
- Assist with provisioning of compute and data pipelines to deliver performant data products via the research and enterprise data ecosystem.
- Optimize workflows and exchange of research data.
- Participate in sustaining a suite of tools and applications such as Python, R, JupyterHub, Domino, and DataLab.
Physical Requirements
Up to 10% overnight travel required.
Qualifications
- Bachelor’s degree required; Master’s degree or PhD highly preferred. Degree in Life Sciences, Biomedical Engineering, Physics, Statistics, or Computer Engineering preferred
- Master’s degree with 3+ years’ relevant experience, or PhD with little to no postdoctoral experience; a Bachelor’s degree with 5+ years’ relevant experience can also be considered
- Relevant experience includes:
  - Experience in building and productionizing data pipelines with ETL.
  - Experience with database platforms and cloud services (AWS/Azure).
  - Experience with a compiled or scripting programming language, such as Python.
  - Experience working independently or with occasional guidance from manager/senior colleagues.
  - Experience communicating with bench scientists and/or other end-users and with middle management.
- Preferred experience:
  - 2+ years’ experience in the life sciences, medical device, or pharmaceutical industry
  - Automated testing skills
  - Domain knowledge in omics and/or high throughput cell assays
We commit to an inclusive recruitment process and equality of opportunity for all our job applicants.
At Novo Nordisk we recognize that it is no longer good enough to aspire to be the best company in the world. We need to aspire to be the best company for the world and we know that this is only possible with talented employees with diverse perspectives, backgrounds and cultures. We are therefore committed to creating an inclusive culture that celebrates the diversity of our employees, the patients we serve and communities we operate in. Together, we’re life changing.
Novo Nordisk is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, gender identity, sexual orientation, national origin, disability, protected veteran status or any other characteristic protected by local, state or federal laws, rules or regulations.
If you are interested in applying to Novo Nordisk and need special assistance or an accommodation to apply, please call us at 1-855-411-5290. This contact is for accommodation requests only and cannot be used to inquire about the status of applications.